Using regular expressions for mining data in large software repositories

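Using data mining techniques to collect data from software repositories involves extracting both basic and value-added information from those repositories. Regular expressions (regex) provide a mechanism for selecting specific strings from a set of character strings. In this paper, we discuss how regular expressions are used to create a data mining tool known as OSSGrab. We developed the mining tool using Python scripting in combination with regex, and as a result the time spent on data collection can be reduced significantly.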
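
As a rough illustration of the idea that a regular expression selects specific strings from a larger body of text, the sketch below pulls project records out of a small listing using Python's standard `re` module. It is not taken from OSSGrab: the sample text, the field layout (`Project`, `Downloads`, `Registered`), and the names `SAMPLE_PAGE`, `PROJECT_LINE`, and `extract_projects` are all assumptions made up for this example.

```python
import re

# Hypothetical sample text. The field layout is an assumption for
# illustration only and does not come from any real repository page.
SAMPLE_PAGE = """\
Project: alpha-tools | Downloads: 1,204 | Registered: 2009-03-17
Project: beta_lib | Downloads: 87 | Registered: 2011-11-02
This line has no project data and is skipped.
Project: gamma | Downloads: 15,300,021 | Registered: 2013-06-30
"""

# One pattern per record line. Named groups make the extracted fields
# self-describing; re.MULTILINE lets ^ and $ anchor at each line.
PROJECT_LINE = re.compile(
    r"^Project:\s*(?P<name>[\w-]+)\s*\|\s*"
    r"Downloads:\s*(?P<downloads>[\d,]+)\s*\|\s*"
    r"Registered:\s*(?P<registered>\d{4}-\d{2}-\d{2})\s*$",
    re.MULTILINE,
)


def extract_projects(text):
    """Return one dict per line of `text` that matches PROJECT_LINE."""
    records = []
    for match in PROJECT_LINE.finditer(text):
        records.append({
            "name": match.group("name"),
            # Strip thousands separators before converting to int.
            "downloads": int(match.group("downloads").replace(",", "")),
            "registered": match.group("registered"),
        })
    return records


if __name__ == "__main__":
    for record in extract_projects(SAMPLE_PAGE):
        print(record)
```

Lines that do not fit the pattern are simply ignored, which is what makes this style of extraction convenient for semi-structured pages; the trade-off is that the pattern has to be revised whenever the page layout changes.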

Bibliographic Details
Main Author: Awang Abu Bakar, Normi Sham
Format: Proceeding Paper
Language: English
Published: IEEE 2014
Subjects: T Technology (General)
Online Access: http://irep.iium.edu.my/42896/
http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf
http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf
_version_ 1848782349932691456
author Awang Abu Bakar, Normi Sham
building IIUM Repository
collection Online Access
description Using data mining techniques to collect data from software repositories involves extracting both basic and value-added information from those repositories. Regular expressions (regex) provide a mechanism for selecting specific strings from a set of character strings. In this paper, we discuss how regular expressions are used to create a data mining tool known as OSSGrab. We developed the mining tool using Python scripting in combination with regex, and as a result the time spent on data collection can be reduced significantly.
first_indexed 2025-11-14T16:04:03Z
format Proceeding Paper
id iium-42896
institution International Islamic University Malaysia
institution_category Local University
language English
last_indexed 2025-11-14T16:04:03Z
publishDate 2014
publisher IEEE
recordtype eprints
repository_type Digital Repository
spelling iium-42896 2017-09-20T01:07:10Z http://irep.iium.edu.my/42896/
Using regular expressions for mining data in large software repositories
Awang Abu Bakar, Normi Sham
T Technology (General)
Using data mining techniques to collect data from software repositories involves extracting both basic and value-added information from those repositories. Regular expressions (regex) provide a mechanism for selecting specific strings from a set of character strings. In this paper, we discuss how regular expressions are used to create a data mining tool known as OSSGrab. We developed the mining tool using Python scripting in combination with regex, and as a result the time spent on data collection can be reduced significantly.
IEEE 2014 Proceeding Paper PeerReviewed
application/pdf en http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf
application/pdf en http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf
Awang Abu Bakar, Normi Sham (2014) Using regular expressions for mining data in large software repositories. In: 2014 The 5th International Conference on Information and Communication Technology for The Muslim World (ICT4M), 17th-18th November 2014, Kuching, Sarawak, Malaysia.
http://ieeexplore.ieee.org/document/7020649/
10.1109/ICT4M.2014.7020649
spellingShingle T Technology (General)
Awang Abu Bakar, Normi Sham
Using regular expressions for mining data in large software repositories
title Using regular expressions for mining data in large software repositories
topic T Technology (General)
url http://irep.iium.edu.my/42896/