Using regular expressions for mining data in large software repositories
The usage of data mining technique in collecting data from software repositories involves the extraction of both basic and value-added information from existing software repositories. Regular Expressions (Regex) provide a mechanism to select specific strings from a set of character strings. In this...
| Main Author: | |
|---|---|
| Format: | Proceeding Paper |
| Language: | English English |
| Published: |
IEEE
2014
|
| Subjects: | |
| Online Access: | http://irep.iium.edu.my/42896/ http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf |
| _version_ | 1848782349932691456 |
|---|---|
| author | Awang Abu Bakar, Normi Sham |
| author_facet | Awang Abu Bakar, Normi Sham |
| author_sort | Awang Abu Bakar, Normi Sham |
| building | IIUM Repository |
| collection | Online Access |
| description | The usage of data mining technique in collecting data from software repositories involves the extraction of both basic and value-added information from existing software repositories. Regular Expressions (Regex) provide a mechanism to select specific strings from a set of character strings. In this paper, we discuss how regular expressions are used to create a data mining tool, known as OSSGrab. We developed the mining tool using Python scripting, in combination with Regex, and as a result, the time spent on data collection can be saved significantly. |
| first_indexed | 2025-11-14T16:04:03Z |
| format | Proceeding Paper |
| id | iium-42896 |
| institution | International Islamic University Malaysia |
| institution_category | Local University |
| language | English English |
| last_indexed | 2025-11-14T16:04:03Z |
| publishDate | 2014 |
| publisher | IEEE |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | iium-428962017-09-20T01:07:10Z http://irep.iium.edu.my/42896/ Using regular expressions for mining data in large software repositories Awang Abu Bakar, Normi Sham T Technology (General) The usage of data mining technique in collecting data from software repositories involves the extraction of both basic and value-added information from existing software repositories. Regular Expressions (Regex) provide a mechanism to select specific strings from a set of character strings. In this paper, we discuss how regular expressions are used to create a data mining tool, known as OSSGrab. We developed the mining tool using Python scripting, in combination with Regex, and as a result, the time spent on data collection can be saved significantly. IEEE 2014 Proceeding Paper PeerReviewed application/pdf en http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf application/pdf en http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf Awang Abu Bakar, Normi Sham (2014) Using regular expressions for mining data in large software repositories. In: 2014 The 5th International Conference on Information and Communication Technology for The Muslim World (ICT4M), 17th-18th November 2014, Kuching, Sarawak, Malaysia. http://ieeexplore.ieee.org/document/7020649/ 10.1109/ICT4M.2014.7020649 |
| spellingShingle | T Technology (General) Awang Abu Bakar, Normi Sham Using regular expressions for mining data in large software repositories |
| title | Using regular expressions for mining data in large software repositories |
| title_full | Using regular expressions for mining data in large software repositories |
| title_fullStr | Using regular expressions for mining data in large software repositories |
| title_full_unstemmed | Using regular expressions for mining data in large software repositories |
| title_short | Using regular expressions for mining data in large software repositories |
| title_sort | using regular expressions for mining data in large software repositories |
| topic | T Technology (General) |
| url | http://irep.iium.edu.my/42896/ http://irep.iium.edu.my/42896/ http://irep.iium.edu.my/42896/ http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf |