A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources
Extracting information from the web data sources becomes very important because the massive and increasing amount of diverse semi-structured information sources in the Internet that are available to users, and the variety of web pages making the process of information extraction from web a challengi...
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
2010
|
| Subjects: | |
| Online Access: | http://psasir.upm.edu.my/id/eprint/12693/ |
| _version_ | 1848841904337190912 |
|---|---|
| author | Shaker, Mahmoud Ibrahim, Hamidah Mustapha, Aida Abdullah, Lili Nurliyana |
| author_facet | Shaker, Mahmoud Ibrahim, Hamidah Mustapha, Aida Abdullah, Lili Nurliyana |
| author_sort | Shaker, Mahmoud |
| building | UPM Institutional Repository |
| collection | Online Access |
| description | Extracting information from the web data sources becomes very important because the massive and increasing amount of diverse semi-structured information sources in the Internet that are available to users, and the variety of web pages making the process of information extraction from web a challenging problem. This paper proposes a framework for extracting, classifying, analyzing, and presenting semi-structured web data sources. The framework is able to extract relevant information from different web data sources, and classify the extracted information based on the standard classification scheme of Nokia products, which has been chosen as the case study. |
| first_indexed | 2025-11-15T07:50:39Z |
| format | Article |
| id | upm-12693 |
| institution | Universiti Putra Malaysia |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-15T07:50:39Z |
| publishDate | 2010 |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | upm-126932011-11-24T04:50:32Z http://psasir.upm.edu.my/id/eprint/12693/ A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources Shaker, Mahmoud Ibrahim, Hamidah Mustapha, Aida Abdullah, Lili Nurliyana Extracting information from the web data sources becomes very important because the massive and increasing amount of diverse semi-structured information sources in the Internet that are available to users, and the variety of web pages making the process of information extraction from web a challenging problem. This paper proposes a framework for extracting, classifying, analyzing, and presenting semi-structured web data sources. The framework is able to extract relevant information from different web data sources, and classify the extracted information based on the standard classification scheme of Nokia products, which has been chosen as the case study. 2010-11 Article PeerReviewed Shaker, Mahmoud and Ibrahim, Hamidah and Mustapha, Aida and Abdullah, Lili Nurliyana (2010) A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources. Journal of Next Generation Information Technology, 1 (3). pp. 106-114. ISSN 2092-8637 Information storage and retrieval systems. Natural language processing (Computer science) Interactive computer systems. English |
| spellingShingle | Information storage and retrieval systems. Natural language processing (Computer science) Interactive computer systems. Shaker, Mahmoud Ibrahim, Hamidah Mustapha, Aida Abdullah, Lili Nurliyana A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources |
| title | A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources |
| title_full | A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources |
| title_fullStr | A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources |
| title_full_unstemmed | A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources |
| title_short | A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources |
| title_sort | framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources |
| topic | Information storage and retrieval systems. Natural language processing (Computer science) Interactive computer systems. |
| url | http://psasir.upm.edu.my/id/eprint/12693/ |