A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources

Extracting information from web data sources has become very important because of the massive and growing number of diverse semi-structured information sources on the Internet that are available to users, and the variety of web pages makes the process of information extraction from the web a challengi...

Bibliographic Details
Main Authors: Shaker, Mahmoud; Ibrahim, Hamidah; Mustapha, Aida; Abdullah, Lili Nurliyana
Format: Article
Language: English
Published: 2010
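Subjects: Information storage and retrieval systems; Natural language processing (Computer science); Interactive computer systems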
Online Access: http://psasir.upm.edu.my/id/eprint/12693/
_version_ 1848841904337190912
author Shaker, Mahmoud
Ibrahim, Hamidah
Mustapha, Aida
Abdullah, Lili Nurliyana
building UPM Institutional Repository
collection Online Access
description Extracting information from web data sources has become very important because of the massive and growing number of diverse semi-structured information sources on the Internet that are available to users, and the variety of web pages makes information extraction from the web a challenging problem. This paper proposes a framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources. The framework is able to extract relevant information from different web data sources and classify the extracted information based on the standard classification scheme of Nokia products, which has been chosen as the case study.
first_indexed 2025-11-15T07:50:39Z
format Article
id upm-12693
institution Universiti Putra Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T07:50:39Z
publishDate 2010
recordtype eprints
repository_type Digital Repository
spelling upm-12693
2011-11-24T04:50:32Z
http://psasir.upm.edu.my/id/eprint/12693/
2010-11
Article
PeerReviewed
Shaker, Mahmoud and Ibrahim, Hamidah and Mustapha, Aida and Abdullah, Lili Nurliyana (2010) A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources. Journal of Next Generation Information Technology, 1 (3). pp. 106-114. ISSN 2092-8637
English
title A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources
topic Information storage and retrieval systems.
Natural language processing (Computer science)
Interactive computer systems.
url http://psasir.upm.edu.my/id/eprint/12693/