A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources

Extracting information from the web data sources becomes very important because the massive and increasing amount of diverse semi-structured information sources in the Internet that are available to users, and the variety of web pages making the process of information extraction from web a challengi...

Full description

Bibliographic Details
Main Authors:	Shaker, Mahmoud, Ibrahim, Hamidah, Mustapha, Aida, Abdullah, Lili Nurliyana
Format:	Article
Language:	English
Published:	2010
Subjects:	Information storage and retrieval systems. Natural language processing (Computer science) Interactive computer systems.
Online Access:	http://psasir.upm.edu.my/id/eprint/12693/

_version_	1848841904337190912
author	Shaker, Mahmoud Ibrahim, Hamidah Mustapha, Aida Abdullah, Lili Nurliyana
author_facet	Shaker, Mahmoud Ibrahim, Hamidah Mustapha, Aida Abdullah, Lili Nurliyana
author_sort	Shaker, Mahmoud
building	UPM Institutional Repository
collection	Online Access
description	Extracting information from the web data sources becomes very important because the massive and increasing amount of diverse semi-structured information sources in the Internet that are available to users, and the variety of web pages making the process of information extraction from web a challenging problem. This paper proposes a framework for extracting, classifying, analyzing, and presenting semi-structured web data sources. The framework is able to extract relevant information from different web data sources, and classify the extracted information based on the standard classification scheme of Nokia products, which has been chosen as the case study.
first_indexed	2025-11-15T07:50:39Z
format	Article
id	upm-12693
institution	Universiti Putra Malaysia
institution_category	Local University
language	English
last_indexed	2025-11-15T07:50:39Z
publishDate	2010
recordtype	eprints
repository_type	Digital Repository
spelling	upm-126932011-11-24T04:50:32Z http://psasir.upm.edu.my/id/eprint/12693/ A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources Shaker, Mahmoud Ibrahim, Hamidah Mustapha, Aida Abdullah, Lili Nurliyana Extracting information from the web data sources becomes very important because the massive and increasing amount of diverse semi-structured information sources in the Internet that are available to users, and the variety of web pages making the process of information extraction from web a challenging problem. This paper proposes a framework for extracting, classifying, analyzing, and presenting semi-structured web data sources. The framework is able to extract relevant information from different web data sources, and classify the extracted information based on the standard classification scheme of Nokia products, which has been chosen as the case study. 2010-11 Article PeerReviewed Shaker, Mahmoud and Ibrahim, Hamidah and Mustapha, Aida and Abdullah, Lili Nurliyana (2010) A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources. Journal of Next Generation Information Technology, 1 (3). pp. 106-114. ISSN 2092-8637 Information storage and retrieval systems. Natural language processing (Computer science) Interactive computer systems. English
spellingShingle	Information storage and retrieval systems. Natural language processing (Computer science) Interactive computer systems. Shaker, Mahmoud Ibrahim, Hamidah Mustapha, Aida Abdullah, Lili Nurliyana A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources
title	A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources
title_full	A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources
title_fullStr	A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources
title_full_unstemmed	A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources
title_short	A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources
title_sort	framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources
topic	Information storage and retrieval systems. Natural language processing (Computer science) Interactive computer systems.
url	http://psasir.upm.edu.my/id/eprint/12693/

A framework for extracting, classifying, analyzing, and presenting information from semi-structured web data sources

Similar Items