Instance based matching using regular expression

Instance based matching is the process of comparing data from different heterogeneous data sources in determining the correspondence of schema elements. It is a useful alternative choice when schema information (element name, description, constraint) is unavailable or unable to determine the match b...

Full description

Bibliographic Details
Main Authors: Mehdi, Osama A., Ibrahim, Hamidah, Affendey, Lilly Suriani
Format: Article
Language:English
Published: Elsevier 2012
Online Access:http://psasir.upm.edu.my/id/eprint/42917/
http://psasir.upm.edu.my/id/eprint/42917/1/42917.pdf
_version_ 1848850091885985792
author Mehdi, Osama A.
Ibrahim, Hamidah
Affendey, Lilly Suriani
author_facet Mehdi, Osama A.
Ibrahim, Hamidah
Affendey, Lilly Suriani
author_sort Mehdi, Osama A.
building UPM Institutional Repository
collection Online Access
description Instance based matching is the process of comparing data from different heterogeneous data sources in determining the correspondence of schema elements. It is a useful alternative choice when schema information (element name, description, constraint) is unavailable or unable to determine the match between schema elements. Instance based matching is a non trivial problem and is applied in many application areas such as data integration, data cleaning, query mediations, and warehousing. Many instance based solutions to the schema matching problem have been proposed and most of them utilized similarity metrics. In this paper, we present a fully automatic approach that contributes to the solution of instance based matching in identifying the correspondences of attributes which is one of the elements in the schema by utilizing regular expression. Several experiments using real-world data set have been conducted to evaluate the performance of our proposed approach. The results showed that our proposed approach achieved better accuracy compared to previous approaches using similarity metrics.
first_indexed 2025-11-15T10:00:47Z
format Article
id upm-42917
institution Universiti Putra Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T10:00:47Z
publishDate 2012
publisher Elsevier
recordtype eprints
repository_type Digital Repository
spelling upm-429172016-05-03T05:16:37Z http://psasir.upm.edu.my/id/eprint/42917/ Instance based matching using regular expression Mehdi, Osama A. Ibrahim, Hamidah Affendey, Lilly Suriani Instance based matching is the process of comparing data from different heterogeneous data sources in determining the correspondence of schema elements. It is a useful alternative choice when schema information (element name, description, constraint) is unavailable or unable to determine the match between schema elements. Instance based matching is a non trivial problem and is applied in many application areas such as data integration, data cleaning, query mediations, and warehousing. Many instance based solutions to the schema matching problem have been proposed and most of them utilized similarity metrics. In this paper, we present a fully automatic approach that contributes to the solution of instance based matching in identifying the correspondences of attributes which is one of the elements in the schema by utilizing regular expression. Several experiments using real-world data set have been conducted to evaluate the performance of our proposed approach. The results showed that our proposed approach achieved better accuracy compared to previous approaches using similarity metrics. Elsevier 2012 Article PeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/42917/1/42917.pdf Mehdi, Osama A. and Ibrahim, Hamidah and Affendey, Lilly Suriani (2012) Instance based matching using regular expression. Procedia Computer Science, 10. pp. 688-695. ISSN 1877-0509 http://www.sciencedirect.com/science/article/pii/S1877050912004450 10.1016/j.procs.2012.06.088
spellingShingle Mehdi, Osama A.
Ibrahim, Hamidah
Affendey, Lilly Suriani
Instance based matching using regular expression
title Instance based matching using regular expression
title_full Instance based matching using regular expression
title_fullStr Instance based matching using regular expression
title_full_unstemmed Instance based matching using regular expression
title_short Instance based matching using regular expression
title_sort instance based matching using regular expression
url http://psasir.upm.edu.my/id/eprint/42917/
http://psasir.upm.edu.my/id/eprint/42917/
http://psasir.upm.edu.my/id/eprint/42917/
http://psasir.upm.edu.my/id/eprint/42917/1/42917.pdf