An efficient schema transformation technique for data migration from relational to column-oriented databases

Data transformation is the core process in migrating a relational database to a NoSQL database such as a column-oriented database. However, there is no standard guideline for data transformation from relational to NoSQL databases. A number of schema transformation techniques have bee...

Bibliographic Details
Main Authors: Zaidi, Norwini, Ishak, Iskandar, Sidi, Fatimah, Affendey, Lilly Suriani
Format: Article
Published: Tech Science Press 2022
Online Access: http://psasir.upm.edu.my/id/eprint/100235/
author Zaidi, Norwini
Ishak, Iskandar
Sidi, Fatimah
Affendey, Lilly Suriani
building UPM Institutional Repository
collection Online Access
description Data transformation is the core process in migrating a relational database to a NoSQL database such as a column-oriented database. However, there is no standard guideline for data transformation from relational to NoSQL databases. A number of schema transformation techniques have been proposed to improve the data transformation process, and they have resulted in better query processing time than that of the relational database. However, these approaches produce redundant tables in the resulting schema, which in turn consume unnecessary storage and lead to high query processing time because the generated schema contains redundant column families in the transformed column-oriented database. In this paper, an efficient data transformation technique from relational databases to column-oriented databases is proposed. The proposed schema transformation technique is based on a combination of the denormalization approach, data access patterns and a multiple-nested schema. To validate the proposed work, the technique is implemented by transforming data from a MySQL database to an HBase database. A benchmark transformation technique is also implemented, and the query processing time and storage size of the two are compared. Based on the experimental results, the proposed transformation technique shows significant improvement in query processing time and storage space usage due to the reduced number of column families in the column-oriented database.
first_indexed 2025-11-15T13:30:15Z
format Article
id upm-100235
institution Universiti Putra Malaysia
institution_category Local University
last_indexed 2025-11-15T13:30:15Z
publishDate 2022
publisher Tech Science Press
recordtype eprints
repository_type Digital Repository
spelling upm-100235 2024-03-18T04:43:01Z http://psasir.upm.edu.my/id/eprint/100235/
Tech Science Press 2022-05-09 Article PeerReviewed Zaidi, Norwini and Ishak, Iskandar and Sidi, Fatimah and Affendey, Lilly Suriani (2022) An efficient schema transformation technique for data migration from relational to column-oriented databases. Computer Systems Science & Engineering, 43 (3), 1175-1188. ISSN 0267-6192. https://www.techscience.com/csse/v43n3/47678 DOI: 10.32604/csse.2022.021969
title An efficient schema transformation technique for data migration from relational to column-oriented databases
url http://psasir.upm.edu.my/id/eprint/100235/