JQPro : Join query processing in a distributed system for big RDF data using the hash-merge join technique

In the last decade, the volume of semantic data has increased exponentially, with the number of Resource Description Framework (RDF) datasets exceeding trillions of triples in RDF repositories. Hence, the size of RDF datasets continues to grow. However, with the increasing number of RDF triples, com...

Full description

Bibliographic Details
Main Authors: Elzein, Nahla Mohammed, Mazlina, Abdul Majid, Hashem, Ibrahim Abaker Targio, Ashraf Osman, Ibrahim, Abulfaraj, Anas W., Binzagr, Faisal
Format: Article
Language:English
Published: MDPI 2023
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/37620/
http://umpir.ump.edu.my/id/eprint/37620/1/JQPro_Join%20query%20processing%20in%20a%20distributed%20system%20for%20big%20rdf%20data%20using%20the%20hash-merge%20join%20technique.pdf
_version_ 1848825300122599424
author Elzein, Nahla Mohammed
Mazlina, Abdul Majid
Hashem, Ibrahim Abaker Targio
Ashraf Osman, Ibrahim
Abulfaraj, Anas W.
Binzagr, Faisal
author_facet Elzein, Nahla Mohammed
Mazlina, Abdul Majid
Hashem, Ibrahim Abaker Targio
Ashraf Osman, Ibrahim
Abulfaraj, Anas W.
Binzagr, Faisal
author_sort Elzein, Nahla Mohammed
building UMP Institutional Repository
collection Online Access
description In the last decade, the volume of semantic data has increased exponentially, with the number of Resource Description Framework (RDF) datasets exceeding trillions of triples in RDF repositories. Hence, the size of RDF datasets continues to grow. However, with the increasing number of RDF triples, complex multiple RDF queries are becoming a significant demand. Sometimes, such complex queries produce many common sub-expressions in a single query or over multiple queries running as a batch. In addition, it is also difficult to minimize the number of RDF queries and processing time for a large amount of related data in a typical distributed environment encounter. To address this complication, we introduce a join query processing model for big RDF data, called JQPro. By adopting a MapReduce framework in JQPro, we developed three new algorithms, which are hash-join, sort-merge, and enhanced MapReduce-join for join query processing of RDF data. Based on an experiment conducted, the result showed that the JQPro model outperformed the two popular algorithms, gStore and RDF-3X, with respect to the average execution time. Furthermore, the JQPro model was also tested against RDF-3X, RDFox, and PARJs using the LUBM benchmark. The result showed that the JQPro model had better performance in comparison with the other models. In conclusion, the findings showed that JQPro achieved improved performance with 87.77% in terms of execution time. Hence, in comparison with the selected models, JQPro performs better.
first_indexed 2025-11-15T03:26:44Z
format Article
id ump-37620
institution Universiti Malaysia Pahang
institution_category Local University
language English
last_indexed 2025-11-15T03:26:44Z
publishDate 2023
publisher MDPI
recordtype eprints
repository_type Digital Repository
spelling ump-376202025-05-30T08:49:42Z http://umpir.ump.edu.my/id/eprint/37620/ JQPro : Join query processing in a distributed system for big RDF data using the hash-merge join technique Elzein, Nahla Mohammed Mazlina, Abdul Majid Hashem, Ibrahim Abaker Targio Ashraf Osman, Ibrahim Abulfaraj, Anas W. Binzagr, Faisal QA75 Electronic computers. Computer science QA76 Computer software T Technology (General) TA Engineering (General). Civil engineering (General) In the last decade, the volume of semantic data has increased exponentially, with the number of Resource Description Framework (RDF) datasets exceeding trillions of triples in RDF repositories. Hence, the size of RDF datasets continues to grow. However, with the increasing number of RDF triples, complex multiple RDF queries are becoming a significant demand. Sometimes, such complex queries produce many common sub-expressions in a single query or over multiple queries running as a batch. In addition, it is also difficult to minimize the number of RDF queries and processing time for a large amount of related data in a typical distributed environment encounter. To address this complication, we introduce a join query processing model for big RDF data, called JQPro. By adopting a MapReduce framework in JQPro, we developed three new algorithms, which are hash-join, sort-merge, and enhanced MapReduce-join for join query processing of RDF data. Based on an experiment conducted, the result showed that the JQPro model outperformed the two popular algorithms, gStore and RDF-3X, with respect to the average execution time. Furthermore, the JQPro model was also tested against RDF-3X, RDFox, and PARJs using the LUBM benchmark. The result showed that the JQPro model had better performance in comparison with the other models. In conclusion, the findings showed that JQPro achieved improved performance with 87.77% in terms of execution time. Hence, in comparison with the selected models, JQPro performs better. MDPI 2023-03 Article PeerReviewed pdf en cc_by_4 http://umpir.ump.edu.my/id/eprint/37620/1/JQPro_Join%20query%20processing%20in%20a%20distributed%20system%20for%20big%20rdf%20data%20using%20the%20hash-merge%20join%20technique.pdf Elzein, Nahla Mohammed and Mazlina, Abdul Majid and Hashem, Ibrahim Abaker Targio and Ashraf Osman, Ibrahim and Abulfaraj, Anas W. and Binzagr, Faisal (2023) JQPro : Join query processing in a distributed system for big RDF data using the hash-merge join technique. Mathematics, 11 (5). pp. 1-20. ISSN 2227-7390. (Published) https://doi.org/10.3390/math11051275 https://doi.org/10.3390/math11051275
spellingShingle QA75 Electronic computers. Computer science
QA76 Computer software
T Technology (General)
TA Engineering (General). Civil engineering (General)
Elzein, Nahla Mohammed
Mazlina, Abdul Majid
Hashem, Ibrahim Abaker Targio
Ashraf Osman, Ibrahim
Abulfaraj, Anas W.
Binzagr, Faisal
JQPro : Join query processing in a distributed system for big RDF data using the hash-merge join technique
title JQPro : Join query processing in a distributed system for big RDF data using the hash-merge join technique
title_full JQPro : Join query processing in a distributed system for big RDF data using the hash-merge join technique
title_fullStr JQPro : Join query processing in a distributed system for big RDF data using the hash-merge join technique
title_full_unstemmed JQPro : Join query processing in a distributed system for big RDF data using the hash-merge join technique
title_short JQPro : Join query processing in a distributed system for big RDF data using the hash-merge join technique
title_sort jqpro : join query processing in a distributed system for big rdf data using the hash-merge join technique
topic QA75 Electronic computers. Computer science
QA76 Computer software
T Technology (General)
TA Engineering (General). Civil engineering (General)
url http://umpir.ump.edu.my/id/eprint/37620/
http://umpir.ump.edu.my/id/eprint/37620/
http://umpir.ump.edu.my/id/eprint/37620/
http://umpir.ump.edu.my/id/eprint/37620/1/JQPro_Join%20query%20processing%20in%20a%20distributed%20system%20for%20big%20rdf%20data%20using%20the%20hash-merge%20join%20technique.pdf