Skyline queries computation on crowdsourced- enabled incomplete database

Data incompleteness becomes a frequent phenomenon in a large number of contemporary database applications such as web autonomous databases, big data, and crowd-sourced databases. Processing skyline queries over incomplete databases impose a number of challenges that negatively influence processing t...

Full description

Bibliographic Details
Main Authors: Swidan, Marwa B., Alwan, Ali A., Turaev, Sherzod, Ibrahim, Hamidah, Abualkishik, Abedallah Zaid, Gulzar, Yonis
Format: Article
Published: Institute of Electrical and Electronics Engineers 2020
Online Access:http://psasir.upm.edu.my/id/eprint/85830/
_version_ 1848860193938472960
author Swidan, Marwa B.
Alwan, Ali A.
Turaev, Sherzod
Ibrahim, Hamidah
Abualkishik, Abedallah Zaid
Gulzar, Yonis
author_facet Swidan, Marwa B.
Alwan, Ali A.
Turaev, Sherzod
Ibrahim, Hamidah
Abualkishik, Abedallah Zaid
Gulzar, Yonis
author_sort Swidan, Marwa B.
building UPM Institutional Repository
collection Online Access
description Data incompleteness becomes a frequent phenomenon in a large number of contemporary database applications such as web autonomous databases, big data, and crowd-sourced databases. Processing skyline queries over incomplete databases impose a number of challenges that negatively influence processing the skyline queries. Most importantly, the skylines derived from incomplete databases are also incomplete in which some values are missing. Retrieving skylines with missing values is undesirable, particularly, for recommendation and decision-making systems. Furthermore, running skyline queries on a database with incomplete data raises a number of issues influence processing skyline queries such as losing the transitivity property of the skyline technique and cyclic dominance between the tuples. The issue of estimating the missing values of skylines has been discussed and examined in the database literature. Most recently, several studies have suggested exploiting the crowd-sourced databases in order to estimate the missing values by generating plausible values using the crowd. Crowd-sourced databases have proved to be a powerful solution to perform user-given tasks by integrating human intelligence and experience to process the tasks. However, task processing using crowd-sourced incurs additional monetary cost and increases the time latency. Also, it is not always possible to produce a satisfactory result that meets the user's preferences. This paper proposes an approach for estimating the missing values of the skylines by first exploiting the available data and utilizes the implicit relationships between the attributes in order to impute the missing values of the skylines. This process aims at reducing the number of values to be estimated using the crowd when local estimation is inappropriate. Intensive experiments on both synthetic and real datasets have been accomplished. The experimental results have proven that the proposed approach for estimating the missing values of the skylines over crowd-sourced enabled incomplete databases is scalable and outperforms the other existing approaches.
first_indexed 2025-11-15T12:41:21Z
format Article
id upm-85830
institution Universiti Putra Malaysia
institution_category Local University
last_indexed 2025-11-15T12:41:21Z
publishDate 2020
publisher Institute of Electrical and Electronics Engineers
recordtype eprints
repository_type Digital Repository
spelling upm-858302023-10-02T08:44:10Z http://psasir.upm.edu.my/id/eprint/85830/ Skyline queries computation on crowdsourced- enabled incomplete database Swidan, Marwa B. Alwan, Ali A. Turaev, Sherzod Ibrahim, Hamidah Abualkishik, Abedallah Zaid Gulzar, Yonis Data incompleteness becomes a frequent phenomenon in a large number of contemporary database applications such as web autonomous databases, big data, and crowd-sourced databases. Processing skyline queries over incomplete databases impose a number of challenges that negatively influence processing the skyline queries. Most importantly, the skylines derived from incomplete databases are also incomplete in which some values are missing. Retrieving skylines with missing values is undesirable, particularly, for recommendation and decision-making systems. Furthermore, running skyline queries on a database with incomplete data raises a number of issues influence processing skyline queries such as losing the transitivity property of the skyline technique and cyclic dominance between the tuples. The issue of estimating the missing values of skylines has been discussed and examined in the database literature. Most recently, several studies have suggested exploiting the crowd-sourced databases in order to estimate the missing values by generating plausible values using the crowd. Crowd-sourced databases have proved to be a powerful solution to perform user-given tasks by integrating human intelligence and experience to process the tasks. However, task processing using crowd-sourced incurs additional monetary cost and increases the time latency. Also, it is not always possible to produce a satisfactory result that meets the user's preferences. This paper proposes an approach for estimating the missing values of the skylines by first exploiting the available data and utilizes the implicit relationships between the attributes in order to impute the missing values of the skylines. This process aims at reducing the number of values to be estimated using the crowd when local estimation is inappropriate. Intensive experiments on both synthetic and real datasets have been accomplished. The experimental results have proven that the proposed approach for estimating the missing values of the skylines over crowd-sourced enabled incomplete databases is scalable and outperforms the other existing approaches. Institute of Electrical and Electronics Engineers 2020 Article PeerReviewed Swidan, Marwa B. and Alwan, Ali A. and Turaev, Sherzod and Ibrahim, Hamidah and Abualkishik, Abedallah Zaid and Gulzar, Yonis (2020) Skyline queries computation on crowdsourced- enabled incomplete database. IEEE Access, 8. 106660 - 106689. ISSN 2169-3536 https://ieeexplore.ieee.org/abstract/document/9110574 10.1109/ACCESS.2020.3000664
spellingShingle Swidan, Marwa B.
Alwan, Ali A.
Turaev, Sherzod
Ibrahim, Hamidah
Abualkishik, Abedallah Zaid
Gulzar, Yonis
Skyline queries computation on crowdsourced- enabled incomplete database
title Skyline queries computation on crowdsourced- enabled incomplete database
title_full Skyline queries computation on crowdsourced- enabled incomplete database
title_fullStr Skyline queries computation on crowdsourced- enabled incomplete database
title_full_unstemmed Skyline queries computation on crowdsourced- enabled incomplete database
title_short Skyline queries computation on crowdsourced- enabled incomplete database
title_sort skyline queries computation on crowdsourced- enabled incomplete database
url http://psasir.upm.edu.my/id/eprint/85830/
http://psasir.upm.edu.my/id/eprint/85830/
http://psasir.upm.edu.my/id/eprint/85830/