Investigating Design Choices between Bitmap index and B-tree index for a Large Data Warehouse System

Building indexes on database is common, but it has an important impact on the query performance, especially in large databases such as a Data Warehouse where the queries are usually very complex and ad hoc. If a proper index structure is chosen, the query response time call be accelerated. Until now...

Full description

Bibliographic Details
Main Authors: Morteza, Zaker, Somnuk, Phon-Amnuaisuk, Su-Cheng, Haw
Format: Conference or Workshop Item
Published: 2008
Subjects:
Online Access:http://shdl.mmu.edu.my/2841/
_version_ 1848790163871760384
author Morteza, Zaker
Somnuk, Phon-Amnuaisuk
Su-Cheng, Haw
author_facet Morteza, Zaker
Somnuk, Phon-Amnuaisuk
Su-Cheng, Haw
author_sort Morteza, Zaker
building MMU Institutional Repository
collection Online Access
description Building indexes on database is common, but it has an important impact on the query performance, especially in large databases such as a Data Warehouse where the queries are usually very complex and ad hoc. If a proper index structure is chosen, the query response time call be accelerated. Until now, there is no definite guideline for Data Warehouse analysts to choose the appropriate index. According to conventional wisdom, Bitmap index is a preferred indexing technique for cases where the indexed attributes have few distinct values (i.e., low cardinality). The query response time is expected to degrade as the cardinality of indexed columns increase due to a larger index size. On the other hand, B-tree index is good if the column values are of high cardinality due to its indexing and retrieving mechanisms. In this paper, we show that this may not be true under certain circumstances. Experimental results support the fact that even though the level of column cardinality determines the index file size, but the query processing time is not determined by the level of column cardinality. Moreover, our results indicate that the Bitmap index is faster than B-tree index on a large dataset with multi-billion records.
first_indexed 2025-11-14T18:08:15Z
format Conference or Workshop Item
id mmu-2841
institution Multimedia University
institution_category Local University
last_indexed 2025-11-14T18:08:15Z
publishDate 2008
recordtype eprints
repository_type Digital Repository
spelling mmu-28412011-09-21T07:49:48Z http://shdl.mmu.edu.my/2841/ Investigating Design Choices between Bitmap index and B-tree index for a Large Data Warehouse System Morteza, Zaker Somnuk, Phon-Amnuaisuk Su-Cheng, Haw T Technology (General) QA75.5-76.95 Electronic computers. Computer science Building indexes on database is common, but it has an important impact on the query performance, especially in large databases such as a Data Warehouse where the queries are usually very complex and ad hoc. If a proper index structure is chosen, the query response time call be accelerated. Until now, there is no definite guideline for Data Warehouse analysts to choose the appropriate index. According to conventional wisdom, Bitmap index is a preferred indexing technique for cases where the indexed attributes have few distinct values (i.e., low cardinality). The query response time is expected to degrade as the cardinality of indexed columns increase due to a larger index size. On the other hand, B-tree index is good if the column values are of high cardinality due to its indexing and retrieving mechanisms. In this paper, we show that this may not be true under certain circumstances. Experimental results support the fact that even though the level of column cardinality determines the index file size, but the query processing time is not determined by the level of column cardinality. Moreover, our results indicate that the Bitmap index is faster than B-tree index on a large dataset with multi-billion records. 2008-11 Conference or Workshop Item NonPeerReviewed Morteza, Zaker and Somnuk, Phon-Amnuaisuk and Su-Cheng, Haw (2008) Investigating Design Choices between Bitmap index and B-tree index for a Large Data Warehouse System. In: 8th WSEAS International Conference on Applied Computer Science (ACS 08) , 21-23 NOV 2008 , Venice, ITALY. http://apps.webofknowledge.com/full_record.do?product=WOS&search_mode=GeneralSearch&qid=1&SID=V1OJnefKFf4@FFPHd@m&page=88&doc=876
spellingShingle T Technology (General)
QA75.5-76.95 Electronic computers. Computer science
Morteza, Zaker
Somnuk, Phon-Amnuaisuk
Su-Cheng, Haw
Investigating Design Choices between Bitmap index and B-tree index for a Large Data Warehouse System
title Investigating Design Choices between Bitmap index and B-tree index for a Large Data Warehouse System
title_full Investigating Design Choices between Bitmap index and B-tree index for a Large Data Warehouse System
title_fullStr Investigating Design Choices between Bitmap index and B-tree index for a Large Data Warehouse System
title_full_unstemmed Investigating Design Choices between Bitmap index and B-tree index for a Large Data Warehouse System
title_short Investigating Design Choices between Bitmap index and B-tree index for a Large Data Warehouse System
title_sort investigating design choices between bitmap index and b-tree index for a large data warehouse system
topic T Technology (General)
QA75.5-76.95 Electronic computers. Computer science
url http://shdl.mmu.edu.my/2841/
http://shdl.mmu.edu.my/2841/