A survey of statistical approaches for query expansion

A major issue in effective information retrieval is the problem of vocabulary mismatches. The method called query expansion addresses this issue by reformulating each search query with additional terms that better define the information needs of the user. Many researchers have contributed to improvi...

Full description

Bibliographic Details
Main Authors: Raza, Muhammad Ahsan, Rahmah, Mokhtar, Noraziah, Ahmad
Format: Article
Language:English
Published: Springer Verlag 2018
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/23272/
http://umpir.ump.edu.my/id/eprint/23272/1/A%20survey%20of%20statistical%20approaches%20for%20query%20expansion1.pdf
_version_ 1848821766297747456
author Raza, Muhammad Ahsan
Rahmah, Mokhtar
Noraziah, Ahmad
author_facet Raza, Muhammad Ahsan
Rahmah, Mokhtar
Noraziah, Ahmad
author_sort Raza, Muhammad Ahsan
building UMP Institutional Repository
collection Online Access
description A major issue in effective information retrieval is the problem of vocabulary mismatches. The method called query expansion addresses this issue by reformulating each search query with additional terms that better define the information needs of the user. Many researchers have contributed to improving the accuracy of information retrieval systems, through different approaches to query expansion. In this article, we primarily discuss statistical query expansion approaches that include document analysis, search and browse log analyses, and web knowledge analyses. In addition to proposing a comprehensive classification for these approaches, we also briefly analyse the pros and cons of each technique. Finally, we evaluate these techniques using five functional features and experimental settings such as TREC collection and results of performance metrics. An in-depth survey of different statistical query expansion approaches suggests that the selection of the best approach depends on the type of search query, the nature and availability of data resources, and performance efficiency requirements.
first_indexed 2025-11-15T02:30:34Z
format Article
id ump-23272
institution Universiti Malaysia Pahang
institution_category Local University
language English
last_indexed 2025-11-15T02:30:34Z
publishDate 2018
publisher Springer Verlag
recordtype eprints
repository_type Digital Repository
spelling ump-232722019-01-11T04:21:17Z http://umpir.ump.edu.my/id/eprint/23272/ A survey of statistical approaches for query expansion Raza, Muhammad Ahsan Rahmah, Mokhtar Noraziah, Ahmad QA75 Electronic computers. Computer science A major issue in effective information retrieval is the problem of vocabulary mismatches. The method called query expansion addresses this issue by reformulating each search query with additional terms that better define the information needs of the user. Many researchers have contributed to improving the accuracy of information retrieval systems, through different approaches to query expansion. In this article, we primarily discuss statistical query expansion approaches that include document analysis, search and browse log analyses, and web knowledge analyses. In addition to proposing a comprehensive classification for these approaches, we also briefly analyse the pros and cons of each technique. Finally, we evaluate these techniques using five functional features and experimental settings such as TREC collection and results of performance metrics. An in-depth survey of different statistical query expansion approaches suggests that the selection of the best approach depends on the type of search query, the nature and availability of data resources, and performance efficiency requirements. Springer Verlag 2018-09-01 Article PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/23272/1/A%20survey%20of%20statistical%20approaches%20for%20query%20expansion1.pdf Raza, Muhammad Ahsan and Rahmah, Mokhtar and Noraziah, Ahmad (2018) A survey of statistical approaches for query expansion. Knowledge and Information System. pp. 1-25. ISSN 02191377. (Published) http://doi.org/10.1007/s10115-018-1269-8 http://doi.org/10.1007/s10115-018-1269-8
spellingShingle QA75 Electronic computers. Computer science
Raza, Muhammad Ahsan
Rahmah, Mokhtar
Noraziah, Ahmad
A survey of statistical approaches for query expansion
title A survey of statistical approaches for query expansion
title_full A survey of statistical approaches for query expansion
title_fullStr A survey of statistical approaches for query expansion
title_full_unstemmed A survey of statistical approaches for query expansion
title_short A survey of statistical approaches for query expansion
title_sort survey of statistical approaches for query expansion
topic QA75 Electronic computers. Computer science
url http://umpir.ump.edu.my/id/eprint/23272/
http://umpir.ump.edu.my/id/eprint/23272/
http://umpir.ump.edu.my/id/eprint/23272/
http://umpir.ump.edu.my/id/eprint/23272/1/A%20survey%20of%20statistical%20approaches%20for%20query%20expansion1.pdf