An Automated System For Classifying Conference Papers

In the research conference domain, paper assignment process often poses as a timeconsuming and repetitive task for a chairman. A chairman is required to manually review the contents of a research paper, before assigning it to a suitable reviewer. This project is aimed to develop an automated web-bas...

Full description

Bibliographic Details
Main Author: Ngan, Seon Choon Han
Format: Final Year Project / Dissertation / Thesis
Published: 2021
Subjects:
Online Access:http://eprints.utar.edu.my/4096/
http://eprints.utar.edu.my/4096/1/1706067_FYP_report_%2D_NGAN_CHOON_HAN_SEON.pdf
_version_ 1848886075315978240
author Ngan, Seon Choon Han
author_facet Ngan, Seon Choon Han
author_sort Ngan, Seon Choon Han
building UTAR Institutional Repository
collection Online Access
description In the research conference domain, paper assignment process often poses as a timeconsuming and repetitive task for a chairman. A chairman is required to manually review the contents of a research paper, before assigning it to a suitable reviewer. This project is aimed to develop an automated web-based conference paper system for the manual process of assigning papers to reviewers by using classification models. The project is also aimed to select the best classification model for the system, based on an empirical study. The Knowledge Discovery in Databases (KDD) process was followed as a formal data mining methodology where 1000 AI conference papers were carefully collected, pre-processed and transformed to numerical representations through TF-IDF vectorisation. A randomised stratified 5- fold cross validation was then applied on several data mining algorithms and evaluated using the F-measure as a metric. The Support Vector Machine algorithm resulted in the highest F-measure (0.906), followed closely by Logistic Regression (0.903), Random Forest (0.891), Naïve Bayes (0.880), K-Nearest Neighbour (0.831) and lastly, Decision Tree (0.778). Grid search optimisation was then performed but no significant improvements could be observed. The best classification model was then deployed to a web-based research conference system. The web-based system was developed using the Django web framework, based on a system architecture defined in this project called the Enhanced 3-Tier Web-based System with a Data Mining Layer. In conclusion, an automated paper classification system was successfully developed using classification models and its practical usage was demonstrated on a web-based research conference system to help chairmen in assigning papers to suitable reviewers.
first_indexed 2025-11-15T19:32:43Z
format Final Year Project / Dissertation / Thesis
id utar-4096
institution Universiti Tunku Abdul Rahman
institution_category Local University
last_indexed 2025-11-15T19:32:43Z
publishDate 2021
recordtype eprints
repository_type Digital Repository
spelling utar-40962021-06-11T19:05:31Z An Automated System For Classifying Conference Papers Ngan, Seon Choon Han QA76 Computer software In the research conference domain, paper assignment process often poses as a timeconsuming and repetitive task for a chairman. A chairman is required to manually review the contents of a research paper, before assigning it to a suitable reviewer. This project is aimed to develop an automated web-based conference paper system for the manual process of assigning papers to reviewers by using classification models. The project is also aimed to select the best classification model for the system, based on an empirical study. The Knowledge Discovery in Databases (KDD) process was followed as a formal data mining methodology where 1000 AI conference papers were carefully collected, pre-processed and transformed to numerical representations through TF-IDF vectorisation. A randomised stratified 5- fold cross validation was then applied on several data mining algorithms and evaluated using the F-measure as a metric. The Support Vector Machine algorithm resulted in the highest F-measure (0.906), followed closely by Logistic Regression (0.903), Random Forest (0.891), Naïve Bayes (0.880), K-Nearest Neighbour (0.831) and lastly, Decision Tree (0.778). Grid search optimisation was then performed but no significant improvements could be observed. The best classification model was then deployed to a web-based research conference system. The web-based system was developed using the Django web framework, based on a system architecture defined in this project called the Enhanced 3-Tier Web-based System with a Data Mining Layer. In conclusion, an automated paper classification system was successfully developed using classification models and its practical usage was demonstrated on a web-based research conference system to help chairmen in assigning papers to suitable reviewers. 2021 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/4096/1/1706067_FYP_report_%2D_NGAN_CHOON_HAN_SEON.pdf Ngan, Seon Choon Han (2021) An Automated System For Classifying Conference Papers. Final Year Project, UTAR. http://eprints.utar.edu.my/4096/
spellingShingle QA76 Computer software
Ngan, Seon Choon Han
An Automated System For Classifying Conference Papers
title An Automated System For Classifying Conference Papers
title_full An Automated System For Classifying Conference Papers
title_fullStr An Automated System For Classifying Conference Papers
title_full_unstemmed An Automated System For Classifying Conference Papers
title_short An Automated System For Classifying Conference Papers
title_sort automated system for classifying conference papers
topic QA76 Computer software
url http://eprints.utar.edu.my/4096/
http://eprints.utar.edu.my/4096/1/1706067_FYP_report_%2D_NGAN_CHOON_HAN_SEON.pdf