An Automated System For Classifying Conference Papers
In the research conference domain, paper assignment process often poses as a timeconsuming and repetitive task for a chairman. A chairman is required to manually review the contents of a research paper, before assigning it to a suitable reviewer. This project is aimed to develop an automated web-bas...
| Main Author: | |
|---|---|
| Format: | Final Year Project / Dissertation / Thesis |
| Published: |
2021
|
| Subjects: | |
| Online Access: | http://eprints.utar.edu.my/4096/ http://eprints.utar.edu.my/4096/1/1706067_FYP_report_%2D_NGAN_CHOON_HAN_SEON.pdf |
| _version_ | 1848886075315978240 |
|---|---|
| author | Ngan, Seon Choon Han |
| author_facet | Ngan, Seon Choon Han |
| author_sort | Ngan, Seon Choon Han |
| building | UTAR Institutional Repository |
| collection | Online Access |
| description | In the research conference domain, paper assignment process often poses as a timeconsuming and repetitive task for a chairman. A chairman is required to manually review the contents of a research paper, before assigning it to a suitable reviewer. This project is aimed to develop an automated web-based conference paper system for the manual process of assigning papers to reviewers by using classification models. The project is also aimed to select the best classification model for the system, based on an empirical study. The Knowledge Discovery in Databases (KDD) process was followed as a formal data mining methodology where 1000 AI conference papers were carefully collected, pre-processed and transformed to numerical representations through TF-IDF vectorisation. A randomised stratified 5- fold cross validation was then applied on several data mining algorithms and evaluated using the F-measure as a metric. The Support Vector Machine algorithm resulted in the highest F-measure (0.906), followed closely by Logistic Regression (0.903), Random Forest (0.891), Naïve Bayes (0.880), K-Nearest Neighbour (0.831) and lastly, Decision Tree (0.778). Grid search optimisation was then performed but no significant improvements could be observed. The best classification model was then deployed to a web-based research conference system. The web-based system was developed using the Django web framework, based on a system architecture defined in this project called the Enhanced 3-Tier Web-based System with a Data Mining Layer. In conclusion, an automated paper classification system was successfully developed using classification models and its practical usage was demonstrated on a web-based research conference system to help chairmen in assigning papers to suitable reviewers. |
| first_indexed | 2025-11-15T19:32:43Z |
| format | Final Year Project / Dissertation / Thesis |
| id | utar-4096 |
| institution | Universiti Tunku Abdul Rahman |
| institution_category | Local University |
| last_indexed | 2025-11-15T19:32:43Z |
| publishDate | 2021 |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | utar-40962021-06-11T19:05:31Z An Automated System For Classifying Conference Papers Ngan, Seon Choon Han QA76 Computer software In the research conference domain, paper assignment process often poses as a timeconsuming and repetitive task for a chairman. A chairman is required to manually review the contents of a research paper, before assigning it to a suitable reviewer. This project is aimed to develop an automated web-based conference paper system for the manual process of assigning papers to reviewers by using classification models. The project is also aimed to select the best classification model for the system, based on an empirical study. The Knowledge Discovery in Databases (KDD) process was followed as a formal data mining methodology where 1000 AI conference papers were carefully collected, pre-processed and transformed to numerical representations through TF-IDF vectorisation. A randomised stratified 5- fold cross validation was then applied on several data mining algorithms and evaluated using the F-measure as a metric. The Support Vector Machine algorithm resulted in the highest F-measure (0.906), followed closely by Logistic Regression (0.903), Random Forest (0.891), Naïve Bayes (0.880), K-Nearest Neighbour (0.831) and lastly, Decision Tree (0.778). Grid search optimisation was then performed but no significant improvements could be observed. The best classification model was then deployed to a web-based research conference system. The web-based system was developed using the Django web framework, based on a system architecture defined in this project called the Enhanced 3-Tier Web-based System with a Data Mining Layer. In conclusion, an automated paper classification system was successfully developed using classification models and its practical usage was demonstrated on a web-based research conference system to help chairmen in assigning papers to suitable reviewers. 2021 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/4096/1/1706067_FYP_report_%2D_NGAN_CHOON_HAN_SEON.pdf Ngan, Seon Choon Han (2021) An Automated System For Classifying Conference Papers. Final Year Project, UTAR. http://eprints.utar.edu.my/4096/ |
| spellingShingle | QA76 Computer software Ngan, Seon Choon Han An Automated System For Classifying Conference Papers |
| title | An Automated System For Classifying Conference Papers |
| title_full | An Automated System For Classifying Conference Papers |
| title_fullStr | An Automated System For Classifying Conference Papers |
| title_full_unstemmed | An Automated System For Classifying Conference Papers |
| title_short | An Automated System For Classifying Conference Papers |
| title_sort | automated system for classifying conference papers |
| topic | QA76 Computer software |
| url | http://eprints.utar.edu.my/4096/ http://eprints.utar.edu.my/4096/1/1706067_FYP_report_%2D_NGAN_CHOON_HAN_SEON.pdf |