Predicting students’ performance in mathematics subjects at Kolej MARA Banting using machine learning methods

Bibliographic Details
Main Authors: Ahmad Akif, Ibrahim; Nor Azuana, Ramli; Sahimel Azwal, Sulaiman
Format: Article
Language: English
Published: Universiti Pendidikan Sultan Idris 2025
Subjects: L Education (General); LB Theory and practice of education; QA Mathematics
Online Access: https://umpir.ump.edu.my/id/eprint/45505/
author Ahmad Akif, Ibrahim
Nor Azuana, Ramli
Sahimel Azwal, Sulaiman
building UMP Institutional Repository
collection Online Access
description Predicting students’ performance is crucial for personalised learning and the educational success of individual students. However, there is no standard procedure or method that considers external factors when predicting students’ performance in mathematics at Kolej MARA Banting (KMB). This research aims to address this problem by exploring the potential of machine learning methods for predicting students’ performance in mathematics at KMB. The study follows a machine learning process: data collection, attribute selection, pre-processing, model training, and evaluation. A sample of 703 data points covering students’ demographics, academic records, and mathematics performance was collected and pre-processed. Machine learning models including support vector machine, decision tree, k-nearest neighbours, Naïve Bayes, Random Forest, AdaBoost, and a stacking model were applied in this study. The models’ accuracy and performance were assessed to determine which model outperformed the others and how effectively it predicted students’ mathematics performance. The findings show that the stacking model outperformed the other models in accuracy (71.43%), precision (68.73%), recall (71.43%), and F1-score (69.80%). Nevertheless, it is essential to note that the stacking model achieved only moderate accuracy. This could be attributed to the inherent difficulty of constructing a precise predictive model for student performance: the models may fail to reflect the complexities within the dataset sufficiently, resulting in underfitting. Additionally, the target attribute, International Baccalaureate (IB) grade, is imbalanced, with more high performers than low performers, which biases the models towards the majority class and affects overall accuracy.
The performance of the models in this study could be improved by adding more features related to students’ performance, such as anxiety, depression, and well-being, so that the data capture enough complexity. It is also suggested that samples be obtained from other colleges whose grade distribution is more balanced than that of students at KMB.
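The reported accuracy and recall are identical (71.43%), which is what support-weighted averaging over classes would produce; the record does not state which averaging the authors used, so that is an assumption here. The sketch below uses a hypothetical three-class confusion matrix (not the study's data) to show how accuracy, weighted precision, recall, and F1-score can be computed, and how a majority-class baseline indicates the effect of an imbalanced target. The function names and the matrix are illustrative only.

```python
from typing import Dict, List


def weighted_metrics(confusion: List[List[int]]) -> Dict[str, float]:
    """Accuracy plus support-weighted precision, recall, and F1.

    confusion[i][j] counts samples whose true class is i and predicted class is j.
    """
    n_classes = len(confusion)
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[k][k] for k in range(n_classes))

    precision = recall = f1 = 0.0
    for k in range(n_classes):
        support = sum(confusion[k])                    # true members of class k
        predicted = sum(row[k] for row in confusion)   # samples predicted as class k
        tp = confusion[k][k]
        p_k = tp / predicted if predicted else 0.0
        r_k = tp / support if support else 0.0
        f_k = 2 * p_k * r_k / (p_k + r_k) if (p_k + r_k) else 0.0
        weight = support / total                       # class share of all samples
        precision += weight * p_k
        recall += weight * r_k
        f1 += weight * f_k

    return {
        "accuracy": correct / total,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


def majority_baseline_accuracy(confusion: List[List[int]]) -> float:
    """Accuracy obtained by always predicting the most frequent true class."""
    supports = [sum(row) for row in confusion]
    return max(supports) / sum(supports)


# Hypothetical confusion matrix (rows: true low/middle/high grade band),
# skewed towards high performers. These numbers are NOT from the study.
EXAMPLE = [
    [3, 2, 1],
    [2, 9, 9],
    [0, 5, 39],
]

if __name__ == "__main__":
    for name, value in weighted_metrics(EXAMPLE).items():
        print(f"{name}: {value:.4f}")
    print(f"majority baseline: {majority_baseline_accuracy(EXAMPLE):.4f}")
```

Under weighted averaging, recall always equals accuracy, because each class's recall is weighted by its share of the samples. Comparing a model's accuracy with the majority-class baseline gives a rough sense of how much of the score comes from the imbalance alone.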
first_indexed 2025-11-15T04:00:41Z
format Article
id ump-45505
institution Universiti Malaysia Pahang
institution_category Local University
language English
last_indexed 2025-11-15T04:00:41Z
publishDate 2025
publisher Universiti Pendidikan Sultan Idris
recordtype eprints
repository_type Digital Repository
spelling ump-45505 2025-08-28T07:12:51Z https://umpir.ump.edu.my/id/eprint/45505/
Universiti Pendidikan Sultan Idris 2025-04-28 Article PeerReviewed pdf en cc_by_nc_sa_4 https://umpir.ump.edu.my/id/eprint/45505/1/2.pdf
Ahmad Akif, Ibrahim and Nor Azuana, Ramli and Sahimel Azwal, Sulaiman (2025) Predicting students’ performance in mathematics subjects at Kolej MARA Banting using machine learning methods. Jurnal Pendidikan Sains Dan Matematik Malaysia, 15 (1). pp. 19-31. ISSN 2600-9307. (Published)
https://doi.org/10.37134/jpsmm.vol15.1.2.2025
title Predicting students’ performance in mathematics subjects at Kolej MARA Banting using machine learning methods
topic L Education (General)
LB Theory and practice of education
QA Mathematics
url https://umpir.ump.edu.my/id/eprint/45505/