A Comparative Study of the Applicability of Regression Models in Predicting Student Academic Performance
Main Article Content
Abstract
Educational Data Mining (EDM) has witnessed a surge in educational systems, as it enables the analysis and prediction of student performance, facilitating proactive measures. This paper aims to present a comparative study of regression prediction using eight mainstream models: Linear Regression, Ridge Regression, Lasso Regression, Huber Regression, Support Vector Regression (SVR), K-Nearest Neighbors Regression (KNN), Decision Tree Regression (DT), and Neural Network Regression. The applicability of these eight models is analyzed across different courses and semester GPAs, considering three distinct scenarios. Our thorough analysis underscores the substantial influence of data granularity and integrity on bolstering the precision of final CGPA predictions. In the third scenario where Semester GPAs were utilized, Lasso Regression achieved an R-value of 0.9901 with remarkably low RMSE and MAE, establishing its dominance across all scenarios. Neural Network Regression, with an R-value of 0.9832 and minimal error metrics in the same scenario, also demonstrated robust predictive capabilities. These insights highlight the imperative of tailoring regression model selection to align with specific scenario nuances and the targeted predictive precision.
Article Details
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
References
Albalooshi, F., Alobaidy, H., & Ghanim, A. (2019). Mining students outcomes: An empirical study. International Journal of Computing and Digital Systems, 8(3), 229–241. https://doi.org/10.12785/ ijcds/080303
Arsad, P. M., & Buniyamin, N. (2014, April). Neural Network and Linear Regression methods for prediction of students' academic achievement. In 2014 IEEE Global Engineering Education Conference (EDUCON) (pp. 916-921). IEEE. https://doi.org/10.1109/EDUCON.2014.6826206
Bravo, L. E. C., Molano, J. I. R., & Trujillo, E. R. (2020). Exploration of a system to determine the academic performance of engineering students through machine learning. International Journal of Advanced Science and Technology, 29(7), 11894–11905. http://sersc.org/journals/index.php/IJAST/article/view/27865
Bujang, S. D. A., Selamat, A., Ibrahim, R., Krejcar, O., Herrera-Viedma, E., Fujita, H., & Ghani, N. A. M. (2021). Multiclass prediction model for student grade prediction using machine learning. Ieee Access: Practical Innovations, Open Solutions, 9, 95608–95621. https://doi.org/10.1109/ACCESS.2021.3093563
Bydžovská, H. (2016). A Comparative Analysis of Techniques for Predicting Student Performance. International Educational Data Mining Society. https://doi.org/10.1109/FIE.2007.4417993
Cavazos, R., & Garza, S. E. (2017). Learning models for student performance prediction. Paper presented at the Mexican International Conference on Artifcial Intelligence. https://doi.org/10.1007/978-3-030-02840-4_14
Cutad, R. E. E., & Gerardo, B. D. (2019). A prediction-based curriculum analysis using the modifed artifcial Bee Colony Algorithm. International Journal of Advanced Computer Science and Applications, 10(10), 117–123. https://doi.org/10.14569/ijacsa.2019.0101017
El Aouifi, H., El Hajji, M., Es-Saady, Y., & Douzi, H. (2021). Predicting learner’s performance through video sequences viewing behavior analysis using educational data-mining. Education and Information Technologies, 26(5), 5799-5814. https://doi.org/10.1007/s10639-021-10512-4
Hsu Wang, F. (2019). On prediction of online behaviors and achievement using self-regulated learning awareness in fipped classrooms. International Journal of Information and Education Technology, 9(12), 874–879. https://doi.org/10.18178/ijiet.2019.9.12.1320
Hussain, S., & Khan, M. Q. (2021). Student-performulator: Predicting students’ academic performance at secondary and intermediate level using machine learning. Annals of Data Science. https://doi.org/10.1007/s40745-021-00341-0
Ibrahim, Z., & Rusli, D. (2007, September). Predicting students’ academic performance: comparing artificial neural network, decision tree and linear regression. In 21st Annual SAS Malaysia Forum, 5th September. https://www.researchgate.net/publication/228894873
Iyanda, A. R., Ninan, O. D., Ajayi, A. O., & Anyabolu, O. G. (2018). Predicting student academic performance in computer science courses: A comparison of neural network models. International Journal of Modern Education & Computer Science, 10(6).
https://dx.doi.org/10.5815/ijmecs.2018.06.01
Lyu, S. (2023, November). Student’s Academic Performance Prediction Based on Machine Learning Regression Models. In 2023 International Conference on Image, Algorithms and Artificial Intelligence (ICIAAI 2023) (pp. 293-299). Atlantis Press. https://doi.org/10.2991/978-94-6463-300-9_29
Kumar, A., Eldhose, K. K., Sridharan, R., & Panicker, V. V. (2020, July). Students' Academic Performance Prediction using Regression: A Case Study. In 2020 International Conference on System, Computation, Automation and Networking (ICSCAN) (pp. 1-6). IEEE. https://doi.org/10.1109/ICSCAN49426.2020.9262346
Kumar, A., Selvam, R., & Kumar, K. (2018). Review on prediction algorithms in educational data mining. International Journal of Pure and Applied Mathematics, 118, 531–536. https://www.acadpubl.eu/jsi/2018-118-7-9/articles/8/77.pdf
Ma, Y., Liu, B., Wong, C. K., Yu, P. S., & Lee, S. M. (2000, August). Targeting the right students using data mining. In Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 457-464). http://scholarbank.nus.edu.sg/handle/10635/39963
Mai, T. T., Bezbradica, M., & Crane, M. (2022). Learning behaviours data in programming education: Community analysis and outcome prediction with cleaned data. Future Generation Computer Systems, 127, 42–55. https://doi.org/10.1016/j.future.2021.08.026
Makombe, F., & Lall, M. (2020). A predictive model for the determination of academic performance in private higher education institutions. International Journal of Advanced Computer Science and Applications (IJACSA), 11(9). http://dx.doi.org/10.14569/IJACSA.2020.0110949
Mengash, H. A. (2020). Using data mining techniques to predict student performance to support decision making in university admission systems. IEEE Access, 8, 55462–55470. https://doi.org/10.1109/ACCESS.2020.2981905
Mutanu, L., & Machoka, P. (2019). Enhancing computer students’ academic performance through predictive modelling-a proactive approach. In Paper presented at the 2019 14th international conference on Computer Science & Education (ICCSE).
https://doi.org/10.1109/ICCSE.2019.8845452
Omer, U., Farooq, M. S., & Abid, A. (2020). Cognitive learning analytics using assessment data and concept map: A framework-based approach for sustainability of programming courses. Sustainability (Switzerland), 12(17). https://doi.org/10.3390/su12176990
Sa’ad, M. I., & Mustafa, M. S. (2020, October). Student Prediction of Drop Out Using Extreme Learning Machine (ELM) Algorithm. In 2020 2nd International Conference on Cybernetics and Intelligent System (ICORIS) (pp. 1–6). IEEE. https://doi.org/10.1109/ICORIS50180.2020.9320831
Tomasevic, N., Gvozdenovic, N., & Vranes, S. (2020). An overview and comparison of supervised data mining techniques for student exam performance prediction. Computers & Education, 143, 103676. https://doi.org/10.1016/j.compedu.2019.103676
Tuononen, T., & Parpala, A. (2021). The role of academic competences and learning processes in predicting Bachelor’s and Master’s thesis grades. Studies in Educational Evaluation, 70. https://doi.org/10.1016/j.stueduc.2021.101001
Vijayalakshmi, V., & Venkatachalapathy, K. (2019). Comparison of predicting student’s performance using machine learning algorithms. International Journal of Intelligent Systems and Applications, 11(12), 34. https://doi.org/10.5815/ijisa.2019.12.04
Waheed, H., Hassan, S.-U., Aljohani, N. R., Hardman, J., Alelyani, S., & Nawaz, R. (2020). Predicting academic performance of students from VLE big data using deep learning models. Computers in Human Behavior, 104, 106189. https://doi.org/10.1016/j.chb.2019.106189
Xiushan NIE, Yuling MA, Huiyan QIAO, Jie GUO, Chaoran CUI, Zhiyun YU, Xingbo LIU, & Yilong YIN. (2022). Surver on student academic performance prediction from the perspective of task granularity. Journal of Shandong University (Engineering Science), 52(2): 1-14.
https://doi.org/10.6040/j.issn.1672-3961.0.2021.489
Yang, S. J. H., Lu, O. H. T., Huang, A. Y. Q., Huang, J. C. H., Ogata, H., & Lin, A. J. Q. (2018). Predicting students’ academic performance using multiple linear regression and principal component analysis. Journal of Information Processing, 26, 170–176.