Applying Data Mining Techniques to Develop a Model for Predicting Suitable Academic Fields for University Applicants at Kamphaeng Phet Rajabhat University
Main Article Content
Abstract
This research aimed to develop predictive models for determining the suitability of academic programs for undergraduate students by applying five data mining techniques: Decision Tree (DT), Naive Bayes, Support Vector Machine (SVM), Random Forest, and AdaBoost. The study utilized a dataset of 1,392 students across 10 academic programs at Kamphaeng Phet Rajabhat University. Students’ academic performance was grouped into four levels include Excellent (A), Good (B), Fair (C), and Low (E). The experimental results indicated that Naive Bayes achieved the highest average F1-Score in 6 out of 10 programs, particularly in programs with simple data structures and clearly separable classes. In contrast, SVM performed well in programs with complex or overlapping data structures, while Random Forest demonstrated outstanding performance in handling high-variance data, especially in the General Management program, where it achieved the highest F1-Score of 0.75. The findings suggest that selecting an appropriate model should consider the underlying structure of the dataset in each specific context. Although Naive Bayes yielded the best overall results, data overlapping between classes in several programs remained a limiting factor, resulting in moderate classification accuracy. Future research should consider incorporating behavioral, interest-based, or skill-related features to enhance prediction accuracy and support educational guidance systems that better align with each student’s potential.
Article Details
References
Academic Promotion and Registration Office, Report: Student Academic Performance Data 2020 of Kamphaeng Phet Rajabhat University, 2020 (in Thai).
W. Matchai and W. Buathong, "A study of factors affecting the selection of vocational certificate students to continue in higher vocational education using data mining techniques, Chandrakasem Rajabhat University Journal of Science and Academic Research, vol. 33, no. 1, pp. 1-10, Jan.-Jun. 2023 (in Thai).
C. Wilkin, A. Ferreira, K. Rotaru, and L. R. Gaerlan, "Big data prioritization in SCM decision-making: Its role and performance implications," International Journal of Accounting Information Systems, vol. 100470, 2020. [Online]. Available: https://doi.org/10.1016/j.accinf.2020.100470.
I. A. T. Hashem, I. Yaqoob, N. B. Anuar, S. Mokhtar, A. Gani, and S. U. Khan, "The rise of 'big data' on cloud computing: Review and open research issues," Report, Information Systems, vol. 47, pp. 98–115, 2015. [Online]. Available: https://doi.org/10.1016/j.is.2014.07.006.
S. Ahmed, M. A. Khan, and M. Rashid, "Classification techniques in data mining for student performance prediction," Computers in Human Behavior, vol. 85, pp. 78–85, 2018.
C. Romero and S. Ventura, "Educational data mining: A review of the state of the art," IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 50, no. 3, pp. 601–618, 2020.
E. Osmanbegovic and M. Suljic, "Data mining approach for predicting student performance," Economic Review – Journal of Economics and Business, vol. 10, no. 1, pp. 3–12, May 2012.
H. Yao, D. Lian, Y. Cao, Y. Wu, and T. Zhou, "Predicting academic performance for college students: A campus behavior perspective," ACM Transactions on Intelligent Systems and Technology, vol. 10, pp. 1–21, 2019.
J. R. Quinlan, "Induction of Decision Trees," Machine Learning, vol. 1, no. 1, pp. 81–106, 1986.
A. Pinathe, "The Use of Data Mining in Selecting Areas of Study for Further Education Opportunity," Journal of Science and Technology Mahasarakham University, vol. 36, no. 6, pp. 704–712, 2017 (in Thai).
J. Laohawaranan, R. Limsuthiwanphoom, and B. Thanasophon, "Using data mining techniques for classifying and selecting study branches for technology faculty students," KMITL Journal of Information Technology, vol. 4, no. 2, pp. 1–9, 2015 (in Thai).
N. Napaporn Chongkasikit, " An Application of FP-Growth Algorithm to Find Factors in Choosing to Study in the Faculty of Industrial Technology Lampang Rajabhat University," Industrial Technology Lampang Rajabhat University Journal, vol. 11, no. 2, pp. 29–39, 2018 (in Thai).
K. Hengpraprom, S. Hengpraprom, and S. Makwiboonchai, " The Knowledge Discovery of Students’ Critical Characteristics towards Learning Achievement of Nakhon Pathom Rajabhat University Students’ Program in Computer via Data Mining Technique," Journal of Western Rajabhat Universitie, vol. 9, no. 1, pp. 71–80, 2014 (in Thai).
P. Eakasit, An Introduction to Data Mining Techniques, Cube Edu-Scene, 2015 (in Thai).
T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed. New York: Springer, 2009.
L. McInnes and J. Healy, "UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction," arXiv preprint arXiv:1802.03426, 2018. [Online]. Available: https://doi.org/10.48550/arXiv.1802.03426
V. Vapnik, Statistical Learning Theory, New York: Wiley, 1998.
P. Phetkong and C. Biaokaimuk, "MACHINE LEARNING BASED PREDICTION ANALYSIS FOR THE SERVICE LEVEL AGREEMENT: A CASE STUDY OF DHANARAK ASSET DEVELOPMENT CO., LTD," SAU Journal of Science & Technology, vol. 9, no. 2, pp. 66–76, Jul.–Dec. 2023 (in Thai).