YOLO-Based Object Detection for Locating Handwritten X-Marks on Multiple-Choice Answer Sheets
Keywords:
Automatic X-mark detection, Object detection, Deep learning, Convolution neural network, YOLOAbstract
Student assessment is considered a measure of learning achievement. Among the various assessment tools available, multiple-choice tests are commonly used, allowing students to mark their answers by filling or marking crosses on provided answer sheets. However, this method presents significant challenges, particularly when dealing with a large number of questions, as it may require additional personnel for grading, leading to delays in the assessment process. Additionally, manual grading can introduce errors, resulting in inaccuracies in score calculation. This study aims to address these challenges by developing a mathematical model capable of automatically detecting crosses on multiple-choice answer sheets using object detection techniques and deep learning. The model is based on the You Only Look Once version 8 (YOLOv8) technique for object detection. It employs a Convolutional Neural Network (CNN) for cross detection. The performance of the model is evaluated using metrics such as Precision, Recall, and F1-score. The results indicate that at a Threshold value of 0.496, the model achieves the highest F1-score of 0.989. This suggests that the model is effective in automatically detecting and identifying cross markings on multiple-choice answer sheets.
References
Alomran, M., & Chai, D. (2018). Automated scoring system for multiple choice test with quick feedback. International Journal of Information and Education Technology, 8(8), 538-545. https://doi.org/10.18178/ijiet.2018.8.8.1096
Chinnasarn, K., & Rangsanseri, Y. (1998). A multiple choice test checking system that uses the principles of image processing. The 36th Kasetsart University Academic Conference, 218. (in Thai)
de Elias, E. M., Tasinaffo, P. M., & Hirata Jr, R. (2021). Optical mark recognition: Advances, difficulties, and limitations. SN Computer Science, 2(5), 367.
Deepak Kumar P. (2023). Cloud computing. International Scientific Journal of Engineering and Management, 02. https://doi.org/10.55041/ISJEM00289
Deng, H., Wang, F., & Liang, B. (2008, December). A low-cost OMR solution for educational applications. In 2008 IEEE international symposium on parallel and distributed processing with applications.pp. 967-970. https://doi.org/10.1109/ISPA.2008.130
Fisteus, J. A., Pardo, A., & Garcí a, N. F. (2013). Grading multiple choice exams with low-cost and portable computer-vision techniques. Journal of Science Education and Technology, 22, 560-571. https://doi.org/10.1007/s10956-012-9414-8
He, K., Zhang, X., Ren, S., & Sun, J. (2015). Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE transactions on pattern analysis and machine intelligence, 37(9), 1904-1916.
Hussmann, S., & Deng, P. W. (2005). A high-speed optical mark reader hardware implementation at low cost using programmable logic. Real-Time Imaging, 11(1), 19-30. https://doi.org/10.1016/j.rti.2005.03.001
Jocher, G., Chaurasia, A., & Qiu, J. (2023). Ultralytics YOLO (8.0.0) [Computer software]. https://github.com/ultralytics/ultralytics
Kingma, D. P., & Ba, J. (2017). Adam: A Method for Stochastic Optimization, Published as a conference paper at ICLR 2015. https://doi.org/10.48550/arXiv.1412.6980
Kumar, A., Singal, H., & Bhavsar, A. (2018). Cost effective real-time image processing based optical mark reader. World Acad. Sci. Eng. International Journal of Information Technology and Computer Engineering, 12(9), 787-791.
Lin, W., Hasenstab, K., Moura Cunha, G., & Schwartzman, A. (2020). Comparison of handcrafted features and convolutional neural networks for liver MR image adequacy assessment. Scientific Reports, 10(1), 20336. https://doi.org/10.1038/s41598-020-77264-y
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C. Y., & Berg, A. C. (2016). Ssd: Single shot multibox detector. In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I14. pp. 21-37. Springer International Publishing. https://doi.org/10.1007/978-3-319-46448-0_2
Nguyen, T. D., Manh, Q. H., Minh, P. B., Thanh, L. N., & Hoang, T. M. (2011). Efficient and reliable camera based multiple-choice test grading system. In The 2011 International Conference on Advanced Technologies for Communications (ATC 2011). pp. 268-271. https://doi.org/10.1109/ATC.2011.6027482
Phatkrajang, P. (2010). Thesis database system, Department of Computer Engineering, Faculty of Engineering, Kamphaeng Saen. http://cpe.eng.kps.ku.ac.th/db_cpeproj/paperShowFile.php?id_pro=66 (in Thai)
Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 779-788.
Sanguansat, P. (2015, June). Robust and low-cost Optical Mark Recognition for automated data entry. In 2015 12th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON). pp. 1-5. https://doi.org/10.1109/ECTICon.2015.7206937
Spadaccini, A., & Rizzo, V. (2011). A multiple-choice test recognition system based on the gamera framework. arXiv preprint arXiv:1105.3834. https://doi.org/10.48550/arXiv.11005.3834
Tan-ut, P. (2017). Development of a program for checking multiple choice answer sheets using image processing. Science and Technology Nakhon Sawan Rajabhat University Journal, 9(10). (in Thai)
Tinh, P. D., & Minh, T. Q. (2024). Automated Paper-basedMultiple Choice Scoring Framework using Fast Object Detection Algorithm. International Journal of Advanced Computer Science & Applications, 15(1). https://doi.org/10.14569/IJACSA.2024.01501115
Wang, C. Y., Liao, H. Y. M., Wu, Y. H., Chen, P. Y., Hsieh, J. W., & Yeh, I. H. (2020). CSPNet: A new backbone that can enhance learning capability of CNN. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops. pp. 390-391.

Downloads
Published
Issue
Section
Categories
License
Copyright (c) 2024 Journal of Applied Science and Emerging Technology

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.