Comparison of Imbalanced data classification efficiency to predict road accident deaths using data mining techniques

Main Article Content

Narin Jiwitan


This research has an objective to analyze the factors involved in road accident deaths and to compare the classification efficiency of road fatalities data of forecasting models. Consists of Gradient Boosted Tree, Decision Tree, Naïve Bayes, Random Forest, and Deep Learning which is a data mining technique based on the CRISP-DM methodology. Study and collect accident data 51,384-row from the Information and Communication Technology Center Office of the Permanent Secretary for Transport between 1 January 2019 - 30 June 2021. The data was prepared and cleaned before being analyzed and modeled. The results of the analysis of factors involved in road accident deaths by calculating the weight of relevant data or related attributes using the Gain Ratio method, it was found that the type of vehicle or road user weighted 0.1030. The results of the comparison of the classification efficiency of forecast model data by measuring the performance of the forecast model using the K-Fold Cross Validation method; it was divided into 30 fold, showed that the Naïve Bayes technique had the highest accuracy of 72.23% F1-measure of 73.35% and G-mean of 72.10%. The results of this research can be used as information to effectively prevent road accidents in each area, reduce risks, reduce impacts and prepare for road accident prevention that may occur.

Article Details



World Health Organization. Global Situation of Road Safety Report 2015. Global Situation of Road Safety Report. World Health Organization Publishers (WHO Press), 2015.

Road Safety Operation Center. Thailand Road Safety Master Plan 2018-2021. Thailand Road Safety Master Plan. 2019.

Traffic Injury Prevention Group Bureau of Non-Communicable Diseases. Road accident situation in 2014. 2015.

S. Sinlapasorn. Modeling used to predict osteoarthritis scores WOMAC of Post-Knee Replacement Surgery Patients with Characteristic Engineering and Machine Learning Techniques. Proceedings of the 25th Annual Meeting in Mathematics (AMM 2021). Department of Mathematics, Faculty of Science King Mongkut's Institute of Technology Ladkrabang, 2021.

ABB Corporate Research Center. What is Deep Learning?. Available Online at , accessed on 1 November 2021.

J. Disapon, and A. Sirisophon. “Factors affecting the prevention of road traffic accident among population in Sawang arom district, Uthai thani province: MIMIC model analysis”. Journal of Nurses Association of Thailand Northern Region, Vol. 27, pp. 101 – 112, 2021.

R. Kladchuen, and C. Saenrat. “An Efficiency Comparison of Algorithms and Feature Selection Methods to Predict the Learning Achievement of Vocational Students”. Research Journal Rajamangala University of Technology Thanyaburi, ISSN 1686-8420, Vol 17, No. 1, pp.1-10, 2018.

S. Sinsomboonthong. “An Efficiency Comparison in Prediction of Imbalanced Data Classification with Data Mining Techniques”. Thai Science and Technology Journal (TSTJ), Vol. 28 No. 3, pp.383 - 393, 2020.

A. Leelakajonjit, and P. Panudulkitti. Severity level from road traffic accidents. Road accident data management. Education Research and Development Center and Academic Center for Road Safety, Aksornthai Press (fahmuang Thai Newspaper) Limited Partnership, Thailand, 2015.

Bureau of Highway Safety. Time of incident. New Year Festival 2020 (During 27 Dec. 19 – 2 Jan. 20). 2020.

S. Dechapromphan. Geography of Thailand : Geographic Perspectives. Geoinformatics Faculty, Burapha University, Thailand, 2017.

W. Nuipian, and P. Meesut. “Comparison of filtering and consolidation techniques of text mining for text classification”. The Journal of Industrial Technology. Vol. 9, No. 3, pp.118 - 129, September –December 2013.

N. Klinjun, and N. Supphasri, “Asip Useng and Akeon Sawangnipun. A Case Study on Public Transportation and Road Safety: Why so Many Passengers' Injuries from Van Crashes onto Trees?”. The Southern College Network Journal of Nursing and Public Health, .pp. 26 – 38, Vol.8, No.2, May-August 2021.