Developing a Credit Approval Determination Model Using Principal Component Analysis with Machine Learning Techniques

Main Article Content

Chonlada Muangthanang
Surasak Mungsing
Nivet Chirawichichai

Abstract

This research presents a model and tests the performance of a credit approval determination model by using principal component analysis with machine learning techniques. Four algorithms: Naïve Bayes, Decision Tree, Random Forest, and Support Vector Machine were tested with 441,335 examples of lending data. There are four model performance test results: Accuracy, Precision, Recall, and F-Measure.
From the experiment, it was found that the random forest model provides the best accuracy performance of 92.90 percent, followed by support vector machine and decision tree is 87.00 percent, and Naïve Bayes is 83.40 percent, respectively. It was found that reducing the data dimensions resulted in improved model performance by eliminating insignificant features and solve the problem of the traditional credit approval model that considers the old attribute variables. The model's performance has improved by considering new attribute variables. The results of the completeness value (Recall) and the overall performance measurement value (F-Measure) from the experiment found that Random forest model provides the best performance as well as accuracy values were 99.64 and 99.35 percent, respectively, and the highest precision value was 99.07 percent for the random forest model and decision trees.

Article Details

Section
Research Article

References

Committee for the Protection of Credit Information. “Basic Information about Credit Scoring.” (in Thai), CREDITINFOCOMMITTEE.or.th. https://www.creditinfocommittee.or.th/Public/BasicIntormation (accessed Mar. 1, 2023).

Supervision Group, Bank of Thailand. “Credit scoring, things that need to be preserved.” (in Thai), BOT.or.th. https://www.bot.or.th/th/research-and-publications/articles-and-publications/articles/Article_22Jan201.html (accessed Mar. 1, 2023).

P. Moeynoi. “Dimensionality Reduction.” (in Thai), MEDIUM.com. https://medium.com/@doohpim/02-การลดมิติข้อมูล-dimensionality-reduction-a5df2adb4f92 (accessed Sep. 7, 2022).

S. Teerawittayakul, “Ransomware detection study using machine learning techniques from a sample attack dataset,” (in Thai), Dept. Data Commun. Netw., King Mongkut's Univ. Technol. North Bangkok, Bangkok, Thailand, 2019.

PradyaSin. “What is Random Forest.” (in Thai), MEDIUM.com. https://medium.com/@pradyasin/random-forest-คืออะไร-74d2a0af3d7 (accessed Sep. 7, 2022).

B. Pasilatesang, Build Learning for AI with Python Machine Learning. Bangkok, Thailand: SE-EDUCATION (in Thai), 2021.

C. Ambi. “Difference Between Normalization and Standar-dization.” TOWARDSAI.net. https://pub.towardsai.net/difference-between-normalization-and-standardization-745030eaf96f (accessed Sep. 7, 2022).

E. Pacharawongsakda, Practical Data Science with RapidMiner Studio 1. Nonthaburi, Thailand: IDC Premier (in Thai), 2022.

Lending Club Loan Data: Complete loan data for all loans issued through the 2007-2015, including status, LendingClub 2007-2015, 2021. [Online]. Available: https://www.kaggle.com/datasets/adarshsng/lending-club-loan-data-csv

N. Promrita and S. Waijanya, Fundamental of DEEP LEARNING in Practice. Nonthaburi, Thailand: IDC Premier (in Thai), 2021.

T. Kaewsri. “Feature Engineering.” (in Thai), MEDIUM.com. https://grimreaperjunior.medium.com/สกัดข้อมูลให้ตรงใจด้วยการทำ-feature-engineering-bcdba80124bc (accessed Sep. 7, 2022).

S. Tidta and T. Kangkachit, “Automated system for employee welfare loan approval through machine learning techniques,” (in Thai), DPU Graduate Studies J., vol. 10, no. 2, pp. 93–107, 2022. [Online]. Available: https://grad.dpu.ac.th/upload/content/files/year10-2/10-7.pdf

B. Chitambira, “Credit scoring using machine learning approaches,” M.S. thesis, Division Math. Phys., Malardalen Univ., Vasteras, Sweden, Jun. 2022. [Online]. Available: https://www.diva-portal.org/smash/get/diva2:1664698/FULLTEXT01.pdf

C. Muangthanang, S. Mungsing, and N. Chirawichitchai, “Development of a credit scoring model using data mining techniques,” (in Thai), Huachiew Chalermprakiet Sci, Technol. J., vol. 7, no. 1, pp. 82–93, 2021.

R. Bai, H. Mo, and H. Li, “A comparison of credit rating classification models based on spark-evidence from lending-club,” Procedia Comput. Sci., vol. 162, pp. 811–818, 2019.