Developing a Credit Approval Determination Model Using Principal Component Analysis with Machine Learning Techniques

Main Article Content

Chonlada Muangthanang
Surasak Mungsing
Nivet Chirawichichai

Abstract

This research presents a model and tests the performance of a credit approval determination model using principal component analysis with machine learning techniques. Four algorithms were tested: Naïve Bayes, Decision Tree, Random Forest, and Support Vector Machine. 441,335 examples of lending data. There are 4 model performance test results: Accuracy, Precision, Recall, and F-Measure.
The result of Accuracy from the experiment found that Random Forest model The best efficiency is 92.9%, followed by Support Vector Machine and Decision Tree was 87.0% and Naïve Bayes was 83.4%, respectively. It was found that reducing the data dimensions resulted in improved model performance by eliminating insignificant features and solves the problem of the traditional credit approval model that considers the old attribute variables. The model has improved performance by considering new attribute variables.
The results of the completeness Recall and the overall performance F-Measure from the experiment showed that Random Forest model provides the best performance as well as Accuracy was 99.64% and 99.35%, respectively, and the highest Precision was 99.07% for Random Forest Model and Decision Tree.

Article Details

Section
Research Article

References

Committee for the Protection of Credit Information. “Basic Information about Credit Scoring.” (in Thai), CREDITINFOCOMMITTEE.or.th. https://www.creditinfocommittee.or.th/Public/BasicIntormation (accessed Mar. 1, 2023).

Supervision Group, Bank of Thailand. “Credit scoring, things that need to be preserved.” (in Thai), BOT.or.th. https://www.bot.or.th/th/research-and-publications/articles-and-publications/articles/Article_22Jan201.html (accessed Mar. 1, 2023).

P. Moeynoi. “Dimensionality Reduction.” (in Thai), MEDIUM.com. https://medium.com/@doohpim/02-การลดมิติข้อมูล-dimensionality-reduction-a5df2adb4f92 (accessed Sep. 7, 2022).

S. Teerawittayakul, “Ransomware detection study using machine learning techniques from a sample attack dataset,” (in Thai), Dep. Data Commun. Netw., King Mongkut's Univ. Technol. North Bangkok, Bangkok, Thailand, 2019.

PradyaSin. “What is Random Forest.” (in Thai), MEDIUM.com. https://medium.com/@pradyasin/random-forest-คืออะไร-74d2a0af3d7 (accessed Sep. 7, 2022).

B. Pasilatesang, Build Learning for AI with Python Machine Learning. Bangkok, Thailand: SE-EDUCATION (in Thai), 2021.

C. Ambi. “Difference Between Normalization and Standar-dization.” TOWARDSAI.net. https://pub.towardsai.net/difference-between-normalization-and-standardization-745030eaf96f (accessed Sep. 7, 2022).

E. Pacharawongsakda, Practical Data Science with Rapid Miner Studio 1. Nonthaburi, Thailand: IDC Premier (in Thai), 2022.

Lending Club Loan Data: Complete loan data for all loans issued through the 2007-2015, including status, LendingClub 2007-2015, 2021. [Online]. Available: https://www.kaggle.com/datasets/adarshsng/lending-club-loan-data-csv

N. Promrita and S. Waijanya, Fundamental of DEEP LEARNING in Practice. Nonthaburi, Thailand: IDC Premier (in Thai), 2021.

T. Kaewsri. “Feature Engineering.” (in Thai), MEDIUM.com. https://grimreaperjunior.medium.com/สกัดข้อมูลให้ตรงใจด้วยการทำ-feature-engineering-bcdba80124bc (accessed Sep. 7, 2022).

S. Tidta and T. Kangkachit, “Automated system for employee welfare loan approval through machine learning techniques,” (in Thai), DPU Graduate Studies J., vol. 10, no. 2, pp. 93–107, 2022. [Online]. Available: https://grad.dpu.ac.th/upload/content/files/year10-2/10-7.pdf

B. Chitambira, “Credit scoring using machine learning approaches,” M.S. thesis, Division Math. Phys., Malardalen Univ., Vasteras, Sweden, Jun. 2022. [Online]. Available: https://www.diva-portal.org/smash/get/diva2:1664698/FULLTEXT01.pdf

C. Muangthanang, S. Mungsing, and N. Chirawichitchai, “Development of a credit scoring model using data mining techniques,” (in Thai), Huachiew Chalermprakiet Sci, Technol. J., vol. 7, no. 1, pp. 82–93, 2021.

R. Bai, H. Mo, and H. Li, “A comparison of credit rating classification models based on spark-evidence from lending-club,” Procedia Comput. Sci., vol. 162, pp. 811–818, 2019.