Developing a Prediction Model for Coronavirus-2019 infections Base on Patient Symptoms
Main Article Content
Abstract
The objectives of this research are: 1) to develop a prediction model for coronavirus-2019 infection from patient symptoms using data mining techniques 2) to compare and evaluate the performance of the model. The patient condition dataset was collected according to the guidelines of the World Health Organization and the All India Institute of Medical Sciences (AIIMS). This dataset contains a total of 5,434 records, which are divided into 4,348 records for learning and 1,086 records for testing. 10 significant features were carefully selected from a list of 21 to be used in the data mining algorithm's process of learning. The CRIPS-DM approach served as the main research methodology for this study. The learning methodology consists of 8 algorithms: Random Forest, Naïve Bayes, Support Vector Machine, Neural Network, Ada-Boot, K-Nearest Neighbor, Logistic Regression, and Decision Trees. Learning processes and prediction models' performance is evaluated through classification accuracy, precision, sensitivity, and specificity. The study's findings showed that neural network algorithms provide the best learning performance. When various parameters were adjusted in this approach, the prediction model achieved the highest efficiency of 91.40% with the learning rate set to 0.1, ReLU as the activation function, and the maximum number of iterations set to 200. To summarize the results were that, the development of a prediction model for coronavirus-2019 infection based on patient symptoms using a neural network algorithm has shown best predictive performance.
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Articles published in Journal of Industrial Technology Ubon Ratchathani Rajabhat University both hard copy and electronically are belonged to the Journal.
References
B. Unim, N. Schutte, M. Thissen, and L. Palmieri, “Innovative Methods Used in Monitoring COVID-19 in Europe: A Multinational Study,” International Journal of Environmental Research and Public Health, vol. 20, no. 1, pp. 1-15, Dec. 2022, doi: 10.3390/ijerph20010564.
K. H. Chaiyachati et al., “Patient and clinician perspectives of a remote monitoring program for COVID-19 and lessons for future programs,” BMC Health Services Research, vol. 23, no. 1, pp. 1-10, Jun. 2023, doi: 10.1186/s12913-023-09684-1.
S. L. Ng et al., “Focused review: potential rare and atypical symptoms as indicator for targeted COVID-19 screening,” Medicina, vol. 57, no. 2, pp. 1-10, 2021, doi: 10.3390/medicina57020189.
Y. Azeli et al., “A machine learning COVID-19 mass screening based on symptoms and a simple olfactory test,” Scientific Reports, vol. 12, no. 1, pp. 1-9, Sep. 2022.
E. J. Chow et al., “Symptom screening at illness onset of health care personnel with SARS-CoV-2 infection in King County,” JAMA, vol. 323, no. 20, pp. 2087-2089, Apr. 2020.
P. Trimankha et al., “Utility of screening chest radiographs in patients with asymptomatic and mildly symptomatic COVID-19 at a field hospital in Samut Sakhon, Thailand,” The ASEAN Journal of Radiology, vol. 22, no. 2, pp. 5-20, May-Aug. 2021, doi: 10.46475/aseanjr.v22i2.119.
A. Bhargava, A. Bansal, and V. Goyal, “Machine learning-based automatic detection of novel coronavirus (COVID-19) disease,” Multimedia Tools and Applications, vol. 81, no. 10, pp. 13731-13750, Feb. 2022, doi: 10.1007/s11042-022-12508-9.
N. K. Chowdhury, M. A. Kabir, M. M. Rahman, and S. M. S. Islam, “Machine learning for detecting COVID-19 from cough sounds: An ensemble-based MCDM method,” Computers in Biology and Medicine, vol. 145, no. 105405, pp. 1-14, Mar. 2022, doi: 10.1016/j.compbiomed.2022.105405.
N. N. M. Azam, M. A. Ismail, M. S. Mohamad, A. O. Ibrahim, and S. Jeba, “Classification of COVID-19 Symptoms Using Multilayer Perceptron,” Iraqi Journal For Computer Science and Mathematics, vol. 4, no. 4, pp. 100-110, Oct. 2023, doi: 10.52866/ijcsm.2023.04.04.009.
A. C. Utku, G. Budak, O. Karabay, E. Güçlü, H. D. Okan, and A. Vatan, “Main symptoms in patients presenting in the COVID-19 period,” Scottish Medical Journal, vol. 65, no. 4, pp. 127-132, 2020.
M. A. Maqbali, K. A. Badi, M. A. Sinani, N. Madkhali, and G. L. Dickens, “Clinical features of COVID-19 patients in the first year of pandemic: a systematic review and meta-analysis,” Biological Research for Nursing, vol. 24, no. 2, pp. 172-185, Dec. 2022, doi: 10.1177/10998004211055866.
M. K. Yusof, W. M. A. F. W. Hamzah, and N. S. M. Rusli, “Efficiency of hybrid algorithm for COVID-19 online screening test based on its symptoms,” Indonesian Journal of Electrical Engineering and Computer Science, vol. 25, no. 1, pp. 440-449, Jun. 2022.
T. Mauritsius, A. S. Braza, and Fransisca, “Bank marketing data mining using CRISP-DM approach,” International Journal of Advanced Trends in Computer Science and Engineering, vol. 8, no. 5, pp. 2322-2329, Oct. 2019, doi: 10.30534/ijatcse/2019/71852019.
F. Binsar and T. Mauritsius, “Mining of Social Media on Covid-19 Big Data Infodemic in Indonesia,” Journal of Computer Science, vol. 16, no. 11, pp. 1598-1609, Nov. 2020.
F. Y. Lan et al., “COVID-19 symptoms predictive of healthcare workers’ SARS-CoV-2 PCR results,” PLoS One, vol. 15, no. 6, pp. 1-12, Jun. 2020, doi: 10.1371/journal.pone.0235460.
H. Harikrishnan, “Symptoms and COVID Presence (May 2020 data).” Kaggle. Accessed: Sep. 20, 2022. [Online]. Available: https://www.kaggle.com/datasets/hemanthhari/symptoms-and-covid-presence
M. S. Amin, Y. K. Chiam, and K. D. Varathan, “Identification of significant features and data mining techniques in predicting heart disease,” Telematics and Informatics, vol. 36, pp. 82-93, Nov. 2018, doi: 10.1016/j.tele.2018.11.007.
V. K. Gupta, A. Gupta, D. Kumar, and A. Sardana, “Prediction of COVID-19 confirmed, death, and cured cases in India using random forest model,” Big Data Mining and Analytics, vol. 4, no. 2, pp. 116-123, Feb. 2021, doi: 10.26599/BDMA.2020.9020016.
T. Sher, A. Rehman, and D. Kim, “COVID-19 Outbreak Prediction by Using Machine Learning Algorithms,” Computers, Materials & Continua, vol. 75, no. 1, pp. 1561-1574, 2023, doi: 10.32604/cmc.2023.032020.
L. Chaves and G. Marques, “Data mining techniques for early diagnosis of diabetes: a comparative study,” Applied Sciences, vol. 11, no. 5, pp. 2218, Mar. 2021.