Breast Cancer Data Classification Using Ensemble Machine Learning

Main Article Content

Meerja Akhil Jabbar

Abstract

Breast cancer (BC) is the largest cause of death in women. Accurate classification of breast cancer data is important in cancer diagnosis and classification of Malignant and Benign tumors can prevent patients to take unnecessary tests. Breast cancer classification can also be used to determine suitable treatment. Classification of Benign and Malignant patient groups is widely recognized research in the medical field. Due to the advantage of detecting critical features from a medical data set, machine learning is widely used in Breast cancer Prediction. Recently there has been greater attention to the use of machine learning methods in medical diagnosis. These decision support systems are effective and helpful for medical experts in the healthcare domain. The objective of this work is to address the problem of the classification of breast cancer data using ensemble learning. Ensemble learning techniques are used to improve the performance of a classifier. This paper deals with building a decision support system using the ensemble model built with   Bayesian network and Radial Basis Function. In this work, extensive experiments were carried out on the much- studied open access data set “Wisconsin Breast Cancer Data set (WBCD)”. The data set is partitioned into training and testing. Various metrics like accuracy, sensitivity, specificity, positive predictive value, negative predicted value, Error rate, false-positive rate, Mathew’s correlation coefficient were used to measure the performance of the model. Experimental results show that the proposed method records a remarkable accuracy of 97% to classify breast cancer data and outperformed the existing approaches. The proposed ensemble learning would be viable in helping cancer specialists in recognizing cancer tumors accurately and help the patients in taking the correct treatment.

Article Details

How to Cite
Jabbar, M. A. . (2021). Breast Cancer Data Classification Using Ensemble Machine Learning. Engineering and Applied Science Research, 48(1), 65–72. Retrieved from https://ph01.tci-thaijo.org/index.php/easr/article/view/234959
Section
ORIGINAL RESEARCH

References

Qasem A, Sheikh Abdullah SNH, Sahran S, Iqbal Hussain R, Ismail F. An accurate rejection model for false positive reduction of mass localisation in mammogram. Pertanika J Sci Technol. 2017;25(S6):49-62.

DeSantis C, Siegel R, Bandi P, Jemal A. Breast cancer statistics. CA Cancer J clin. 2011;61:408-18.

Chen W, Zheng R, Baade PD, Zhang S, Zeng H, Bray F, et al. Cancer statistics in China 2015. CA Cancer J Clin. 2016;66(2):115-32.

Siegel RL, Miller KD, Jemal A. A cancer statistics 2016. CA Cancer J Clin.2016;66(1):7-30.

Torre LA, Bray F, Siegel RL, Ferlay J, Lortet-Tieulent J, Jemal A. A global cancer statistics 2012. CA Cancer J Clin.2015;65(2):87-108.

Yue W, Wang Z, Chen H, Payne A, Liu X. Machine learning with application in breast cancer diagnosis and prognosis. Designs. 2018;2:1-17.

MayoClinic. Breast cancer [Internet]. 2020 [cited 2020 Jan 1]. Available from: https://www.mayoclinic.org/diseases-conditions/breast-cancer/symptoms-causes/syc-20352470.

Medindia. Breast Cancer [Internet]. 2020 [cited 2020 Jan 1]. Available form: https://www.Medindia.net.

Al-Hadidi MR, Alarabeyyat A, Alhanahnah M. Breast cancer detection using K-nearest neighbormachine learning algorithm. 9th International Conference on Developments in eSystems Engineering; 2016 Aug 31 – Sep 2; Liverpool, UK. USA: IEEE; 2016. p. 35-9.

Khuriwal N, Mishra N. Breast cancer diagnosis using adaptive voting ensemble machine learning algorithm. 2018 IEEMA Engineer Infinite Conference (eTechNxT); 2018 Mar 13-14; New Delhi, India. USA: IEEE; 2018. p. 1-5.

Turgut S, Dağtekin M, Ensari T. Microarray breast cancer data classification using machine learning methods.2018 Electric Electronics, Computer Science, Biomedical Engineerings' Meeting (EBBT); 2018 Apr 18-19;Istanbul, Turkey. USA: IEEE; 2018. p. 1-4.

Asri H, Mousannif H, Moatassime HA, Noel T. Using machine learning algorithms for Breast cancer risk Prediction and Diagnosis. Procedia Comput Sci. 2016;83:1064-9.

Shailaja K, Seetharamulu B, Jabbar MA. Machine learning in healthcare: a review.2018 Second International Conference on Electronics, Communication and Aerospace Technology (ICECA); 2018 Mar 29-31; Coimbatore, India. USA: IEEE; 2018. p. 910-4.

Shailaja K, Seetharamulu B, Jabbar MA. Prediction of breast cancer using big data analytics. Int J Eng Tech. 2018;7(4.6):223-6.

Jabbar MA, Samreen S, Aluvalu R. The future of healthcare: machine learning. Int J Eng Tech. 2018;7(4.6):23-5.

Douangnoulack P, Boonjing V. Building minimal classification rules for breast cancer diagnosis. 2018 10th International Conference on Knowledge and Smart Technology (KST); 2018 Jan 31-Feb 3; Chiang Mai, Thailand. USA: IEEE; 2018. p. 278-81.

Fu MR, Wang Y, Li C, Qiu Z, Axelrod D, Guth AA, et al. Machine learning for detection of lymphedema among breast cancer survivors. MHealth. 2018;4:17.

Ragab DA, Sharkas M, Marshall S, Ren J. Breast cancer detection using deep convolutional neural networks and support vector machines. Peer J. 2019;7:e6201.

Agarap AFM. On breast cancer detection: an application of machine learning algorithms on the wisconsin diagnostic dataset. Proceedings of the 2nd International Conference on Machine Learning and Soft Computing; 2018 Feb 2-4; Phu Quoc Island, Viet Nam. New York: ACM; 2018. p. 5-9.

Pritom AI, Munshi MAR, Sabab SA, Shihab S. Predicting breast cancer recurrence using effective classification and feature selection technique. 19th International Conference on Computer and Information Technology (ICCIT); 2016 Dec 18-20; Dhaka, Bangladesh. USA: IEEE; 2016. p. 310-4.

Karthik S, Srinivasa Perumal R, Chandra Mouli PVSSR. Breast cancer classification using deep neural networks. In: Margret Anouncia S, Wiil U, editors. Knowledge Computing and Its Applications. Singapore: Springer; 2018. P. 227-41.

Ebrahim Ali EE, Feng WZ. Breast cancer classification using support vector machine and neural network. Int J Sci Res. 2016;5(3):1-6.

Singh S, Saini S, Singh M. Cancer detection using adaptive neural network. Int J Adv Res Tech. 2012;1(4):93-7.

Liu L. Research onlogisticregression algorithm of breast cancer diagnosis data by machine learning. 2018 International Conference on Robots & Intelligent System (ICRIS); 2018 May 26-27; Changsha, China. USA: IEEE; 2018. p 157-60.

Wadkar K, Pathak P, Wagh N. Breast cancer detection using ANN network and performance analysis with SVM. Int J Comput Eng Tech. 2019;10(3):75-86.

Cowsik A, Clark JW. Breast cancer diagnosis by higher order probabilistic perceptrons. arXiv: 1912.06969. 2019:1-17.

Murugan S, Kumar BM, Amudha S. Classification and prediction of breast cancer using linear regression, decision tree and random forest. 2017 International Conference on Current Trends in Computer, Electrical, Electronics and Communication (CTCEEC); 2017 Sep 8-9; Mysore, India. USA: IEEE; 2017. p. 763-6.

Kumar UK, Nikhil MBS, Sumangali K. Prediction of breast cancer using voting classifier technique. 2017 IEEE International Conference on Smart Technologies and Management for Computing, Communication, Controls, Energy and Materials (ICSTM); 2017 Aug 2-4; Chennai, India. USA: IEEE; 2017. p. 108-14.

Wolberg WH, Street WN, Mangasarian OL. Breast Cancer Wisconsin (Diagnostic) Data Set [Internet]. 2020 [cited 2020 Jan 2] Available from: https://archive.ics.uci.edu/ml/datasets/breast+cancer+wisconsin+(diagnostic).

Karabatak M, Ince MC. An expert system for detection of breast cancer based on association rules and neural network. Expert Syst Appl. 2009;36(2):3465-9.

Seera M, Lim CP. A hybrid intelligent system for medical data classification. Expert Syst Appl. 2014;41(5):2239-49.

Chen HL, Yang B, Liu J, Liu DY. A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert Syst Appl. 2011;38(7):9014-22.

Bayrak EA, Kirci P, Ensari T. Comparison of machine learning methods for breast cancer diagnosis.2019 Scientific Meeting on Electrical-Electronics & Biomedical Engineering and Computer Science (EBBT); 2019 Apr 24-26; Istanbul, Turkey. USA: IEEE; 2019. p. 1-3.

Chaurasia V, Pal S, Tiwari B. Prediction of benign and malignant breast cancer using data mining techniques. J Algorithm Comput Tech. 2018;12(2):119-26.

Polat K, Güneş S. Breast cancer diagnosis using least square support vector machine. Digit Signal Process. 2007;17(4):694-701.

Mert A, Kilic N, Akan A. Breast cancer classification by using support vector machines with reduced dimension. Proceedings ELMAR-2011; 2011 Sep 14-16; Zadar, Croatia. USA: IEEE; 2011. p. 37-40.

Amrane M, Oukid S, Gagaoua I, Ensarİ T. Breast cancer classification using machine learning. 2018 Electric Electronics, Computer Science, Biomedical Engineerings' Meeting (EBBT); 2018 Apr 18-19; Istanbul, Turkey. USA: IEEE; 2018. p. 1-4.