Bagging Model with Cost Sensitive Analysis on Diabetes Data

  • Punnee Sittidech Faculty of Science, Naresuan University, Phitsanulok, Thailand
  • Nongyao Nai-arun Faculty of Science and Technology, Nakhon Sawan Rajabhat University, Thailand
  • Ian T. Nabney Faculty of Engineering & Applied Science, Aston University, Birmingham,United Kingdom
Keywords: Rdiabetes, feature selection, classification, bagging, cost sensitive analysis


Diabates patients might suffer from an unhealthy life, long-term treatment and chronic complicated diseases. The decreasing hospitalization rate is a crucial problem for health care centers. This study combines the bagging method with base classifier decision tree and cost sensitive analysis for diabetes patients’ classification purpose. Real patients’ data collected from a regional hospital in Thailand were analyzed. The relevance factors were selected and used to construct base classifier decision tree models to classify diabetes and non-diabetes patients. The bagging method was then applied to improve accuracy. Finally, asymmetric classification cost matrices were used to give more alternative models for diabetes data analysis.


Download data is not yet available.