วิธีจำแนกกลุ่มการเป็นโรคมะเร็งเต้านมด้วยวิธีการทางสถิติ (Classification Groups For Breast Cancer With Statistical Method)

เกรียง กิจบำรุงรัตน์ (Krieng Kitbumrungrat)

PDF

Published: Aug 3, 2017

Keywords:

ตัวแบบการถดถอยลอจิสติก ตัวแบบการวิเคราะห์การจำแนกกลุ่ม สถิติทดสอบวอลด์ สถิติทดสอบวิลค์ แลมด้า Logistic Regression Model Discriminant Model Wald statistic Wilks' Lambda

เกรียง กิจบำรุงรัตน์ (Krieng Kitbumrungrat)

มหาวิทยาลัยราชภัฎธนบุรี

Abstract

การวิจัยครั้งนี้เป็นการหาตัวแบบการจำแนกกลุ่มผู้ป่วยที่ได้รับการตรวจจากแพทย์พบว่าผู้ป่วยเป็นโรคมะเร็งเต้านมหรือผู้ป่วยไม่เป็นโรคมะเร็งเต้านม โดยพิจารณาตามลักษณะเซลล์เนื้อร้ายที่เจริญเติบโตผิดปกติของโรคมะเร็งเต้านม ซึ่งมีตัวแปรอิสระ คือ เซลล์เนื้อร้ายที่เจริญเติบโตผิดปกติ ได้แก่ ความหนาของก้อนเนื้อ (Clump Thickness (X₁)), ความสม่ำเสมอของขนาดเซลล์ (Uniformity of Cell Size (X₂)), ความสม่ำเสมอของรูปร่างเซลล์ (Uniformity of Cell Shape (X₃)), การเกาะติดขอบของเซลล์ (Marginal Adhesion (X₄)), ขนาดเซลล์เดียว (Single Epithelial Cell Size (X₅)), นิวเคลียสไม่ถูกห้อหุ้ม (Bare Nuclei (X₆)), โครมาตินเฉพาะ (Bland Chromatin (X₇)), นิวคลีโอไลในภาวะปกติ (Normal Nucleoli (X₈)), และการขยายตัวของเซลล์ (Mitoses (X₉)), ส่วนตัวแปรตามคือผลการตรวจจากแพทย์พบว่าผู้ป่วยเป็นโรคมะเร็งเต้านมหรือผู้ป่วยไม่เป็นโรคมะเร็งเต้านม โดยใช้เทคนิคการวิเคราะห์การถดถอยลอจิสติก (Logistic Regression Analysis) และวิธีการวิเคราะห์การจำแนกกลุ่ม (Discriminant Analysis) จะสรุปได้ว่า ตัวแบบ Logistic Regression มีอำนาจจำแนกได้ถูกต้องร้อยละ 96.90 สูงกว่าวิธี Discriminant Analysis มีอำนาจจำแนกได้ถูกต้องร้อยละ 96.10 ซึ่งตัวแบบ Logistic Regression ใช้ตัวแปรพยากรณ์ 4 ชนิดคือความหนาของก้อนเนื้อ (Clump Thickness (X₁)), การเกาะติดขอบของเซลล์ (Marginal Adhesion (X₄)), นิวเคลียสไม่ถูกห้อหุ้ม (Bare Nuclei (X₆)) และโครมาตินเฉพาะ (Bland Chromatin(X₇)) ส่วนตัวแบบ Discriminant Analysis ใช้ตัวแปรพยากรณ์ทั้ง 9 ชนิดในการพยากรณ์จัดกลุ่ม

This research is a classification group, the patient is detected at any breast cancer or non-breast cancer. Assessment of characteristics of abnormal growth of breast cancer cells such as Clump Thickness (X₁), Uniformity of Cell Size (X₂), Uniformity of Cell Shape (X₃), Marginal Adhesion (X₄), Single Epithelial Cell Size (X₅), Bare Nuclei (X₆), Bland Chromatin (X₇), Normal Nucleoli (X₈), and Mitoses (X₉) are independent variables. The dependent variable is the result which is detected at any breast cancer or non-breast cancer by using Logistic Regression Model and Discriminant Model. Conclude that Logistic Regression Model has 96.90% classification higher than Discriminant Model has 96.10% classification. Logistic Regression Model can used predicted variables 4 variables are Clump Thickness (X₁), Marginal Adhesion (X₄), Bare Nuclei (X₆) and Bland Chromatin (X₇). The study results reveal that the discriminant analysis can used predicted variables 9 variables for classifying groups of breast cancer and non- breast cancer.

Issue

Vol. 4 No. 5 (2017): Science and Technology ( กันยายน - ตุลาคม 2560 )

Section

บทความ : Science and Technology

Article Sidebar

Main Article Content

Abstract

Article Details