Improvement of Classification for Agriculture Bibliographic Data

Main Article Content

พิลาพรรณ โพธิ์นรินทร์
สุพจน์ นิตย์สุวัฒน์
ชูชาติ หฤไชยะศักดิ์

Abstract

In general, Classifying bibliographic data using lexicon-based might not enough in terms of classification efficiency. In this paper, we propose the agriculture bibliographic classification model by improving lexicon set and by using feature selection techniques. The techniques of Information Gain (IG), Chi Squared (CHI) and Gain Ratio (GR) are used in order to select the distinguish properties for feature selection process. Then three algorithms Decision Tree (DT), Naïve Bayes (NB) and Support Vector Machine (SVM) are applied to classify those features. The experiments were done by using 2,580 papers from agriculture publication database. The results show that the proposed method gave better performance than using only lexicon-based in terms of precision/recall and F-measure, respectively 1.3%.

 

การเพิ่มประสิทธิภาพการจำแนกหมวดหมู่ข้อมูลเชิงบรรณานุกรมด้านการเกษตร

บทคัดย่อไม่สมบูรณ์

Article Details

Section
ACTIS Article