Enhancement of Machine Learning Algorithm in Fine-grained Sentiment Analysis Using the Ensemble

M. Khairul Anam; Tri Putri Lestari; Helda Yenni; Torkis Nasution; Muhammad Bambang Firdaus

doi:10.37936/ecti-cit.2025192.257815

Published: Mar 8, 2025

Keywords:

Ensemble Learning, Machine Learning, Sentiment Analysis, SMOTE, Voting

M. Khairul Anam

Universitas Samudra, Indonesia

Tri Putri Lestari

Universitas Indraprasta PGRI, Indonesia

Helda Yenni

Universitas Sains dan Teknologi Indonesia, Indonesia

Torkis Nasution

Universitas Sains dan Teknologi Indonesia, Indonesia

Muhammad Bambang Firdaus

Universitas Mulawarman, Indonesia

Abstract

Fine-grained sentiment analysis plays a crucial role in extracting subtle opinions from textual data, especially in domains such as customer reviews and social media analysis. Traditional machine learning models, including Support Vector Machines (SVM), Naïve Bayes, and Decision Tree, often face limitations in accurately classifying fine-grained sentiments due to their inability to generalize well in complex classification tasks. To address this challenge, this study proposes an ensemble learning approach integrating voting, bagging, boosting, and stacking to enhance sentiment classification performance. Experiments were conducted on multiple datasets, comparing standalone classifiers and ensemble-based approaches. The results indicate that stacking-based ensemble models achieve the highest accuracy, reaching 92.45%, outperforming traditional classifiers such as SVM (88.23%) and Naïve Bayes (85.67%). Additionally, ensemble methods demonstrate improved generalization and robustness, reducing misclassification rates by 6% on average compared to individual classifiers. Among the tested ensemble techniques, stacking consistently delivered superior results, leveraging diverse weak learners to optimize sentiment classification accuracy. This research highlights the effectiveness of ensemble learning in fine-grained sentiment analysis, offering a robust methodology for improving classification accuracy and reducing sentiment misclassification. The findings suggest that ensemble approaches, particularly stacking, provide a more reliable and scalable solution for sentiment analysis tasks, making them suitable for real-world applications in natural language processing.
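To make the voting component of the ensemble concrete, the following is a minimal sketch in plain Python, not the authors' implementation. It contrasts hard voting (majority of predicted labels) with soft voting (averaged class probabilities) for a single review. The five-class label set, the three base classifiers, and all probability values are assumptions chosen for illustration; the abstract does not specify them. In practice, a library such as scikit-learn provides `VotingClassifier` and `StackingClassifier` for the same purpose.

```python
from collections import Counter

# Hypothetical fine-grained label set (an assumption; not given in the abstract).
LABELS = ["very negative", "negative", "neutral", "positive", "very positive"]


def hard_vote(predictions):
    """Return the label chosen by the most base classifiers.

    Ties go to the tied label that appears first in `predictions`.
    """
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label


def soft_vote(probabilities, labels=LABELS):
    """Average per-class probabilities over classifiers, then take the argmax."""
    n_models = len(probabilities)
    averaged = [
        sum(model_probs[i] for model_probs in probabilities) / n_models
        for i in range(len(labels))
    ]
    best_index = max(range(len(labels)), key=lambda i: averaged[i])
    return labels[best_index], averaged


def top_label(model_probs, labels=LABELS):
    """Convert one classifier's probability vector into its predicted label."""
    return labels[max(range(len(labels)), key=lambda i: model_probs[i])]


# Made-up class probabilities from three base classifiers for one review.
svm_probs = [0.05, 0.10, 0.15, 0.60, 0.10]
nb_probs = [0.05, 0.05, 0.10, 0.35, 0.45]
tree_probs = [0.00, 0.10, 0.10, 0.35, 0.45]
all_probs = [svm_probs, nb_probs, tree_probs]

hard_result = hard_vote([top_label(p) for p in all_probs])
soft_result, soft_scores = soft_vote(all_probs)

print("hard voting:", hard_result)
print("soft voting:", soft_result)
```

With these illustrative numbers, two of the three classifiers predict "very positive", so hard voting returns that label, while soft voting returns "positive" because one classifier is far more confident in it. Stacking goes one step further: instead of a fixed averaging rule, a meta-classifier is trained on the base classifiers' outputs.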

How to Cite

[1]

M. K. Anam, T. P. Lestari, H. Yenni, T. Nasution, and M. B. Firdaus, “Enhancement of Machine Learning Algorithm in Fine-grained Sentiment Analysis Using the Ensemble”, ECTI-CIT Transactions, vol. 19, no. 2, pp. 159–167, Mar. 2025.

Issue

Vol. 19 No. 2 (2025): ECTI Transactions on CIT (April 2025)

Section

Research Article

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
