The Use of Machine Learning Algorithms for Water Quality Index Prediction in the Sai Gon River, Vietnam

Main Article Content

Thuy Nguyen Thi Diem
Mai Nguyen Thi Huynh
Tra Tran Quang

Abstract

Accurate prediction of the water quality index (WQI) lays the groundwork for integrated river basins and sustainable water resource management. Recent and accelerated advances in machine learning have led to various promising applications in water quality assessment. The present study leverages the predictive performance of several ML algorithms, including extreme gradient boosting (XGB), the gradient boosting model (GBM), support vector regression (SVR), and the radial basic function (RBF), to predict the WQI at three monitoring sites on the Sai Gon River from 2015–2019. In comparison, the results indicate that the XGB model outperforms the other models when eight parameters, including DO, BOD5, COD, N-NH₄⁺, P-PO₄³⁻, pH, temperature, and total coliforms, are input. Specifically, the XGB model exhibited the lowest error rates (RMSE = 1.630 and MAE = 0.782) and highest correlation (R2 = 0.960 and NSE = 0.953), followed by the GBM, SVR, and RBF models. This study also revealed that model performance decreased substantially when N-NH₄⁺ and P-PO₄³⁻ were removed, whereas the exclusion of COD or BOD5 caused marginal declines in predictive capacity. These findings highlight that parsimonious ML models can minimize the parameters required for WQI prediction but still maintain satisfactory simulations and effectively capture potential relationships between input parameters and derive WQI. Generally, this study provides an analytical framework for simulating WQI based on parsimonious and accurate ML algorithms, which are conducive to water quality assessment and monitoring in developing nations.

Article Details

How to Cite
Nguyen Thi Diem, T., Nguyen Thi Huynh, M., & Tran Quang, T. (2025). The Use of Machine Learning Algorithms for Water Quality Index Prediction in the Sai Gon River, Vietnam. Applied Environmental Research, 47(2). https://doi.org/10.35762/AER.2025018
Section
Original Article
Author Biographies

Thuy Nguyen Thi Diem, University of Science, Ho Chi Minh City, Vietnam

https://orcid.org/0000-0002-1504-6699

Mai Nguyen Thi Huynh, University of Science, Ho Chi Minh City, Vietnam

https://orcid.org/0000-0002-5797-5832

Tra Tran Quang, Vietnam National University, Ho Chi Minh City, Vietnam

https://orcid.org/0000-0002-7797-4503