Studying Characteristics of Huay Saneng and Predicting The Water Levels by Comparison with Other Regression Models
Main Article Content
Abstract
The drought problem in Surin Province has become complex issue over the course of the last five years. Water level in reservoir is directly associated with drought. Therefore, understanding the root cause of the problem is of utmost importance. This research investigated which attributes were subject to drought in a reservoir called Huay Saneng and which machine learning regression models showed the best accuracy using data from the last 5 years; with 1,825 records derived from Royal Irrigation Department operation on Huay Saneng area. The research finding established that Decision tree regression showed satisfactory results with 94.7% reliability; better than Multiple linear regression, Polynomial regression, and Random forest regression. The use of four attributes consisting of date, evaporation, water levels and rain can assist to make prediction with satisfactory results. In addition, the research found that the levels of evaporation of water is an important factor in assisting prediction through the process of feature selection. The research discussed how conservation of water levels in the past five years tended to be less inclined and how future research may be improved in order to predict drought accurately in the future.
Article Details
References
B. C. Bates, Z. W. Kundzewicz, S. Wu, and J. P. Palutikof, “Climate Change and Water. Technical Paper of the Intergovernmental Panel on Climate Change, IPCC Secretariat: Geneva, Switzerland,” The American Midland Naturalist, Vol. 168, No. 1, 2008.
A. K. Mishra and V. P. Singh, “A review of drought concepts,” Journal of hydrology, Vol. 391, No. 1–2, pp. 202–216, 2010.
bangkokpost, Drought-struck Surin Hospital declares emergency,. Available Online at. https://www. bangkokpost.com/thailand/general/1727555/drought-struck-surin-hospital-declares-emergency, accessed on 10 Mar 2021.
B. Lantz, Machine learning with R. Packt publishing ltd, 2013.
D. A. Sachindra and S. Kanae, “Machine learning for downscaling: the use of parallel multiple populations in genetic programming,” Stochastic Environmental Research and Risk Assessment, Vol. 33, No. 8, pp. 1497–1533, 2019.
P. Sukka, Forecasting of Daily Inflow to Large Reservoir in Upper Ping River Basin Using Artificial Neural Network, Mas t er of Engineering (Civil Engineering), Chiang Mai University, Chiang Mai.(in Thai), 2005.
S. Amloy, Reservoir Inflow Forecasting by Decision Tree Model: A Case Study of Huay Nam Sai Reservoir Nakhon Si Thammarat Province, Master of Engineering (Water Resources Engineering), Kasetsart University, Bangkok.(in Thai), 2009.
H. D. P. Weerasinghe, H. L. Premaratne, and D. U. J. Sonnadara, “Performance of neural networks in forecasting daily precipitation using multiple sources,” Journal of the National Science Foundation of Sri Lanka, Vol. 38, No. 3, 2010.
N. Z. Che Ghani, Z. Abu Hasan, and L. Tze Liang, “Estimation of missing rainfall data using GEP: case study of raja river, Alor Setar, Kedah,” Advances in Artificial Intelligence, Vol. 2014, 2014.
L. Mediero, L. Garrote, and A. Chavez-Jimenez, “Improving probabilistic flood forecasting through a data assimilation scheme based on genetic programming,” Natural Hazards and Earth System Sciences, Vol. 12, No. 12, pp. 3719–3732, 2012.
G. Artigue, A. Johannet, V. Borrell, and S. Pistre, “Flash flood forecasting in poorly gauged basins using neural networks: case study of the Gardon de Mialet basin (southern France),” Natural Hazards and Earth System Sciences, Vol. 12, No. 11, pp. 3307–3324, 2012.
N. Watanabe, K. Fukami, H. Imamura, K. Sonoda, and S. Yamane, “Flood forecasting technology with radar-derived rainfall data using genetic programming,” 2009 International Joint Conference on Neural Networks, Atlanta, Georgia, USA, pp. 3311–3318, 2009.
R. Obiedat, M. Alkasassbeh, H. Faris, and O. Harfoushi, “Customer churn prediction using a hybrid genetic programming approach,” Scientific Research and Essays, Vol. 8, No. 27, pp. 1289–1295, 2013.
H. Ebrahimi and T. Rajaee, “Simulation of groundwater level variations using wavelet combined with neural network, linear regression and support vector machine,” Global and Planetary Change, Vol. 148, pp. 181–191, 2017.
U. S. Panu and T. C. Sharma, “Challenges in drought research: some perspectives and future directions,” Hydrological Sciences Journal, Vol. 47, No. S1, pp. S19–S30, 2002.
Y. Feng and S. Wang, “A forecast for bicycle rental demand based on random forests and multiple linear regression,” 2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS), Wuhan, China, pp. 101–105, 2017.
S. Barua, A. W. M. Ng, and B. J. C. Perera, “Artificial neural network–based drought forecasting using a nonlinear aggregated drought index,” Journal of Hydrologic Engineering, Vol. 17, No. 12, pp. 1408–1413, 2012.
S. Ghimire, R. C. Deo, N. J. Downs, and N. Raj, “Global solar radiation prediction by ANN integrated with European Centre for medium range weather forecast fields in solar rich cities of Queensland Australia,” Journal of cleaner production, Vol. 216, pp. 288–310, 2019.
T. Yang, X. Zhou, Z. Yu, V. Krysanova, and B. Wang, “Drought projection based on a hybrid drought index using Artificial Neural Networks,” Hydrological Processes, Vol. 29, No. 11, pp. 2635–2648, 2015.
M. Mokhtarzad, F. Eskandari, N. J. Vanjani, and A. Arabasadi, “Drought forecasting by ANN, ANFIS, and SVM and comparison of the models,” Environmental earth sciences, Vol. 76, No. 21, pp. 1–10, 2017.
P. C. Deka, “Support vector machine applications in the field of hydrology: a review,” Applied soft computing, Vol. 19, pp. 372–386, 2014.
X. Wang, T. Liu, X. Zheng, H. Peng, J. Xin, and B. Zhang, “Short-term prediction of groundwater level using improved random forest regression with a combination of random features,” Applied Water Science, Vol. 8, No. 5, pp. 1–12, 2018.
V. Rodriguez-Galiano, M. P. Mendes, M. J. Garcia-Soldado, M. Chica-Olmo, and L. Ribeiro, “Predictive modeling of groundwater nitrate pollution using Random Forest and multisource variables related to intrinsic and specific vulnerability: A case study in an agricultural setting (Southern Spain),” Science of the Total Environment, Vol. 476, pp. 189–206, 2014.
M. Hamza and D. Larocque, “An empirical comparison of ensemble methods based on classification trees,” Journal of Statistical Computation and Simulation, Vol. 75, No. 8, pp. 629–643, 2005.
H. Liao and W. Sun, “Forecasting and evaluating water quality of Chao Lake based on an improved decision tree method,” Procedia Environmental Sciences, Vol. 2, pp. 970–979, 2010.
W. Moudani, “Dynamic features selection for heart disease classification,” International Journal of Health and Medical Engineering, Vol. 7, No. 2, pp. 105–110, 2013.
D. PANAGOULIA, “Impacts of GISS-modelled climate changes on catchment hydrology,” Hydrological Sciences Journal, Vol. 37, No. 2, pp. 141–163, 1992.
D. P. Lettenmaier and T. Y. Gan, “Hydrologic sensitivities of the Sacramento‐San Joaquin River basin, California, to global warming,” Water Resources Research, Vol. 26, No. 1, pp. 69–86, 1990.
M. B. Richman and L. M. Leslie, “Machine Learning for Attribution of Heat and Drought in Southwestern Australia,” Procedia Computer Science, Vol. 168, pp. 3–10, 2020.
H. Balti, A. Ben Abbes, N. Mellouli, I. R. Farah, Y. Sang, and M. Lamolle, “A review of drought monitoring with big data: Issues, methods, challenges and research directions,” Ecological Informatics, Vol. 60, pp. 101136, 2020.
D. C. Montgomery, E. A. Peck, and G. G. Vining, Introduction to linear regression analysis: John Wiley & Sons, 2021.
D. E. Kandzari, D. L. Bhatt, S. Brar, C. M. Devireddy, M. Esler, M. Fahy, J. M. Flack, B. T. Katzen, J. Lea, and D. P. Lee, “Predictors of blood pressure response in the SYMPLICITY HTN-3 trial,” European heart journal, Vol. 36, No. 4, pp. 219–227, 2015.
D. B. Lobell, J. I. Ortiz‐Monasterio, G. P. Asner, R. L. Naylor, and W. P. Falcon, “Combining field surveys, remote sensing, and regression trees to understand yield variations in an irrigated wheat landscape,” Agronomy Journal, Vol. 97, No. 1, pp. 241–249, 2005.
J. A. Vitale, V. Povìa, N. D. Vitale, T. Bassani, G. Lombardi, L. Giacomelli, G. Banfi, and A. La Torre, “The effect of two different speed endurance training protocols on a multiple shuttle run performance in young elite male soccer players,” Research in Sports Medicine, Vol. 26, No. 4, pp. 436–449, 2018.
A. Gelman, J. Hill, and A. Vehtari, Regression and other stories. Cambridge University Press, 2020.
M. R. Balf, R. Noori, R. Berndtsson, A. Ghaemi, and B. Ghiasi, “Evolutionary polynomial regression approach to predict longitudinal dispersion coefficient in rivers,” Journal of Water Supply: Research and Technology-Aqua, Vol. 67, No. 5, pp. 447–457, 2018.
H. Huang, Z. Wang, F. Xia, X. Shang, Y. Liu, M. Zhang, R. A. Dahlgren, and K. Mei, “Water quality trend and change-point analyses using integration of locally weighted polynomial regression and segmented regression,” Environmental Science and Pollution Research, Vol. 24, No. 18, pp. 15827–15837, 2017.
M. Gentilucci, C. Bisci, P. Burt, M. Fazzini, and C. Vaccaro, “Interpolation of rainfall through polynomial regression in the Marche region (Central Italy),” The Annual International Conference on Geographic Information Science, Lund, Sweden, pp. 55–73, 2018.
X. Jing, “An integrated prediction model for water supply-demand ability,” 2016 4th International Conference on Advanced Materials and Information Technology Processing (AMITP 2016), Guilin, China, pp. 528–531, 2016.
J. R. Quinlan, C4. 5: programs for machine learning. Elsevier, 2014.
S. Ragettli, J. Zhou, H. Wang, C. Liu, and L. Guo, “Modeling flash floods in ungauged mountain catchments of China: A decision tree learning approach for parameter regionalization,” Journal of Hydrology, Vol. 555, pp. 330–346, 2017.
E. Sutoyo and A. Musnansyah, “A Hybrid of Seasonal Autoregressive Integrated Moving Average (SARIMA) and Decision Tree for Drought Forecasting,” Proceedings of the International Conference on Engineering and Information Technology for Sustainable Industry, Tangerang, Indonesia, pp. 1–6, 2020.
L. Breiman, “Bagging predictors,” Machine learning, Vol. 24, No. 2, pp. 123–140, 1996.
J. W. Osborne, Best practices in data cleaning: A complete guide to everything you need to do before and after collecting your data. Sage, 2013.
S. Raschka, Python machine learning: Packt publishing ltd, 2015.
A. Géron, Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. O’Reilly Media, 2019.