Time Series-Based Predictive Modeling of PM2.5 Levels in Chiang Mai, Thailand

Authors

  • Tewa Promnuchanont Department of Computer Information System, Faculty of Business Administration and Libera Arts,Rajamangala University of Technology Lanna, Chiang Mai, Thailand
  • Theeraphop Saengsri Department of Computer Information System, Faculty of Business Administration and Libera Arts,Rajamangala University of Technology Lanna, Chiang Mai, Thailand
  • Rujipan Kosarat Department of Software Engineering, Faculty of Engineering, Rajamangala University of TechnologyLanna, Chiang Mai, Thailand
  • Piyaphol Yuenyongsathaworn Department of Software Engineering, Faculty of Engineering, Rajamangala University of TechnologyLanna, Chiang Mai, Thailand

DOI:

https://doi.org/10.14456/rmutlengj.2025.11

Keywords:

PM2.5, ARIMA, SARIMA, SARIMAX, Forecast Model

Abstract

The purpose of this study is to use a variety of models to create prediction models for Chiang Mai's PM2.5 levels. To improve the accuracy of our predictions, we take into account outside variables that might influence PM2.5 levels. Among the variables that we include in the data are PM2.5 concentrations, temperature, wind speed, precipitation, cloud cover, relative humidity, and other external factors. Before using the model, the researcher used basic statistical analysis, seasonal analysis, and stationary analysis to assess the data. The team of researchers carried out both data transformation and data cleansing. We tested the ARIMA, SARIMA, and SARIMAX forecasting models. First, we use ARIMA to forecast and assess results. The SARIMA model more accurately captured the seasonal connection in the data when we included a seasonal component. The model was able to forecast PM2.5 levels more precisely at times when seasonal patterns recurred thanks to this improvement. As the last step, we used the SARIMAX model to improve performance by adding exogenous variables. In the end, we assessed the accuracy and performance of each forecast using the MAE and RMSE numbers. The ARIMA model yielded MAE values of 7.34 and RMSE 7.95. The SARIMA model MAE values of 5.76 and RMSE 6.54. The SARIMAX model, when incorporating humidity, had the lowest MAE values of 4.36 and RMSE 5.25, representing improvements MAE of 40.6% and RMSE 34% compared to ARIMA.

References

Adiat KAN, Akeredolu BE, Akinlalu AA, Olayanju GM. Application of logistic regression analysis in prediction of groundwater vulnerability in gold mining environment: a case of Ilesa gold mining area, southwestern, Nigeria. Environ Monit Assess. 2020 Aug; 192(9).

Robles-Velasco A, Cortés P, Muñuzuri J, Baets BD. Prediction of pipe failures in water supply networks for longer time periods through multi-label classification. Expert Syst Appl. 2023 Mar; 213:119050.

Telikani A, Gandomi AH, Shahbahrami A. A survey of evolutionary computation for association rule mining. Inf Sci. 2020 Mar ; 524:318–52.

Hickman L, Thapa S, Tay L, Cao M, Srinivasan P. Text preprocessing for text mining in organizational research: review and recommendations. Organ Res Methods. 2020 Nov ; 25(1):114–46.

Cuenca E, Sallaberry A, Wang FY, Poncelet P. MultiStream: a multiresolution streamgraph approach to explore hierarchical time series. IEEE Trans Vis Comput Graph. 2018 Dec; 24(12):3160–73.

Felzenszwalb PF, Huttenlocher DP. Efficient graph-based image segmentation. Int J Comput Vis. 2004 Apr; 59(2):167–81.

Wang S, Cao J, Yu PS. Deep learning for spatio-temporal data mining: a survey. IEEE Trans Knowl Data Eng. 2020 Sep; 34(8):3681–700.

Kong S, Ji Y, Lü B, Zhao X, Han B, Bai Z. Similarities and differences in PM2.5, PM10 and TSP chemical profiles of fugitive dust sources in a coastal oilfield city in China. Aerosol Air Qual Res. 2014 Jan; 14(7):2017–28.

Chen Y, Shah N, Huggins FE, Huffman GP. Investigation of the microcharacteristics of PM2.5 in residual oil fly ash by analytical transmission electron microscopy. Environ Sci Technol. 2004 Nov; 38(24):6553–60.

Singh K, Singh S, Jha AK, Aggarwal SG, Bisht DS, Murty BP, et al. Mass-size distribution of PM10 and its characterization of ionic species in fine (PM2.5) and coarse (PM10−2.5) mode, New Delhi, India. Nat Hazards. 2013 Mar; 68(2):775–89.

Amnuaylojaroen T, Surapipith V, Macatangay RC. Projection of the near-future PM2.5 in Northern Peninsular Southeast Asia under RCP8.5. Atmosphere. 2022; 13(2):305.

Lakshmi P, Yarlagadda S, Akkineni H, Reddy AM. Big data analytics applying the fusion approach of multicriteria decision making with deep learning algorithms. Int J Eng Trends Technol. 2021 Jan; 69(1):24–8.

Yang C, Huang Q, Li Z, Liu K, Hu F. Big data and cloud computing: innovation opportunities and challenges. Int J Digit Earth. 2016 Nov; 10(1):13–53.

Wang S, Cao J, Yu PS. Deep Learning for Spatio-Temporal Data Mining: A Survey. IEEE Transactions on Knowledge and Data Engineering. 2020 Sep; 34(8):3681–700.

Díaz-Robles LA, Ortega JC, Fu JS, Reed GD, Chow JC, Watson JG, et al. A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: the case of Temuco, Chile. Atmos Environ. 2008 Jul; 42(35):8331–40.

Lee D, Lee D, Choi M, Lee J. Prediction of network throughput using ARIMA. In: 2020 International Conference on Artificial Intelligence in Information and Communication (ICAIIC); 2020; Fukuoka, Japan. IEEE; 2020. p. 1–5.

Khashei M, Bijari M. A novel hybridization of artificial neural networks and ARIMA models for time series forecasting. Appl Soft Comput. 2010 Nov; 11(2):2664–75.

Shivhare N, Rahul AK, Dwivedi SB, Dikshit PKS. ARIMA based daily weather forecasting tool: a case study for Varanasi. Mausam. 2021 Nov;70(1):133–40.

Berkeleyearth. Org [Internet]. Independent Data for a Resilient Future, Available from: https://berkeleyearth.org/.

An additional source of environmental information [Internet]. the Hydrological Information Institute, Available from: https://www.hii.or.th/.

Yakubu UA, Saputra MPA. Time series model analysis using autocorrelation function (ACF) and partial autocorrelation function (PACF) for e-wallet transactions during a pandemic. Int J Glob Oper Res. 2022 Aug ;3(3):80–5.

Manigandan P, Alam MS, Alharthi M, Khan U, Alagirisamy K, Pachiyappan D, et al. Forecasting natural gas production and consumption in United States—evidence from SARIMA and SARIMAX models. Energies. 2021 Sep;14(19):6021.

Downloads

Published

2025-12-16

How to Cite

Promnuchanont, T. ., Saengsri, T. ., Kosarat, R., & Yuenyongsathaworn, P. . (2025). Time Series-Based Predictive Modeling of PM2.5 Levels in Chiang Mai, Thailand. RMUTL Engineering Journal, 10(2), 22–33. https://doi.org/10.14456/rmutlengj.2025.11

Issue

Section

Research Article