Optimizing Crop Yield Predictions through Satellite Data Fusion and Machine Learning

Main Article Content

Shilpa Naresh Vatkar
Sujata Kulkarni

Abstract

Accurate crop yield estimation is crucial for sustainable agriculture and food security, especially in Maharashtra, where climate variability significantly impacts crop growth. This study utilizes satellite data from MODIS, Landsat, Sentinel-1, and Sentinel-2 to predict the yields of 22 crops across 36 districts. Machine learning models, including Random Forest, Gradient Boosting, and SVM, were evaluated using RMSE, MAE, and R2 metrics. Random Forest outperformed the others, achieving R2 values above 0.70 for all crops, with a peak R2 of 0.93. Incorporating seasonal and permuted feature data further enhanced predictions, demonstrating the efficacy of integrating satellite data and machine learning for agriculture. Keywords: Machine learning, MODIS, Landsat-8, Sentinel-2, Sentinel-1, crop yield, features, vegetation indices.

Article Details

How to Cite
[1]
S. N. Vatkar and S. Kulkarni, “Optimizing Crop Yield Predictions through Satellite Data Fusion and Machine Learning”, ECTI-CIT Transactions, vol. 19, no. 4, pp. 557–568, Sep. 2025.
Section
Research Article

References

J. Liu et al., “Crop Yield Estimation Using Time-Series MODIS Data and the Effects of Cropland Masks in Ontario, Canada,” Remote Sensing, vol. 11, no. 20, p. 2419, Jan. 2019.

N. Kim, K.-J. Ha, N.-W. Park, J. Cho, S. Hong, and Y.-W. Lee, “A Comparison Between Major Artificial Intelligence Models for Crop Yield Prediction: Case Study of the Midwestern United States, 2006–2015,” ISPRS International Journal of Geo-Information, vol. 8, no. 5, p. 240, May 2019.

S. Khaki and L. Wang, “Crop Yield Prediction Using Deep Neural Networks,” Frontiers in Plant Science, vol. 10, 2019.

A. K. Maurya, M. Nadeem, D. Singh, K. P. Singh and N. S. Rajput, “Critical Analysis of Machine Learning Approaches for Vegetation Fractional Cover Estimation Using Drone and Sentinel-2 Data,” 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, pp. 343-346, 2021.

F. Gao, J. Masek, M. Schwaller and F. Hall, “On the blending of the Landsat and MODIS surface reflectance: predicting daily Landsat surface reflectance,” in IEEE Transactions on Geoscience and Remote Sensing, vol. 44, no. 8, pp. 22072218, Aug. 2006.

E. Vermote, “MOD09Q1 MODIS/Terra Surface Reflectance 8-Day L3 Global 250m SIN Grid V006,” NASA EOSDIS Land Processes DAAC, 2015.

T. Dong et al., “Estimating winter wheat biomass by assimilating leaf area index derived from fusion of Landsat-8 and modis data,” International Journal of Applied Earth Observation and Geoinformation, vol. 49, pp. 63–74, 2016.

M. Mao, H. Zhao, G. Tang and J. Ren, “In Season Crop Type Detection by Combing Sentinel1A and Sentinel-2 Imagery Based on the CNN Model,” Agronomy, vol. 13, no. 7, p. 1723, 2023.

F. Kogan et al., “Winter wheat yield forecasting in Ukraine based on Earth observation, meteorological data and biophysical models,” International Journal of Applied Earth Observation and Geoinformation, vol. 23, pp. 192–203, Aug. 2013.

R. Balaghi, B. Tychon, H. Eerens and M. Jlibene, “Empirical regression models using NDVI, rainfall and temperature data for the early prediction of wheat grain yields in Morocco,” International Journal of Applied Earth Observation and Geoinformation, vol. 10, no. 4, pp. 438–452, 2008.

M. Huihui, M. Jihua, F. Ji, Q. Zhang and H. Fang, “Comparison of Machine Learning Regression Algorithms for Cotton Leaf Area Index Retrieval Using Sentinel-2 Spectral Bands,” Applied Sciences, vol. 9, no. 7, p. 1459, 2019.

S. Kelkar and A. Kulkarni, “Impact of climate variability and change on crop production in Maharashtra, India,” Current Science, vol. 118, no. 8, 25 April 2020.

E. Ersin et al., “Crop Prediction Model Using Machine Learning Algorithms,” Applied Sciences, vol. 13, no. 16, p. 9288, 2023.

A. Abdollahi, B. Pradhan and A. Alamri, “Regional-Scale Analysis of Vegetation Dynamics Using Satellite Data and Machine Learning Algorithms: A Multi-Factorial Approach,” International Journal on Smart Sensing and Intelligent Systems, vol. 16, no. 1, 2023.

C. Achahboun, M. Chikhaoui, M. Naimi and M. Bellafkih, “Crops Classification Using Machine Learning And Google Earth Engine,” 2023 14th International Conference on Intelligent Systems: Theories and Applications (SITA), Casablanca, Morocco, pp. 1-8, 2023.

N. Gorelick, M. Hancher, M. Dixon, S. Ilyushchenko, D. Thau and R. Moore, “Google Earth Engine: Planetary-scale geospatial analysis for everyone,” Remote Sensing of Environment, vol. 202, pp. 18-27, 2017.

A. Verma, A. Kumar and K. Lal, “Kharif crop characterization using combination of SAR and MSI Optical Sentinel Satellite datasets,” Journal of Earth System Science, vol. 128, no. 230, 2019.

F. Novelli, H. Spiegel, T. Sand´en and F. Vuolo, “Assimilation of Sentinel-2 Leaf Area Index Data into a Physically-Based Crop Growth Model for Yield Estimation,” Agronomy, vol. 9, no. 5, p. 255, May 2019.

P. Tummala, M. Sobhana and S. Kakumani, “Predicting crop yield with NDVI and Backscatter values using Deep Neural Networks,” 2022 International Mobile and Embedded Technology Conference (MECON), Noida, India, pp. 390394, 2022.

S. V. S. Prasad, T. S. Savithri and I. V. M. Krishna, “Performance Evaluation Of SVM Kernels On Multispectral Liss III Data For Object Classification,” International Journal On Smart Sensing And Intelligent Systems, vol. 10, no. 4, 2017.

A. Bannari, D. Morin, F. Bonn and A. R. Huete, “A review of vegetation indices,” Remote Sensing Reviews, vol 13, no.1-2, pp. 95-120, 1995.

S. P. N and H. P. M. Kumar, “Soil Quality Identifying and Monitoring Approach for Sugarcane Using Machine Learning Techniques,” 2022 Fourth International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT), Mandya, India, pp. 15, 2022.

F. Xiao, H. Wang, Y. Xu and R. Zhang, “Fruit Detection and Recognition Based on Deep Learning for Automatic Harvesting: AnOverview and Review,” Agronomy, vol. 13, no. 6, p. 1625, 2023.

A. L’Heureux, K. Grolinger, H. F. Elyamany and M. A. M. Capretz, “Machine Learning With Big Data: Challenges and Approaches,” in IEEE Access, vol. 5, pp. 7776-7797, 2017.

S. D. R´ıo, V. L´opez, J. M. Ben´ıtez and F. Herrera, “On the use of Map Reduce for imbalanced Big Data using Random Forest,” Information Sciences, vol. 285, pp. 112–137, 2014.

X. Li and R. Bai, “Freight Vehicle Travel Time Prediction Using Gradient Boosting Regression Tree,” 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA), Anaheim, CA, USA, pp. 1010-1015, 2016.

L. Lavazza, S. Morasca and G. Rotoloni and , “On the Reliability of the Area Under the ROC Curve in Empirical Software Engineering,” in Proceedings of the 27th International Conference on Evaluation and Assessment in Software Engineering, pp. 93-100, 2023.

F. Pedregosa et al., “Scikit-learn: Machine learning in Python,” The Journal of machine Learning research, vol. 12, pp. 2825–2830, 2011.