Lithological Classification from Well Logs using Machine Learning Algorithms
Main Article Content
Abstract
Well logging is a widely used geophysical method to gather information from subsurface rocks and establish lithological classification. However, the criteria of lithological classification are loosely defined and human error can significantly contribute to the uncertainty of the interpretation. This study uses machine learning approaches to classify rock types from well logs of the Snake River Plain (SRP) in Idaho. To achieve the comprehensive results, three machine learning algorithms, K-nearest neighbour (K-nn), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGB) are employed on fifteen types of well logs from four geothermal wells in the SRP under three experimental conditions. In the first experiment, the classifiers are trained and tested with data from the same well in the train:test ratio of 7:3. The second scenario assigns data from three wells as a training subset and the remaining well as test subset. The third experiment uses the largest amount of data as a training subset, which combines data from three wells and 70% of the data from the remaining well. Hyperparameters in all classifiers are optimized to enhance model performance. Results suggest that SVM and K-nn exhibit comparable performance in all experiments, resulting in 89.68% (s = 10.40) and 88.84% (s = 9.92) of average accuracy, respectively. XGB shows the highest prediction accuracy in this study with average prediction accuracy at 90.67% (s = 8.21). This is largely because XGB partitions data into subgroups based on available features iteratively until every class is clearly separated from each other. In addition, XGB can recognize missing values in well logs and does not use these values for classification. XGB further indicates that gamma ray, neutron, and temperature are the top three important features that are used to improve the prediction accuracy.
Article Details
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Copyright © 2008 Department of Geology, Faculty of Science, Chulalongkorn University. Parts of an article can be photocopied or reproduced without prior written permission from the author(s), but due acknowledgments should be stated or cited accordingly.
References
Aly, M.H., Rodgers, D.W., Thackray, G.D., and Hughes, S.S. 2009. Recent magmatotectonic activity in the Eastern Snake River Plain–Island Park region revealed by SAR interferometry. Journal of Volcanology and Geothermal Research 188: 297-304.
Bedrosian, P.A., and Feucht, D.W. 2014. Structure and tectonics of the northwestern United States from EarthScope USArray magnetotelluric data. Earth and Planetary Science Letters 402: 275-289.
Bestagini, P., Lipari, V., and Tubaro, S. 2017. A machine learning approach to facies classification using well logs.
Bishop, C.M. 2006. Pattern Recognition and Machine Learning. Singapore: Springer Science Business Media, LLC.
Carmona, P., Climent, F., and Momparler, A. 2018. Predicting failure in the U.S. banking sector: An extreme gradient boosting approach. International Review of Economics & Finance.
Chen, T., and Guestrin, C. 2016. XGBoost. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '16, 785-794.
De Boissieu, F., Sevin, B., Cudahy, T., Mangeas, M., Chevrel, S., Ong, C., Rodger, A., Maurizot, P., Laukamp, C., Lau, I., Touraivane, T., Cluzel, D., and Despinoy, M. 2018. Regolith-geology mapping with support vector machine: A case study over weathered Ni-bearing peridotites, New Caledonia. International Journal of Applied Earth Observation and Geoinformation 64: 377-385.
DeNosaquo, K.R., Smith, R.B., and Lowry, A.R. 2009. Density and lithospheric strength models of the Yellowstone– Snake River Plain volcanic system from gravity and heat flow data. Journal of Volcanology and Geothermal Research 188: 108-127.
Devak, M., Dhanya, C.T., and Gosain, A.K. 2015. Dynamic coupling of support vector machine and K-nearest neighbour for downscaling daily rainfall. Journal of Hydrology 525: 286301.
Dubois, M.K., Bohling, G.C., and Chakrabarti, S. 2007. Comparison of four approaches to a rock facies classification problem. Computers & Geosciences 33: 599-617.
Fan, J., Wang, X., Wu, L., Zhou, H., Zhang, F., Yu, X., Lu, X., and Xiang, Y. 2018. Comparison of Support Vector Machine and Extreme Gradient Boosting for predicting daily global solar radiation using temperature and precipitation in humid subtropical climates: A case study in China. Energy Conversion and Management 164: 102-111.
Friedl, M.A., and Brodley, C.E. 1997. Decision tree classification of land cover from remotely sensed data. Remote Sensing of Environment 61: 399-409. Glowacz, A., and Glowacz, Z. 2016. Recognition of images of finger skin with application of histogram, image filtration and K-NN classifier. Biocybernetics and Biomedical Engineering 36: 95-101.
Humphreys, E.D. 1995. Post-Laramide removal of the Farallon slab, western United States. Geology 23.
Konaté, A.A., Pan, H., Khalid, M.A., Li, G., Yang, J.H., Deng, C., and Fang, S. 2015. Machine Learning Interpretation of Conventional Well Logs in Crystalline Rocks. pp. 360-370. Cham. Springer International Publishing.
Lonsdale, P. 2005. Creation of the Cocos and Nazca plates by fission of the Farallon plate. Tectonophysics 404: 237-264.
Lopes, R., and Jorge, A. 2017. Mind the Gap: A Well Log Data Analysis. The Workshop on Data Mining for Oil and Gas, Houston, Texas, USA. 1-6.
Rastegarzadeh, G., and Nemati, M. 2018. Primary mass discrimination of high energy cosmic rays using PNN and kNN methods. Advances in Space Research 61: 1181-1191.
Shervais, J.W., Schmitt, D.R., Nielson, D., Evans, J.P., Christiansen, E.H., Morgan, L., Pat Shanks, W.C., Prokopenko, A.A., Lachmar, T., Liberty, L.M., Blackwell, D.D., Glen, J.M., Champion, L.D., Potter, K.E., and Kessler, J.A. 2013. First Results from HOTSPOT: The Snake River Plain Scientific Drilling Project, Idaho, U.S.A. Sci. Dril. 15: 36-45.
Smirnoff, A., Boisvert, E., and Paradis, S.J. 2008. Support vector machine for 3D modelling from sparse geological information of various origins. Computers & Geosciences 34: 127-143.
Soltani, S., Kordestani, M., and Karim Aghaee, P. 2016. New estimation methodologies for well logging problems via a combination of fuzzy Kalman filter and different smoothers. Journal of Petroleum Science and Engineering 145: 704-710.
Steinbach, M., and Tan, P.-N. 2009. kNN: kNearest Neighbors. In Wu, X. and Kumar, V. (eds.), The Top Ten
Algorithms in Data Mining, pp. 151161. New York: Taylor & Francis Group, LLC.
Tester, J.W., Anderson, B.J., Batchelor, A.S., Blackwell, D.D., DiPippo, R., Drake, E.M., Garnish, J., Livesay, B., Moore,
M.C., Nichols, K., Petty, S., Toksöz, M.N., Ralph W. Veatch, J., Baria, R., Augustine, C., Murphy, E., Negraru, P., and Richards, M. 2006. Geothermal ResourceBase Assessment. In The future of geothermal energy: impact of Enhanced Geothermal Systems (EGS) on the United States in the 21st Century, pp. 35-81. Geothermal Program, MS 3830 Renewable Energy and Power Department Idaho National Laboratory: U.S. Department of Energy.
Tsangaratos, P., and Ilia, I. 2016. Comparison of a logistic regression and Naïve Bayes classifier in landslide susceptibility assessments: The influence of models complexity and training dataset size. Catena 145: 164-179.
Wu, H., and Zhao, J. 2018. Deep convolutional neural network model based chemical process fault diagnosis. Computers & Chemical Engineering 115: 185-197.
Xie, Y., Zhu, C., Zhou, W., Li, Z., Liu, X., and Tu, M. 2018. Evaluation of machine learning methods for formation lithology identification: A comparison of tuning processes and model performances. Journal of Petroleum Science and Engineering 160: 182-193.
Zhu, L., Zhang, C., Wei, Y., and Zhang, C.-m. 2016. Permeability Prediction of the Tight Sandstone Reservoirs Using Hybrid Intelligent Algorithm and Nuclear Magnetic Resonance Logging Data. Arabian Journal for Science and Engineering 42.
Zhu, L., Zhang, C., Zhang, C., Wei, Y., Zhou, X., Cheng, Y., Huang, Y., and Zhang, L. 2018. Prediction of Total Organic Carbon Content in shale reservoir Based on a New Integrated Hybrid Neural Network and Conventional Well Logging Curves. Journal of Geophysics and Engineering 15: 1050-1061.