Analysis of Foreign Tourist Review by Natural Language Processing and a Lexicon-Based Sentiment Analysis Tool

Main Article Content

Chakkarin Santirattanaphakdi
Suphakit Niwattanakul

Abstract

The current behavior of tourists is to search for online information to support travel decisions. Although online reviews have many advantages, analyzing large amounts of data is time-consuming and resource-intensive in extracting important information; therefore, systematic text analysis helps present more comprehensive and targeted information. This research aims to analyze reviews using dictionary-based sentiment analysis tools and to create an analytical report of reviews from foreign tourists toward Khao Yai National Park. A total of 12,035 reviews related to Khao Yai National Park were collected from online platforms. TextBlob, Flair, and VADER were used to classify opinions as positive, negative, or neutral. The results showed that VADER had the highest average accuracy at 76%. However, sentiment analysis also found reviews in which satisfaction scores conflicted with the content, showing the limitation of using sentiment analysis alone to reflect opinions, as opinions are complex and difficult to compare using a single standard. To address this issue, an analytical report was created by analyzing the relationships between key terms and related reviews using cosine similarity and summarizing the text using natural language processing with data visualization to reflect the strengths and limitations of the park. The analysis found that although Khao Yai National Park is praised for its natural beauty, resource richness, and biodiversity, some limitations are mentioned in the reviews, including different fees for Thai and foreign tourists, inconvenient and insufficient public transportation, and the need for valuable tourism experiences and activities suitable for all tourist groups. The results of this research provide basic information for tourist decision-making and are useful for developing guidelines to improve Khao Yai National Park to better meet tourists’ needs, and can also be applied as a model for other tourist destinations to enhance service quality and the competitiveness of Thailand’s tourism industry in the long term.

Article Details

Section
Research Article

References

Z. Zhu, Y. Zhao, and J. Wang, “The impact of destination online review content characteristics on travel intention: experiments based on psychological distance perspectives,” Aslib J. Inf. Manage., vol. 76, no. 1, pp. 42–64, 2024.

A. Sihombing, L.-W. Liu, and P. Pahrudin, “The impact of online marketing on tourists' visit intention: Mediating roles of trust,” J. Tourism, Heritage Serv. Marketing, vol. 10, no. 2, pp. 15–23, 2024.

L. Lei and D. Liu, Conducting Sentiment Analysis. Cambridge, U.K.: Cambridge University Press, 2021.

R. Rachmiani, N. Kintan Oktadinna, and T. Rachmat Fauzan, “The impact of online reviews and ratings on consumer purchasing decisions on e-commerce platforms,” Int. J. Manage. Sci. Inf. Technol., vol. 4, no. 2, pp. 504–515, 2024.

National Statistical Office of Thailand, “Number of tourists in national parks, fiscal year 2022,” 2022. [Online]. Available: https://catalog.dnp.go.th/dataset/91d66c6a-5b88-41ee-8597-11be8aec5aa6/resource/525f31b3-d572-41fa-a822-04f5f3c12ab7/download/tourism65.xlsx

National Statistical Office of Thailand, “Number of tourists in national parks, fiscal year 2023,” 2023. [Online]. Available: https://catalog.dnp.go.th/dataset/91d66c6a-5b88-41ee-8597-11be8aec5aa6/resource/70ae333d-19fc-4cdb-a60c-dc43bbf74ad6/download/tourism66.xlsx

D. Jurafsky and J. H. Martin, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, 2nd ed. Upper Saddle River, NJ, USA: Prentice Hall, 2008.

T. Snake, Natural Language Processing Practical Using Transformers with Python. 2022.

L. T. Dela Cruz et al., “An enhancement of support vector machine in the context of sentiment analysis applied in scraped data from Tripadvisor hotel reviews,” Int. J. Comput. Sci. Res., vol. 8, pp. 3235–3251, 2024.

S. Jardim and C. Mora, “Customer reviews sentiment-based analysis and clustering for market-oriented tourism services and products development or positioning,” Procedia Comput. Sci., vol. 196, pp. 199–206, 2022.

K. Puh and M. Bagić Babac, “Predicting sentiment and rating of tourist reviews using machine learning,” J. Hosp. Tour. Insights, vol. 6, no. 3, pp. 1188–1204, 2023.

I. Perikos and A. Diamantopoulos, “Explainable aspect-based sentiment analysis using transformer models,” Big Data Cogn. Comput., vol. 8, no. 11, 2024, Art. no. 141.

K. W. Trisna, J. Huang, H. Lei, and E. M. Dharma, “From context-independent embedding to transformer: Exploring sentiment classification in online reviews with deep learning approaches,” J. Theor. Appl. Inf. Technol., vol. 102, no. 19, pp. 6980–7003, 2024.

P. Suanpang, P. Jamjuntr, and P. Kaewyong, “Sentiment analysis with a TextBlob package Implications for tourism,” J. Manage. Inf. Decis. Sci., vol. 24, no. S6, pp. 1–9, 2021.

I. Khumtaveeporn and K. Wattanasuwan, “AI sentiment analysis for destination branding: A case study of Buriram, Thailand,” J. Bus. Adm., vol. 46, no. 180, pp. 50–74, 2023.

K. Barik and S. Misra, “Analysis of customer reviews with an improved VADER lexicon classifier,” J. Big Data, vol. 11, 2024, Art. no. 10.

D. Buhalis, “Marketing the competitive destination of the future,” Tourism Manage., vol. 21, no. 1, pp. 97–116, 2000.

R. Mitchell, Web Scraping with Python, 3rd ed. Sebastopol, CA, USA: O'Reilly Media, 2024.

M. Arief and N. A. Samsudin, “Hybrid approach with VADER and multinomial logistic regression for multiclass sentiment analysis in online customer review,” Int. J. Adv. Comput. Sci. Appl., vol. 14, no. 12, pp. 311–320, 2023.

D. Tay, Data Analytics for Discourse Analysis with Python: The Case of Therapy Talk. New York, NY, USA: Routledge, 2024.

B. Liu, Sentiment Analysis and Opinion Mining. Cham, Switzerland: Springer, 2022.