Quality analysis of online geocoding services for Thai text addresses

Main Article Content

Dueanpen Manoruang
Duangduen Asavasuthirakul

Abstract

A number of online geocoding services are now available enabling fast access to map-based geolocation. However, the quality of these services is uncertain, often being based on poor data, especially in developing countries such as Thailand. This paper reports on a comparative analysis of the quality of five such online geocoding services, with tests based on text addresses and points of interest (POIs) in Thailand. The geocoding service providers included in our tests were Google, MapQuest, Bing, Yahoo!, and OpenCage and the text inputs were in Thai. The quality of the geocoded results was measured using the match rates and the positional accuracy. Two experiments were conducted, each with a different input format: (i) text addresses collected from research participants (N = 1,511), and (ii) names of POIs sampled from a dataset of Thai academic institutes  (N = 5,000). The quality of the services tested was compared statistically using the Friedman test and the Wilcoxon signed rank test. The results show that Google outperformed all other services for both text addresses and POIs. Google, Bing, Yahoo!, and OpenCage each had match rates over 90%, while MapQuest’s match rate was 82%, but the positional accuracy of most services did not reach a high standard at rooftop levels. From this analysis, we identify geocoding issues that need to be addressed for further enhancement of the quality of the geocoding of addresses in Thailand. The knowledge obtained here also provides valuable insight into the geocoding issues facing Thailand and other developing countries, and it is hoped that this will benefit further research and the future development of high-quality geocoding tools.

Article Details

How to Cite
Manoruang, D., & Asavasuthirakul, D. (2019). Quality analysis of online geocoding services for Thai text addresses. Engineering and Applied Science Research, 46(2), 86–97. Retrieved from https://ph01.tci-thaijo.org/index.php/easr/article/view/140887
Section
ORIGINAL RESEARCH

References

Goldberg DW, Wilson JP, Knoblock CA. From text to geographic coordinates: the current state of geocoding. URISA J. 2007;19:33-46.

Roongpiboonsopit D, Karimi HA. Comparative evaluation and analysis of online geocoding services. Int J Geogr Inform Sci. 2010;24:1081-100.

Karen KK. Encyclopedia of geographic information science. Thousand Oaks, California: SAGE Publications Inc.; 2008.

Department of Industry, Innovation and Science. PSMA Geocoded National Address File (G-NAF) [Internet]. Australia: PSMA Australia; 2018 [cited 2018 Jun 8]. Available from: https://www.psma.com. au/sites/default/files/g-naf_product_description_1.pdf.

Universal Postal Union. Addressing the world - An address for everyone. 1st ed. [Internet]. Switzerland: International Bureau Universal Postal Union; 2012 [cited 2012 Jun 5]. Available from: http://www.upu. int/fileadmin/documentsFiles/activities/addressingAssistance/whitePaperAddressingTheWorldEn.pdf.

Seok S, Lee J. Development of geocoding and reverse geocoding method implemented for street-based addresses in Korea. J Korean Soc Surv Geodesy Photogramm Cartogr. 2016;34:33-42.

Babu TR, Chatterjee A, Khandeparker S, Subhash AV, Gupta S. Geographical address classification without using geolocation coordinates. Paris: ACM Press; 2015.

Chatterjee A, Anjaria J, Roy S, Ganguli A, Seal K. SAGEL: smart address geocoding engine for supply-chain logistics. Burlingame, California: ACM Press; 2016.

Davis Jr. CA, Fonseca F. Assessing the certainty of locations produced by an address geocoding system. GeoInformatica. 2007;11:103-29.

Informatica LLC 1993. Address verification best practices for Japan addresses [Internet]. 2017 [cited 2018 May 3]. Available from: https://kb.informatica. com/h2l/HowTo%20Library/1/0893-AddressVerificationBestPracticesforJapan Addresses-H2L.pdf.

Chang CH, Huang CY, Su YS. On Chinese postal address and associated information extraction [Internet]. 2012 [cited 2018 May 4]. Available from: https://kaigi.org/jsai/webprogram/2012/pdf/726.pdf.

Pan Y, Chen B, Lu Z, Li S, Zhang J, & Zhou Y. An address geocoding method for improving rural spatial information infrastructure. Proceedings of the Sixth International Symposium on Digital Earth: models, algorithms, and virtual reality; 2009 Sep 9-12; Beijing, China. USA: SPIE; 2010. p. 7840-06.

Tian Q, Ren F, Hu T, Liu J, Li R, Du Q. Using an optimized Chinese address matching method to develop a geocoding service: a case study of Shenzhen, China. Int J Geo-Inf. 2016;5(5):1-17.

Li L, Wang W, He B, Zhang Y. A hybrid method for Chinese address segmentation. Int J Geogr Inform Sci. 2018;32:30-48.

Eckman S, English N. Creating housing unit frames from address databases: geocoding precision and net coverage rates. Field Meth. 2012;24:399-408.

Geographical Names Board (NSW). NSW addressing user manual / geographical names board of New South Wales [Internet]. 2016 [cited 2018 Dec 7]. Available from: http://www.gnb.nsw.gov.au/__data/assets/pdf_ file/0007/199411/NSW_AUM_July2018_Final.pdf.

National Land Information Division. Block level location reference information maintenance method [Internet]. 2007 [cited 2018 May 3]. Available from: http://nlftp.mlit.go.jp/isj/method.html.

Ministry of the Interior and Safety. Introduction Road Name Address [Internet]. 2011 [cited 2018 Jun 6]. Available from: http://www.juso.go.kr/CommonPage Link.do?link=/eng/about/GuideBook.

Sun Z, Qiu AG., Zhao J, Zhang F, Zhao Y, Wang L. Technology of fuzzy Chinese-geocoding method. 2013 International Conference on Information Science and Cloud Computing (ISCC); 2013 Dec 7-8; Guangzhou, China. USA: IEEE; 2013. p. 7-12.

Thailand Post Co Ltd. The content of postal code Thailand [Internet]. 2015 [cited 2015 Oct 7]. Available from: postalcare@thailandpost.com.

Karimi HA, Durcik M, Rasdorf W. Evaluation of uncertainties associated with geocoding techniques. Comput Aided Civ Infrastruct Eng. 2004;19:170-85.

Zandbergen PA. A comparison of address point, parcel and street geocoding techniques. Comput Environ Urban Syst. 2008;32:214-32.

Bonner MR, Han D, Nie J, Rogerson P, Vena JE, Freudenheim JL. Positional accuracy of geocoded addresses in epidemiologic research. Epidemiology. 2003;14:408-12.

Krieger N, Waterman P, Lemieux K, Zierler S, Hogan JW. On the wrong side of the tracts? Evaluating the accuracy of geocoding in public health research. Am J Public Health. 2001;91:1114-6.

Yang D-H, Bilaver LM, Hayes O, Goerge R. Improving geocoding practices: evaluation of geocoding tools. J Med Syst. 2004;28:361-70.

Whitsel EA, Quibrera PM, Smith RL, Catellier DJ, Liao D, Henley AC, et al. Accuracy of commercial geocoding: assessment and implications. Epidem Perspect Innov. 2006;3:1-12.

Lovasi GS, Weiss JC, Hoskins R, Whitsel EA, Rice K, Erickson CE, et al. Comparing a single-stage geocoding method to a multi-stage geocoding method: how much and where do they disagree?. International Journal of Health Geographics. 2007;6:1-11.

Cetl V, Kliment T, Jogun T. A comparison of address geocoding techniques – case study of the city of Zagreb, Croatia. Survey Review. 2018;50:97-106.

Roongpiboonsopit D, Karimi HA. Quality assessment of online street and rooftop geocoding services. Cartography and Geographic Information Science. 2010;37:301-18.

Pietro GD, Rinnone F. Online geocoding services: a benchmarking analysis to some European cities. 2017 Baltic Geodetic Congress (BGC Geomatics); 2017 Jun 22-25; Gdansk, Poland. USA: IEEE; 2017. p. 273-81.

DGA Open Government License [Internet]. Thailand: School and Academic Institute Dataset 2014; 2014. [updated 2016 Jan 4; cited 2017 Jun 9]. Available from: https://data.go.th/DatasetDetail.aspxd=8548e3ab-00bf-4eae-b29a-156a4aa52c0d.

Zandbergen PA. Geocoding quality and implications for spatial analysis. Geography Compass. 2009;3:647-680.

M. Beyer KM, Schultz AF, Rushton G. Using zip codes as geocodes in cancer research. In: Rushton G, Armstrong MP, Gittler J, Greene BR, Pavlik CE, West MM, Zimmerman DL, editors. The Use of Geographic Codes in Cancer Prevention and Control, Research and Practice. USA: CRC Press Taylor & Francis Group; 2008. p. 37-67.

Wey CL, Griesse J, Kightlinger L, Wimberly MC. Geographic variability in geocoding success for west Nile virus cases in South Dakota. Health Place. 2009; 15:1108-14.

Rosu A, Chen D. An improved approach for geocoding Canadian postal code-based data in health-related studies. The Canadian Geographer / Le Géographe Canadien. 2016;60(2):270-81.