Cross Language (Thai-English) Information Retrieval: Concepts and Challenges

ไกรศักดิ์ เกษร

Published: Mar 30, 2013

Keywords:

Cross-language, Translingual, Bilingual, Information retrieval, Machine translation

ไกรศักดิ์ เกษร

Department of Computer Science and Information Technology, Faculty of Science, Naresuan University, Mueang District, Phitsanulok 65000

Abstract

Documents on the Internet are written in many languages. This is useful because users can verify information against sources in different languages. However, a query written in a single language cannot retrieve all relevant documents written in other languages. Moreover, some users do not know exactly which keywords to use in a query to retrieve the documents they want, so a search engine cannot find the relevant documents effectively. In addition, one keyword can refer to many different real-world concepts, so-called "polysemy", and many keywords can refer to the same concept, so-called "synonymy". These two problems significantly reduce search engine performance. Consequently, many researchers have tried to overcome them by developing Cross-Language Information Retrieval (CLIR) systems, which retrieve documents written in different languages from a single query. This idea is an emerging trend in search engines and could be developed into a commercial product for popular search engines such as Google or Bing. This article presents the concepts and ideas behind CLIR and summarizes the main challenges in this research area.

How to Cite

เกษร ไ. (2013). Cross Language (Thai-English) Information Retrieval: Concepts and Challenges. KKU Science Journal, 41(1), 121–133. Retrieved from https://ph01.tci-thaijo.org/index.php/KKUSciJ/article/view/249084

Issue

Vol. 41 No. 1 (2013): January - March 2013

Section

Review Articles

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
