การรู้จำเสียงพูดภาษาไทยในสภาพแวดล้อมเสียงรบกวนโดยใช้ PocketSphinx : กรณีศึกษาการนับสินค้าคงคลัง

อุดม ได้พร้อม; วีรวุฒิ ทัฬหิกรรม; ดัชกรณ์ ตันเจริญ

PDF

Published: Jun 15, 2015

Keywords:

Speech Recognition Noisy Environment

อุดม ได้พร้อม

The Faculty of Engineering and Technology, Panyapiwat Institute of Management

วีรวุฒิ ทัฬหิกรรม

The Faculty of Engineering and Technology, Panyapiwat Institute of Management

ดัชกรณ์ ตันเจริญ

The Faculty of Engineering and Technology, Panyapiwat Institute of Management

Abstract

Recently, Speech recognition technology play an increasingly important role in everyday life. Speech recognition applications have become very popular and increased convenience of using Smartphone such as voice commands, turn on/off hardware and access the different apps and features on Smartphone. Speech recognition technology can be very helpful for users who are blind or visually impaired because operating Smartphone with difficulties seeing is complicated task. In addition, general Smartphone users can also benefit from this technology such as the rapidity and ease of operation.

The purpose of developing this project is to apply technology of speech recognition to items counting system, Speech recognition not only makes it easier and quicker to use Smartphone, it also allows hands-free use in situations where our hands are busy such as counting stock items in retail. Speech recognition can improve the working speed by counting item and memorizing quantities without any paper note or typing on devices.

Recently, Speech recognition technology play an increasingly important role in everyday life. Speech recognition applications have become very popular and increased convenience of using Smartphone such as voice commands, turn on/off hardware and access the different apps and features on Smartphone. Speech recognition technology can be very helpful for users who are blind or visually impaired because operating Smartphone with difficulties seeing is complicated task. In addition, general Smartphone users can also benefit from this technology such as the rapidity and ease of operation.

The purpose of developing this project is to apply technology of speech recognition to items counting system, Speech recognition not only makes it easier and quicker to use Smartphone, it also allows hands-free use in situations where our hands are busy such as counting stock items in retail. Speech recognition can improve the working speed by counting item and memorizing quantities without any paper note or typing on devices.

Goal of this project is to develop items counting system by speech that can run on smart phones with the Android operating system, which has increasingly number of users. Furthermore, this system can operate under noisy environment such as office and convenience store. The simulation results show that improvement of speech recognition model can reduce error rate.

Issue

Vol. 3 No. 1 (2015): January - June

Section

Research Article

Article Accepting Policy

The editorial board of Thai-Nichi Institute of Technology is pleased to receive articles from lecturers and experts in the fields of engineering and technology written in Thai or English. The academic work submitted for publication must not be published in any other publication before and must not be under consideration of other journal submissions. Therefore, those interested in participating in the dissemination of work and knowledge can submit their article to the editorial board for further submission to the screening committee to consider publishing in the journal. The articles that can be published include solely research articles. Interested persons can prepare their articles by reviewing recommendations for article authors.

Copyright infringement is solely the responsibility of the author(s) of the article. Articles that have been published must be screened and reviewed for quality from qualified experts approved by the editorial board.

The text that appears within each article published in this research journal is a personal opinion of each author, nothing related to Thai-Nichi Institute of Technology, and other faculty members in the institution in any way. Responsibilities and accuracy for the content of each article are owned by each author. If there is any mistake, each author will be responsible for his/her own article(s).

The editorial board reserves the right not to bring any content, views or comments of articles in the Journal of Thai-Nichi Institute of Technology to publish before receiving permission from the authorized author(s) in writing. The published work is the copyright of the Journal of Thai-Nichi Institute of Technology.

References

D. Huggins-Daines, M. Kumar, A. Chan, A.W. Black, M. Ravishankar & A.I. Rudnicky, “Pocketsphinx : A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices,” in 2006 IEEE International Conference on Acoustics Speech and Signal Processing, Toulouse, France, 2006.

W. Walker, P. Lamere, P. Kwok, B. Raj, R. Singh, E. Gouvea, P. Wolf and J. Woelfel, “Sphinx-4: A Flexible Open Source Framework for Speech Recognition,” in AMLI TR-2004-139, Nov. 2004.

P. Cotsomrong, T. Sunpetchniyom, S. Kasuriya, N. Thatphithakkul & C. Wutiwiwatchai, “LOTUS : Large vOcabulary Thai continuous Speech Recognition Corpus,” in NAC2005, Nonthaburi, Thailand, 2005

บุญเสริม กิจศิริกุล, ณัฐกร ทับทอง, “การพัมนาระบบรู้จำเสียงพูดภาษาไทย,” โครงการเชื่อมโยงการวิจัยภาควิชาวิศวกรรมคอมพิวเตอร์ คณะวิศวกรรมศาสตร์ จุฬาลงกรณ์มหาวิทยาลัย, 2546.

Freeman D.K., Cosier G., Southcott C.B., Boyd., “The Voice Activity Detector for the PAN-European Digital Cellular Mobile Telephone Service,” International Conference on Acoustics, 1989.

Jon P. Nedel, “Duration normalization for robust recognition of spontaneous speech via missing featire methods,” Ph.D. Thesis, Carnegie Mellon University, 2004.

J. Baker, “Stochastic Modeling as a Means of Automatic Speech Recognition,” Ph.D. Thesis, Carnegie Mellon University, 1975.

มนตรี โพธิโสโนทัย, เฉลิมภัณฑ์ ฟองสมุทร, “วิธีการรู้จำเสียงพูดภาษาไทยแบบทนทานต่อเสียงรบกวนภายนอกม” วารสารเทคโนโลยีสารสนเทศ, ฉบับที่ 13, มกราคม-มิถุนายน 2554.

Article Sidebar

Main Article Content

Abstract

Article Details

References