An Experiment of Thai Speech Recognition in a Noisy Environment using PocketSphinx : A Case of Inventory Counting
Main Article Content
Abstract
Recently, Speech recognition technology play an increasingly important role in everyday life. Speech recognition applications have become very popular and increased convenience of using Smartphone such as voice commands, turn on/off hardware and access the different apps and features on Smartphone. Speech recognition technology can be very helpful for users who are blind or visually impaired because operating Smartphone with difficulties seeing is complicated task. In addition, general Smartphone users can also benefit from this technology such as the rapidity and ease of operation.
The purpose of developing this project is to apply technology of speech recognition to items counting system, Speech recognition not only makes it easier and quicker to use Smartphone, it also allows hands-free use in situations where our hands are busy such as counting stock items in retail. Speech recognition can improve the working speed by counting item and memorizing quantities without any paper note or typing on devices.
Recently, Speech recognition technology play an increasingly important role in everyday life. Speech recognition applications have become very popular and increased convenience of using Smartphone such as voice commands, turn on/off hardware and access the different apps and features on Smartphone. Speech recognition technology can be very helpful for users who are blind or visually impaired because operating Smartphone with difficulties seeing is complicated task. In addition, general Smartphone users can also benefit from this technology such as the rapidity and ease of operation.
The purpose of developing this project is to apply technology of speech recognition to items counting system, Speech recognition not only makes it easier and quicker to use Smartphone, it also allows hands-free use in situations where our hands are busy such as counting stock items in retail. Speech recognition can improve the working speed by counting item and memorizing quantities without any paper note or typing on devices.
Goal of this project is to develop items counting system by speech that can run on smart phones with the Android operating system, which has increasingly number of users. Furthermore, this system can operate under noisy environment such as office and convenience store. The simulation results show that improvement of speech recognition model can reduce error rate.
Goal of this project is to develop items counting system by speech that can run on smart phones with the Android operating system, which has increasingly number of users. Furthermore, this system can operate under noisy environment such as office and convenience store. The simulation results show that improvement of speech recognition model can reduce error rate.
Article Details
Article Accepting Policy
The editorial board of Thai-Nichi Institute of Technology is pleased to receive articles from lecturers and experts in the fields of business administration, languages, engineering and technology written in Thai or English. The academic work submitted for publication must not be published in any other publication before and must not be under consideration of other journal submissions. Therefore, those interested in participating in the dissemination of work and knowledge can submit their article to the editorial board for further submission to the screening committee to consider publishing in the journal. The articles that can be published include solely research articles. Interested persons can prepare their articles by reviewing recommendations for article authors.
Copyright infringement is solely the responsibility of the author(s) of the article. Articles that have been published must be screened and reviewed for quality from qualified experts approved by the editorial board.
The text that appears within each article published in this research journal is a personal opinion of each author, nothing related to Thai-Nichi Institute of Technology, and other faculty members in the institution in any way. Responsibilities and accuracy for the content of each article are owned by each author. If there is any mistake, each author will be responsible for his/her own article(s).
The editorial board reserves the right not to bring any content, views or comments of articles in the Journal of Thai-Nichi Institute of Technology to publish before receiving permission from the authorized author(s) in writing. The published work is the copyright of the Journal of Thai-Nichi Institute of Technology.
References
D. Huggins-Daines, M. Kumar, A. Chan, A.W. Black, M. Ravishankar & A.I. Rudnicky, “Pocketsphinx : A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices,” in 2006 IEEE International Conference on Acoustics Speech and Signal Processing, Toulouse, France, 2006.
W. Walker, P. Lamere, P. Kwok, B. Raj, R. Singh, E. Gouvea, P. Wolf and J. Woelfel, “Sphinx-4: A Flexible Open Source Framework for Speech Recognition,” in AMLI TR-2004-139, Nov. 2004.
P. Cotsomrong, T. Sunpetchniyom, S. Kasuriya, N. Thatphithakkul & C. Wutiwiwatchai, “LOTUS : Large vOcabulary Thai continuous Speech Recognition Corpus,” in NAC2005, Nonthaburi, Thailand, 2005
บุญเสริม กิจศิริกุล, ณัฐกร ทับทอง, “การพัมนาระบบรู้จำเสียงพูดภาษาไทย,” โครงการเชื่อมโยงการวิจัยภาควิชาวิศวกรรมคอมพิวเตอร์ คณะวิศวกรรมศาสตร์ จุฬาลงกรณ์มหาวิทยาลัย, 2546.
Freeman D.K., Cosier G., Southcott C.B., Boyd., “The Voice Activity Detector for the PAN-European Digital Cellular Mobile Telephone Service,” International Conference on Acoustics, 1989.
Jon P. Nedel, “Duration normalization for robust recognition of spontaneous speech via missing featire methods,” Ph.D. Thesis, Carnegie Mellon University, 2004.
J. Baker, “Stochastic Modeling as a Means of Automatic Speech Recognition,” Ph.D. Thesis, Carnegie Mellon University, 1975.
มนตรี โพธิโสโนทัย, เฉลิมภัณฑ์ ฟองสมุทร, “วิธีการรู้จำเสียงพูดภาษาไทยแบบทนทานต่อเสียงรบกวนภายนอกม” วารสารเทคโนโลยีสารสนเทศ, ฉบับที่ 13, มกราคม-มิถุนายน 2554.