A Real Time Noise-Robust Speech Recognition System

Naoya Wada
Shingo Yoshizawa
Yoshikazu Miyanaga

Abstract

This paper introduces the extraction of speech features that provide noise robustness for speech recognition. It also describes in detail two advanced speech analysis techniques, RSF (Running Spectrum Filtering) and DRA (Dynamic Range Adjustment). New experiments on phrase recognition were carried out using 40 male and female speakers for training and 5 other male and female speakers for recognition. For example, the recognition rate improves from 17% to 63% under car noise at an SNR of -10 dB, which shows the high noise robustness of the proposed system. In addition, a new parallel/pipelined LSI design of the system is proposed, which considerably reduces the calculation time. With this architecture, real-time speech recognition can be achieved. Both a full-custom LSI design and an FPGA design are introduced for this system.
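To make the -10 dB test condition concrete, the following minimal Python sketch mixes noise into a signal at a chosen SNR and applies a simple peak normalization to a feature trajectory. It is an illustration under stated assumptions, not the authors' implementation: the function names, the synthetic sine "speech", the Gaussian "noise", and the peak-normalization form used for DRA are all hypothetical choices, since the abstract does not specify how DRA or RSF are computed. RSF is not sketched here.

```python
import math
import random


def power(x):
    """Mean squared amplitude of a sequence of samples."""
    return sum(v * v for v in x) / len(x)


def mix_at_snr(speech, noise, snr_db):
    """Scale the noise so the speech-to-noise power ratio equals snr_db, then add it."""
    target_noise_power = power(speech) / (10.0 ** (snr_db / 10.0))
    gain = math.sqrt(target_noise_power / power(noise))
    scaled = [gain * n for n in noise]
    mixed = [s + n for s, n in zip(speech, scaled)]
    return mixed, scaled


def dynamic_range_adjust(trajectory):
    """Assumed DRA form: divide one feature's time trajectory by its peak magnitude."""
    peak = max(abs(v) for v in trajectory)
    if peak == 0:
        return list(trajectory)
    return [v / peak for v in trajectory]


random.seed(0)
# Synthetic stand-ins: a 200 Hz tone sampled at 8 kHz and white Gaussian noise.
speech = [math.sin(2 * math.pi * 200 * t / 8000) for t in range(8000)]
noise = [random.gauss(0.0, 1.0) for _ in range(8000)]

noisy, scaled_noise = mix_at_snr(speech, noise, -10.0)
measured_snr_db = 10.0 * math.log10(power(speech) / power(scaled_noise))
noise_to_speech_ratio = power(scaled_noise) / power(speech)

# Toy feature trajectory (one coefficient over three frames).
adjusted = dynamic_range_adjust([0.5, -2.0, 1.0])
```

At -10 dB SNR the noise power is ten times the speech power, which is why the reported condition is a severe one. The peak normalization maps each trajectory into the range [-1, 1], so clean and noisy features are compared on a similar scale.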

How to Cite
N. Wada, S. Yoshizawa, and Y. Miyanaga, “A Real Time Noise-Robust Speech Recognition System”, ECTI-CIT Transactions, vol. 1, no. 2, pp. 75–83, Mar. 2016.
Section
Artificial Intelligence and Machine Learning (AI)