TY - JOUR
T1 - Selection of optimum vocabulary and dialog strategy for noise-robust spoken dialog systems
AU - Ito, Akinori
AU - Oba, Takanobu
AU - Konashi, Takashi
AU - Suzuki, Motoyuki
AU - Makino, Shozo
PY - 2008/3
Y1 - 2008/3
N2 - Speech recognition in a noisy environment is one of the hottest topics in the speech recognition research. Noise-tolerant acoustic models or noise reduction techniques are often used to improve recognition accuracy. In this paper, we propose a method to improve accuracy of spoken dialog system from a language model point of view. In the proposed method, the dialog system automatically changes its language model and dialog strategy according to the estimated recognition accuracy in a noisy environment in order to keep the performance of the system high. In a noise-free environment, the system accepts any utterance from a user. On the other hand, the system restricts its grammar and vocabulary in a noisy environment. To realize this strategy, we investigated a method to avoid the user's out-of-grammar utterances through an instruction given by the system to a user. Furthermore, we developed a method to estimate recognition accuracy from features extracted from noise signals. Finally, we realized a proposed dialog system according to these investigations.
AB - Speech recognition in a noisy environment is one of the hottest topics in the speech recognition research. Noise-tolerant acoustic models or noise reduction techniques are often used to improve recognition accuracy. In this paper, we propose a method to improve accuracy of spoken dialog system from a language model point of view. In the proposed method, the dialog system automatically changes its language model and dialog strategy according to the estimated recognition accuracy in a noisy environment in order to keep the performance of the system high. In a noise-free environment, the system accepts any utterance from a user. On the other hand, the system restricts its grammar and vocabulary in a noisy environment. To realize this strategy, we investigated a method to avoid the user's out-of-grammar utterances through an instruction given by the system to a user. Furthermore, we developed a method to estimate recognition accuracy from features extracted from noise signals. Finally, we realized a proposed dialog system according to these investigations.
KW - Dialog strategy
KW - Neural network
KW - Noisy environment
KW - Speech recognition
KW - Spoken dialog system
UR - http://www.scopus.com/inward/record.url?scp=68149157967&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=68149157967&partnerID=8YFLogxK
U2 - 10.1093/ietisy/e91-d.3.538
DO - 10.1093/ietisy/e91-d.3.538
M3 - Article
AN - SCOPUS:68149157967
SN - 0916-8532
VL - E91-D
SP - 538
EP - 548
JO - IEICE Transactions on Information and Systems
JF - IEICE Transactions on Information and Systems
IS - 3
ER -