TY - GEN
T1 - Intonation evaluation of English utterances using synthesized speech for computer-assisted language learning
AU - Konno, Tomoaki
AU - Ito, Akinori
AU - Ito, Masashi
AU - Makino, Shozo
AU - Suzuki, Motoyuki
PY - 2008
Y1 - 2008
N2 - In this paper, we describe a system that uses synthesized speech to evaluate the intonation of English utterances by native Japanese speakers, with the aim of rapid development of a CALL system. Evaluating the intonation of a learner's utterance requires reference utterances, which should ideally come from native English speakers. However, it is costly to gather native speakers' utterances for all sentences in the system. Therefore, we examined an intonation evaluation method that uses synthesized speech generated by text-to-speech systems instead of real speech. The intonation evaluation system calculates scores between a learner's utterance and the corresponding utterances by the teachers. We investigated a method of combining multiple scores. In addition, we incorporated a feature for rhythm evaluation into the intonation evaluation. As a result, the correlation between the scores given by human evaluators and those given by the system improved. Furthermore, we analyzed the tendencies of the system's intonation evaluation by limiting the set of evaluated utterances, in order to find out what degrades system performance.
AB - In this paper, we describe a system that uses synthesized speech to evaluate the intonation of English utterances by native Japanese speakers, with the aim of rapid development of a CALL system. Evaluating the intonation of a learner's utterance requires reference utterances, which should ideally come from native English speakers. However, it is costly to gather native speakers' utterances for all sentences in the system. Therefore, we examined an intonation evaluation method that uses synthesized speech generated by text-to-speech systems instead of real speech. The intonation evaluation system calculates scores between a learner's utterance and the corresponding utterances by the teachers. We investigated a method of combining multiple scores. In addition, we incorporated a feature for rhythm evaluation into the intonation evaluation. As a result, the correlation between the scores given by human evaluators and those given by the system improved. Furthermore, we analyzed the tendencies of the system's intonation evaluation by limiting the set of evaluated utterances, in order to find out what degrades system performance.
KW - CALL
KW - Intonation
KW - Mahalanobis distance
KW - Multiple regression
KW - Prosody
UR - http://www.scopus.com/inward/record.url?scp=67650382223&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=67650382223&partnerID=8YFLogxK
U2 - 10.1109/NLPKE.2008.4906807
DO - 10.1109/NLPKE.2008.4906807
M3 - Conference contribution
AN - SCOPUS:67650382223
SN - 9781424427802
T3 - 2008 International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2008
BT - 2008 International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2008
T2 - 2008 International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2008
Y2 - 19 October 2008 through 22 October 2008
ER -