TY - GEN
T1 - A speaker adaptation technique for mrhsmm-based style control of synthetic speech
AU - Nose, Takashi
AU - Kato, Yoichi
AU - Kobayashi, Takao
PY - 2007
Y1 - 2007
N2 - This paper describes a speaker adaptation technique for style control based on multiple regression hidden semi-Markov model (MRHSMM). In the MRHSMM-based style control technique, when available training data is very small. the resultant model would produce unnatural sounding speech. To overcome this problem, we propose a model adaptation technique for MRHSMM, which is similar to the MLLR adaptation technique used in speech recognition and speech synthesis. We formulate the model adaptation problem for MRHSMM based on a linear transformation framework and derive re-estimation formulas for transformation matrices in ML sense. We also describe the results of subjective evaluation tests.
AB - This paper describes a speaker adaptation technique for style control based on multiple regression hidden semi-Markov model (MRHSMM). In the MRHSMM-based style control technique, when available training data is very small. the resultant model would produce unnatural sounding speech. To overcome this problem, we propose a model adaptation technique for MRHSMM, which is similar to the MLLR adaptation technique used in speech recognition and speech synthesis. We formulate the model adaptation problem for MRHSMM based on a linear transformation framework and derive re-estimation formulas for transformation matrices in ML sense. We also describe the results of subjective evaluation tests.
KW - Expressive speech synthesis
KW - Hidden Markov model
KW - MLLR
KW - Speaker adaptation
KW - Style control
UR - http://www.scopus.com/inward/record.url?scp=34547550083&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34547550083&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2007.367042
DO - 10.1109/ICASSP.2007.367042
M3 - Conference contribution
AN - SCOPUS:34547550083
SN - 1424407281
SN - 9781424407286
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - IV833-IV836
BT - 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
T2 - 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
Y2 - 15 April 2007 through 20 April 2007
ER -