TY - GEN
T1 - A speech parameter generation algorithm using local variance for HMM-based speech synthesis
AU - Chunwijitra, Vataya
AU - Nose, Takashi
AU - Kobayashi, Takao
PY - 2012
Y1 - 2012
N2 - This paper proposes a parameter generation algorithm using lo-cal variance (LV) constraint of spectral parameter trajectory for HMM-based speech synthesis. In the parameter generation pro-cess, we take account of both the HMM likelihood of speech feature vectors and a likelihood for LVs. To model LV precisely, we use dynamic features of LV with context-dependent HMMs. The objective experimental results show that the proposed tech-nique can generate a better spectral trajectory in terms of the spectral and LV distortions than a conventional technique with global variance (GV) constraint. The subjective experimental results also show that the proposed technique significantly im-prove the reproducibility of the synthetic speech than the con-ventional one.
AB - This paper proposes a parameter generation algorithm using lo-cal variance (LV) constraint of spectral parameter trajectory for HMM-based speech synthesis. In the parameter generation pro-cess, we take account of both the HMM likelihood of speech feature vectors and a likelihood for LVs. To model LV precisely, we use dynamic features of LV with context-dependent HMMs. The objective experimental results show that the proposed tech-nique can generate a better spectral trajectory in terms of the spectral and LV distortions than a conventional technique with global variance (GV) constraint. The subjective experimental results also show that the proposed technique significantly im-prove the reproducibility of the synthetic speech than the con-ventional one.
KW - HMM-based speech synthesis
KW - Local variance
KW - Over-smoothing problem
KW - Speech parameter generation
UR - http://www.scopus.com/inward/record.url?scp=84878412344&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84878412344&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84878412344
SN - 9781622767595
T3 - 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
SP - 1150
EP - 1153
BT - 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
T2 - 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Y2 - 9 September 2012 through 13 September 2012
ER -