TY - GEN
T1 - An F0 modeling technique based on prosodic events for spontaneous speech synthesis
AU - Koriyama, Tomoki
AU - Nose, Takashi
AU - Kobayashi, Takao
PY - 2012
Y1 - 2012
N2 - This paper proposes a technique for effective modeling of F0 contours using prosodic-event-based HMM units for HMM-based spontaneous speech synthesis. The modeling unit corresponds to one of prosodic event segments such as pitch falling by accent and pitch rising by boundary pitch movement (BPM). Since the prosodic events of one phrase are generally less frequent than the changes of phonemes, the proposed unit is expected to reduce the number of model parameters of F0, which leads to robust parameter estimation. The objective and subjective experiments using spontaneous conversational speech data show that the proposed technique can significantly reduce the number of model parameters while keeping the naturalness of the synthetic speech.
AB - This paper proposes a technique for effective modeling of F0 contours using prosodic-event-based HMM units for HMM-based spontaneous speech synthesis. The modeling unit corresponds to one of prosodic event segments such as pitch falling by accent and pitch rising by boundary pitch movement (BPM). Since the prosodic events of one phrase are generally less frequent than the changes of phonemes, the proposed unit is expected to reduce the number of model parameters of F0, which leads to robust parameter estimation. The objective and subjective experiments using spontaneous conversational speech data show that the proposed technique can significantly reduce the number of model parameters while keeping the naturalness of the synthetic speech.
KW - F0 modeling
KW - HMM-based speech synthesis
KW - Prosodic events
KW - Spontaneous speech
UR - http://www.scopus.com/inward/record.url?scp=84867599581&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84867599581&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2012.6288940
DO - 10.1109/ICASSP.2012.6288940
M3 - Conference contribution
AN - SCOPUS:84867599581
SN - 9781467300469
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 4589
EP - 4592
BT - 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 - Proceedings
T2 - 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012
Y2 - 25 March 2012 through 30 March 2012
ER -