TY - GEN
T1 - Parametric speech synthesis based on Gaussian process regression using global variance and hyperparameter optimization
AU - Koriyama, Tomoki
AU - Nose, Takashi
AU - Kobayashi, Takao
PY - 2014
Y1 - 2014
N2 - This paper examines two issues of a statistical speech synthesis approach based Gaussian process (GP) regression. Although GP-based speech synthesis can give higher performance in generating spectral parameters than the HMM-based one, a number of issues still remain. In this paper, we incorporate global variance (GV) feature to overcome over-smoothing problem into the parameter generation. Furthermore, in order to utilize an appropriate kernel function in accordance with actual data, we propose an EM-based kernel hyperparameter optimization technique. Objective and subjective evaluation results show that using GV and hyperparameter estimation enhanced the performance in spectral feature generation.
AB - This paper examines two issues of a statistical speech synthesis approach based Gaussian process (GP) regression. Although GP-based speech synthesis can give higher performance in generating spectral parameters than the HMM-based one, a number of issues still remain. In this paper, we incorporate global variance (GV) feature to overcome over-smoothing problem into the parameter generation. Furthermore, in order to utilize an appropriate kernel function in accordance with actual data, we propose an EM-based kernel hyperparameter optimization technique. Objective and subjective evaluation results show that using GV and hyperparameter estimation enhanced the performance in spectral feature generation.
KW - Gaussian process
KW - global variance
KW - kernel hyperparameter
KW - statistical parametric speech synthesis
UR - http://www.scopus.com/inward/record.url?scp=84905252490&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84905252490&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2014.6854319
DO - 10.1109/ICASSP.2014.6854319
M3 - Conference contribution
AN - SCOPUS:84905252490
SN - 9781479928927
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 3834
EP - 3838
BT - 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014
Y2 - 4 May 2014 through 9 May 2014
ER -