TY - GEN
T1 - Parametric speech synthesis using local and global sparse Gaussian processes
AU - Koriyama, Tomoki
AU - Nose, Takashi
AU - Kobayashi, Takao
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2014/11/14
Y1 - 2014/11/14
N2 - This paper describes an application of Gaussian process regression (GPR) to parametric speech synthesis. GPR enables us to predict synthetic speech parameters by utilizing exemplars of training speech data directly without converting the acoustic features of training data into too small number of model parameters thanks to nonparametric Bayesian regression. However, GPR inherently requires high computational cost and resources. In this paper, to alleviate this problem, we incorporate local and global sparse Gaussian process approximation into the statistical speech synthesis framework, and investigate trade-off between computational cost and speech synthesis performance through experiments. Moreover, we examine the way of choosing pseudo data set used for the sparse GP approximation.
AB - This paper describes an application of Gaussian process regression (GPR) to parametric speech synthesis. GPR enables us to predict synthetic speech parameters by utilizing exemplars of training speech data directly without converting the acoustic features of training data into too small number of model parameters thanks to nonparametric Bayesian regression. However, GPR inherently requires high computational cost and resources. In this paper, to alleviate this problem, we incorporate local and global sparse Gaussian process approximation into the statistical speech synthesis framework, and investigate trade-off between computational cost and speech synthesis performance through experiments. Moreover, we examine the way of choosing pseudo data set used for the sparse GP approximation.
KW - Gaussian process regression
KW - parametric speech synthesis
KW - partially independent conditional (PIC) approximation
UR - http://www.scopus.com/inward/record.url?scp=84912544573&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84912544573&partnerID=8YFLogxK
U2 - 10.1109/MLSP.2014.6958921
DO - 10.1109/MLSP.2014.6958921
M3 - Conference contribution
AN - SCOPUS:84912544573
T3 - IEEE International Workshop on Machine Learning for Signal Processing, MLSP
BT - IEEE International Workshop on Machine Learning for Signal Processing, MLSP
A2 - Adali, Tulay
A2 - Larsen, Jan
A2 - Mboup, Mamadou
A2 - Moreau, Eric
PB - IEEE Computer Society
T2 - 2014 24th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2014
Y2 - 21 September 2014 through 24 September 2014
ER -