A speaker adaptation technique for mrhsmm-based style control of synthetic speech

Takashi Nose, Yoichi Kato, Takao Kobayashi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

17 Citations (Scopus)

Abstract

This paper describes a speaker adaptation technique for style control based on multiple regression hidden semi-Markov model (MRHSMM). In the MRHSMM-based style control technique, when available training data is very small. the resultant model would produce unnatural sounding speech. To overcome this problem, we propose a model adaptation technique for MRHSMM, which is similar to the MLLR adaptation technique used in speech recognition and speech synthesis. We formulate the model adaptation problem for MRHSMM based on a linear transformation framework and derive re-estimation formulas for transformation matrices in ML sense. We also describe the results of subjective evaluation tests.

Original languageEnglish
Title of host publication2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
PagesIV833-IV836
DOIs
Publication statusPublished - 2007
Event2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07 - Honolulu, HI, United States
Duration: 2007 Apr 152007 Apr 20

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume4
ISSN (Print)1520-6149

Conference

Conference2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
Country/TerritoryUnited States
CityHonolulu, HI
Period07/4/1507/4/20

Keywords

  • Expressive speech synthesis
  • Hidden Markov model
  • MLLR
  • Speaker adaptation
  • Style control

Fingerprint

Dive into the research topics of 'A speaker adaptation technique for mrhsmm-based style control of synthetic speech'. Together they form a unique fingerprint.

Cite this