AN FO CONTOUR CONTROL MODEL FOR TOTALLY SPEAKER DRIVEN TEXT TO SPEECH SYSTEM

Takehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine

Research output: Contribution to conferencePaperpeer-review

13 Citations (Scopus)

Abstract

Totally Speaker Driven Text to Speech System produces high quality and natural speech resembling the acoustic and prosodic characteristics of the original speech corpus. In the FO contour control of this system, an FO contour of a whole sentence is produced by concatenating segmental FO contours generated by modifying vectors that arc representatives of typical FO contours. The representative vectors arc selected from the FO contour codebook, which is designed so as to minimize the approximation error between FO contours generated by the proposed model and real FO contours extracted from a speech corpus. It was confirmed by experiments with Japanese speech corpus that FO contours can be modeled with small approximation errors by only 48 representative vectors, and the synthetic speech sounded very natural and resembled the prosodic characteristics of the original speaker.

Original languageEnglish
Publication statusPublished - 1998
Externally publishedYes
Event5th International Conference on Spoken Language Processing, ICSLP 1998 - Sydney, Australia
Duration: 1998 Nov 301998 Dec 4

Conference

Conference5th International Conference on Spoken Language Processing, ICSLP 1998
Country/TerritoryAustralia
CitySydney
Period98/11/3098/12/4

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'AN FO CONTOUR CONTROL MODEL FOR TOTALLY SPEAKER DRIVEN TEXT TO SPEECH SYSTEM'. Together they form a unique fingerprint.

Cite this