Abstract
This paper describes a technique for controlling voice quality of synthetic speech using multiple regression hidden semi-Markov model (HSMM). In the technique, we assume that the mean vectors of output and state duration distribution of HSMM are modeled by multiple regression with a parameter vector called voice quality control vector. We first choose three features for controlling voice qualities, that is, "smooth voice - nonsmooth voice," "warm - cold," "high-pitched - low-pitched," and then we attempt to control voice quality of synthetic speech for these features. From the results of several subjective tests, we show that the proposed technique can change these features of voice quality intuitively.
Original language | English |
---|---|
Title of host publication | INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP |
Publisher | International Speech Communication Association |
Pages | 2438-2441 |
Number of pages | 4 |
Volume | 5 |
ISBN (Print) | 9781604234497 |
Publication status | Published - 2006 Jan 1 |
Externally published | Yes |
Event | INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP - Pittsburgh, PA, United States Duration: 2006 Sept 17 → 2006 Sept 21 |
Other
Other | INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP |
---|---|
Country/Territory | United States |
City | Pittsburgh, PA |
Period | 06/9/17 → 06/9/21 |
Keywords
- HMM-based speech synthesis
- HSMM
- Multiple regression HMM
- Voice quality control
ASJC Scopus subject areas
- Computer Science(all)