TY - GEN
T1 - An analysis of the effect of emotional speech synthesis on non-task-oriented dialogue system
AU - Chiba, Yuya
AU - Nose, Takashi
AU - Yamanaka, Mai
AU - Kase, Taketo
AU - Ito, Akinori
N1 - Funding Information:
Part of this work was supported by JSPS KAK-ENHI Grant Numbers JP15H02720, JP16K13253, and JP17H00823.
Publisher Copyright:
© 2018 Association for Computational Linguistics
PY - 2018
Y1 - 2018
N2 - This paper explores the effect of emotional speech synthesis on a spoken dialogue system when the dialogue is non-task-oriented. Although the use of emotional speech responses has been shown to be effective in a limited domain, e.g., scenario-based and counseling dialogue, the effect is still not clear in the non-task-oriented dialogue such as voice chat. For this purpose, we constructed a simple dialogue system with example- and rule-based dialogue management. In the system, two types of emotion labeling with emotion estimation are adopted, i.e., system-driven and user-cooperative emotion labeling. We conducted a dialogue experiment where subjects evaluate the subjective quality of the system and the dialogue from multiple aspects such as richness of the dialogue and impression of the agent. We then analyze and discuss the results and show the advantage of using appropriate emotions for expressive speech responses in the non-task-oriented system.
AB - This paper explores the effect of emotional speech synthesis on a spoken dialogue system when the dialogue is non-task-oriented. Although the use of emotional speech responses has been shown to be effective in a limited domain, e.g., scenario-based and counseling dialogue, the effect is still not clear in the non-task-oriented dialogue such as voice chat. For this purpose, we constructed a simple dialogue system with example- and rule-based dialogue management. In the system, two types of emotion labeling with emotion estimation are adopted, i.e., system-driven and user-cooperative emotion labeling. We conducted a dialogue experiment where subjects evaluate the subjective quality of the system and the dialogue from multiple aspects such as richness of the dialogue and impression of the agent. We then analyze and discuss the results and show the advantage of using appropriate emotions for expressive speech responses in the non-task-oriented system.
UR - http://www.scopus.com/inward/record.url?scp=85057129672&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85057129672&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85057129672
T3 - SIGDIAL 2018 - 19th Annual Meeting of the Special Interest Group on Discourse and Dialogue - Proceedings of the Conference
SP - 371
EP - 375
BT - SIGDIAL 2018 - 19th Annual Meeting of the Special Interest Group on Discourse and Dialogue - Proceedings of the Conference
PB - Association for Computational Linguistics (ACL)
T2 - 19th Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2018
Y2 - 12 July 2018 through 14 July 2018
ER -