TY - GEN
T1 - Multimodal Dialogue Response Timing Estimation Using Dialogue Context Encoder
AU - Yahagi, Ryota
AU - Chiba, Yuya
AU - Nose, Takashi
AU - Ito, Akinori
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2022
Y1 - 2022
N2 - Spoken dialogue systems need to determine when to respond to a user in addition to the response. Various cues, such as prosody, gaze, and facial expression are known to affect response timing. Recent studies have revealed that using the representation of a system response improves the performance of response timing prediction. However, it is difficult to directly use a future response with dialogue systems that require an entire user utterance to generate a response. This study proposes a neural-based response timing estimation model using past utterances to alleviate this problem. The proposed model is expected to consider the intention of the system response implicitly.
AB - Spoken dialogue systems need to determine when to respond to a user in addition to the response. Various cues, such as prosody, gaze, and facial expression are known to affect response timing. Recent studies have revealed that using the representation of a system response improves the performance of response timing prediction. However, it is difficult to directly use a future response with dialogue systems that require an entire user utterance to generate a response. This study proposes a neural-based response timing estimation model using past utterances to alleviate this problem. The proposed model is expected to consider the intention of the system response implicitly.
UR - http://www.scopus.com/inward/record.url?scp=85142755278&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85142755278&partnerID=8YFLogxK
U2 - 10.1007/978-981-19-5538-9_9
DO - 10.1007/978-981-19-5538-9_9
M3 - Conference contribution
AN - SCOPUS:85142755278
SN - 9789811955372
T3 - Lecture Notes in Electrical Engineering
SP - 133
EP - 141
BT - Conversational AI for Natural Human-Centric Interaction - 12th International Workshop on Spoken Dialogue System Technology, IWSDS 2021
A2 - Stoyanchev, Svetlana
A2 - Ultes, Stefan
A2 - Li, Haizhou
PB - Springer Science and Business Media Deutschland GmbH
T2 - 12th International Workshop on Spoken Dialogue System Technology, IWSDS 2021
Y2 - 15 November 2021 through 17 November 2021
ER -