Thu-2-4-5 Semi-supervised learning for character expression of spoken dialogue systems

Kenta Yamamoto (Kyoto University), Koji Inoue (Kyoto University), and Tatsuya Kawahara (Kyoto University)

Abstract: We address character expression (e.g., extrovert) for spoken dialogue systems. While conventional studies have focused on controlling linguistic expressions, we focus on spoken dialogue behaviors. Specifically, the proposed model maps three character traits (extroversion, emotional instability, and politeness) to four spoken dialogue behaviors (utterance amount, backchannels, fillers, and switching pause length). Collecting annotated data to train this kind of model is costly. We therefore propose a semi-supervised learning approach that uses not only character impression data (labeled) but also corpus data (unlabeled). Experimental results show that the proposed model expresses the target character traits through these behaviors more precisely than a baseline model trained with supervised learning only. We also investigate how to model an unlabeled behavior (e.g., speech rate) by exploiting the advantage of semi-supervised learning.
