Joint Learning of Context and Feedback Embeddings in Spoken Dialogue

被引:0
|
作者
Qian, Livia [1 ]
Skantze, Gabriel [1 ]
机构
[1] KTH Royal Inst Technol, Stockholm, Sweden
来源
基金
瑞典研究理事会;
关键词
conversational systems; representation learning; unsupervised learning; backchannel; contrastive learning; feedback; dialogue; function;
D O I
10.21437/Interspeech.2024-1082
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Short feedback responses, such as backchannels, play an important role in spoken dialogue. So far, most of the modeling of feedback responses has focused on their timing, often neglecting how their lexical and prosodic form influence their contextual appropriateness and conversational function. In this paper, we investigate the possibility of embedding short dialogue contexts and feedback responses in the same representation space using a contrastive learning objective. In our evaluation, we primarily focus on how such embeddings can be used as a context-feedback appropriateness metric and thus for feedback response ranking in U.S. English dialogues. Our results show that the model outperforms humans given the same ranking task and that the learned embeddings carry information about the conversational function of feedback responses.
引用
收藏
页码:2955 / 2959
页数:5
相关论文
共 50 条
  • [1] Dialogue management in spoken dialogue system with visual feedback
    Ge, Wendong
    Xu, Bo
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8862 : 856 - 862
  • [2] Dialogue Management in Spoken Dialogue System with Visual Feedback
    Ge, Wendong
    Xu, Bo
    PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 856 - 862
  • [3] JOINT LEARNING OF WORD AND LABEL EMBEDDINGS FOR SEQUENCE LABELLING IN SPOKEN LANGUAGE UNDERSTANDING
    Wu, Jiewen
    D'Haro, Luis Fernando
    Chen, Nancy F.
    Krishnaswamy, Pavitra
    Banchs, Rafael E.
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 800 - 806
  • [4] Dialogue-Learning Correlations in Spoken Dialogue Tutoring
    Forbes-Riley, Kate
    Litman, Diane
    Huettner, Alison
    Ward, Arthur
    ARTIFICIAL INTELLIGENCE IN EDUCATION: SUPPORTING LEARNING THROUGH INTELLIGENT AND SOCIALLY INFORMED TECHNOLOGY, 2005, 125 : 225 - 232
  • [5] Conditional Joint Model for Spoken Dialogue System
    Li, Changliang
    Zhao, Yan
    Yu, Dong
    COGNITIVE COMPUTING - ICCC 2019, 2019, 11518 : 26 - 36
  • [6] Towards End-to-End Spoken Dialogue Systems with Turn Embeddings
    Bayer, Ali Orkan
    Stepanov, Evgeny A.
    Riccardi, Giuseppe
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2516 - 2520
  • [7] Reinforcement learning for spoken dialogue systems
    Singh, S
    Kearns, M
    Litman, D
    Walker, M
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 956 - 962
  • [8] Learning to ground in spoken dialogue systems
    Pietquin, Olivier
    2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol IV, Pts 1-3, 2007, : 165 - 168
  • [9] Spoken dialogue system for learning Braille
    Araki, Masahiro
    Shibahara, Kana
    Mizukami, Yuko
    2011 35TH IEEE ANNUAL INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2011, : 152 - 156
  • [10] Machine Learning for Spoken Dialogue Systems
    Lemon, Oliver
    Pietquin, Olivier
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1761 - +