Deep Dialog Act Recognition using Multiple Token, Segment, and Context Information Representations

Authors
Ribeiro, Eugenio [1]
Ribeiro, Ricardo [2]
de Matos, David Martins [1]
Affiliations
[1] Univ Lisbon, Inst Super Tecn, INESC ID Lisboa, L2F,Spoken Language Syst Lab, Lisbon, Portugal
[2] IUL, ISCTE, INESC ID Lisboa, L2F,Spoken Language Syst Lab, Lisbon, Portugal
Keywords
CLASSIFICATION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Automatic dialog act recognition is a task that has been widely explored over the years. In recent works, most approaches to the task explored different deep neural network architectures to combine the representations of the words in a segment and generate a segment representation that provides cues for intention. In this study, we explore means to generate more informative segment representations, not only by exploring different network architectures, but also by considering different token representations, not only at the word level, but also at the character and functional levels. At the word level, in addition to the commonly used uncontextualized embeddings, we explore the use of contextualized representations, which are able to provide information concerning word sense and segment structure. Character-level tokenization is important to capture intention-related morphological aspects that cannot be captured at the word level. Finally, the functional level provides an abstraction from words, which shifts the focus to the structure of the segment. Additionally, we explore approaches to enrich the segment representation with context information from the history of the dialog, both in terms of the classifications of the surrounding segments and the turn-taking history. This kind of information has already been proved important for the disambiguation of dialog acts in previous studies. Nevertheless, we are able to capture additional information by considering a summary of the dialog history and a wider turn-taking context. By combining the best approaches at each step, we achieve performance results that surpass the previous state-of-the-art on generic dialog act recognition on both the Switchboard Dialog Act Corpus (SwDA) and the ICSI Meeting Recorder Dialog Act Corpus (MRDA), which are two of the most widely explored corpora for the task. Furthermore, by considering both past and future context, similarly to what happens in an annotation scenario, our approach achieves a performance similar to that of a human annotator on SwDA and surpasses it on MRDA.
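The input side of the ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the label set, and the window size `n_prev` are hypothetical, the neural segment encoder is left out entirely, and functional-level tokens are omitted because the abstract does not say how they are derived. The sketch only shows a segment tokenized at the word and character levels, and a context vector built from the classifications of preceding segments plus turn-taking flags.

```python
from typing import Dict, List, Sequence


def tokenize_levels(segment: str) -> Dict[str, List[str]]:
    """Return word-level and character-level token sequences for one segment."""
    return {
        "word": segment.lower().split(),  # whitespace word tokens (assumption)
        "char": list(segment),            # every character, case preserved
    }


def one_hot(label: str, label_set: Sequence[str]) -> List[float]:
    """Encode a dialog act label as a one-hot vector over label_set."""
    return [1.0 if label == candidate else 0.0 for candidate in label_set]


def context_features(
    prev_labels: Sequence[str],
    speaker_changes: Sequence[bool],
    label_set: Sequence[str],
    n_prev: int = 3,  # hypothetical window size
) -> List[float]:
    """Concatenate one-hot labels of the last n_prev segments with
    turn-taking flags (1.0 where the speaker changed), zero-padded on the left."""
    recent = list(prev_labels)[-n_prev:]
    vectors = [[0.0] * len(label_set) for _ in range(n_prev - len(recent))]
    vectors += [one_hot(label, label_set) for label in recent]
    flat = [value for vector in vectors for value in vector]

    flags = [1.0 if changed else 0.0 for changed in list(speaker_changes)[-n_prev:]]
    flags = [0.0] * (n_prev - len(flags)) + flags
    return flat + flags


if __name__ == "__main__":
    labels = ["statement", "question", "backchannel"]  # hypothetical label set
    print(tokenize_levels("Is that right?")["word"])
    print(context_features(["question", "statement"], [True, False], labels))
```

In a full system, a vector like the one returned by `context_features` would be concatenated with the learned segment representation before classification; that step depends on the chosen network and is not shown here.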
Pages: 861-899
Page count: 39
Source
Journal of Artificial Intelligence Research, 2019, 66: 861-899