Unsegmented Dialogue Act Annotation and Decoding With N-Gram Transducers

被引:5
|
作者
Martinez-Hinarejos, Carlos-D. [1 ]
Benedi, Jose-Miguel [1 ]
Tamarit, Vicent [1 ]
机构
[1] Univ Politecn Valencia, Pattern Recognit & Human Language Technol Ctr, Valencia 46022, Spain
关键词
Dialogue annotation; n-gram transducer; spoken dialogue systems; SEGMENTATION; FRAMEWORK; AGENDA; STATE;
D O I
10.1109/TASLP.2014.2377595
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most studies on dialogue corpora, as well as most dialogue systems, employ dialogue acts as the basic units for interpreting discourse structure, user input and system actions. The definition of the discourse structure and the dialogue strategy consequently require the tagging of dialogue corpora in terms of dialogue acts. The tagging problem presents two basic variants: a batch variant (annotation of whole dialogues, in order to define dialogue strategy or study discourse structure) and an online variant (decoding of the dialogue act sequence of a given turn, in order to interpret user intentions). In the two variants is unusual having the segmentation of each turn into the dialogue meaningful units (segments) to which a dialogue act is assigned. In this paper we present the use of the N-Gram Transducer technique for tagging dialogues, without needing to provide a prior segmentation, in these two different variants (dialogue annotation and turn decoding). Experiments were performed in two corpora of different nature and results show that N-Gram Transducer models are suitable for these tasks and provide good performance.
引用
收藏
页码:198 / 211
页数:14
相关论文
共 50 条
  • [11] Pipilika N-gram Viewer: An Efficient Large Scale N-gram Model for Bengali
    Ahmad, Adnan
    Talha, Mahbubur Rub
    Amin, Md. Ruhul
    Chowdhury, Farida
    2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,
  • [12] N-gram MalGAN:Evading machine learning detection via feature n-gram
    Enmin Zhu
    Jianjie Zhang
    Jijie Yan
    Kongyang Chen
    Chongzhi Gao
    Digital Communications and Networks, 2022, 8 (04) : 485 - 491
  • [13] N-gram模型综述
    尹陈
    吴敏
    计算机系统应用, 2018, 27 (10) : 33 - 38
  • [14] N-gram over Context
    Kawamae, Noriaki
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 1045 - 1055
  • [15] N-gram similarity and distance
    Kondrak, Grzegorz
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2005, 3772 : 115 - 126
  • [16] Increasing robustness of handwriting recognition using character n-gram decoding on large lexica
    Schall, Martin
    Schambach, Marc-Peter
    Franz, Matthias O.
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 156 - 161
  • [17] BIGRAM VS N-GRAM
    HALPIN, P
    BYTE, 1988, 13 (08): : 26 - 26
  • [18] Recasting the discriminative n-gram model as a pseudo-conventional n-gram model for LVCSR
    Zhou, Zhengyu
    Meng, Helen
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4933 - 4936
  • [19] Dialogue Act Annotation for Statistically Managed Spoken Dialogue Systems
    Ohtake, Kiyonori
    Misu, Teruhisa
    Hori, Chiori
    Kashioka, Hideki
    Nakamura, Satoshi
    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON UNIVERSAL COMMUNICATION, 2008, : 416 - 422
  • [20] N-gram approach for gender prediction
    Reddy, T. Raghunadha
    Vardhan, B. Vishnu
    Reddy, P. Vijayapal
    2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 860 - 865