Unsegmented Dialogue Act Annotation and Decoding With N-Gram Transducers

被引:5
|
作者
Martinez-Hinarejos, Carlos-D. [1 ]
Benedi, Jose-Miguel [1 ]
Tamarit, Vicent [1 ]
机构
[1] Univ Politecn Valencia, Pattern Recognit & Human Language Technol Ctr, Valencia 46022, Spain
关键词
Dialogue annotation; n-gram transducer; spoken dialogue systems; SEGMENTATION; FRAMEWORK; AGENDA; STATE;
D O I
10.1109/TASLP.2014.2377595
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most studies on dialogue corpora, as well as most dialogue systems, employ dialogue acts as the basic units for interpreting discourse structure, user input and system actions. The definition of the discourse structure and the dialogue strategy consequently require the tagging of dialogue corpora in terms of dialogue acts. The tagging problem presents two basic variants: a batch variant (annotation of whole dialogues, in order to define dialogue strategy or study discourse structure) and an online variant (decoding of the dialogue act sequence of a given turn, in order to interpret user intentions). In the two variants is unusual having the segmentation of each turn into the dialogue meaningful units (segments) to which a dialogue act is assigned. In this paper we present the use of the N-Gram Transducer technique for tagging dialogues, without needing to provide a prior segmentation, in these two different variants (dialogue annotation and turn decoding). Experiments were performed in two corpora of different nature and results show that N-Gram Transducer models are suitable for these tasks and provide good performance.
引用
收藏
页码:198 / 211
页数:14
相关论文
共 50 条
  • [1] Improving unsegmented dialogue turns annotation with N-gram transducers
    Instituto Tecnologico de Informatica, Universidad Politecnica de Valencia, Camino de Vera, s/n, 46022 Valencia, Spain
    PACLIC 23 - Proc. 23rd Pacific Asia Conf. Lang. Inf. Comput., 2009, (335-344):
  • [2] ON THE USE OF N-GRAM TRANSDUCERS FOR DIALOGUE ANNOTATION
    Tamarit, Vicent
    Martinez-Hinarejos, Carlos-D.
    Benedi, Jose-Miguel
    SPOKEN DIALOGUE SYSTEMS: TECHNOLOGY AND DESIGN, 2011, : 255 - 276
  • [3] Direct and Wordgraph-Based Confidence Measures in Dialogue Annotation with N-Gram Transducers
    Martinez-Hinarejos, Carlos-D.
    Tamarit, Vicent
    Benedi, Jose-Miguel
    HUMAN LANGUAGE TECHNOLOGY CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2014, 8387 : 264 - 275
  • [4] N-gram language models for document image decoding
    Kopec, GE
    Said, MR
    Popat, K
    DOCUMENT RECOGNITION AND RETRIEVAL IX, 2002, 4670 : 191 - 202
  • [5] Inference of stochastic finite-state transducers using N-gram mixtures
    Alabau, Vicente
    Casacuberta, Francisco
    Vidal, Enrique
    Juan, Alfons
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2007, 4478 : 282 - +
  • [6] High Order N-gram Model Construction and Application Based on Natural Annotation
    Wang, Qibo
    Rao, Gaoqi
    Xun, Endong
    CHINESE LEXICAL SEMANTICS (CLSW 2019), 2020, 11831 : 321 - 328
  • [7] N-gram Insight
    Prans, George
    AMERICAN SCIENTIST, 2011, 99 (05) : 356 - 357
  • [8] Evaluation of HMM-based models for the annotation of unsegmented dialogue turns
    Martinez-Hinarejos, Carlos-D.
    Tamarit, Vicent
    Benedi, Jose-Miguel
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1608 - 1613
  • [9] N-gram MalGAN: Evading machine learning detection via feature n-gram
    Zhu, Enmin
    Zhang, Jianjie
    Yan, Jijie
    Chen, Kongyang
    Gao, Chongzhi
    DIGITAL COMMUNICATIONS AND NETWORKS, 2022, 8 (04) : 485 - 491
  • [10] Pseudo-Conventional N-Gram Representation of the Discriminative N-Gram Model for LVCSR
    Zhou, Zhengyu
    Meng, Helen
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (06) : 943 - 952