Direct and Wordgraph-Based Confidence Measures in Dialogue Annotation with N-Gram Transducers

Cited by: 0
Authors
Martinez-Hinarejos, Carlos-D. [1]
Tamarit, Vicent [1]
Benedi, Jose-Miguel [1]
Affiliations
[1] Univ Politecn Valencia, PRHLT Res Ctr, Valencia 46022, Spain
Source
HUMAN LANGUAGE TECHNOLOGY CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS | 2014 / Vol. 8387
Keywords
Dialogue annotation; Confidence measures; N-gram transducers
DOI
10.1007/978-3-319-08958-4_22
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Dialogue annotation is a necessary step in the development of dialogue systems, especially for data-based dialogue strategies. Manual annotation is hard and time-consuming, so automatic techniques can be used to obtain a draft annotation and speed up the process. Presenting the draft annotation with confidence levels on the correctness of every part of the hypothesis can make the supervision process even faster. In this paper we propose two methods to calculate confidence measures for an automatic dialogue annotation model, and test them on the annotation of a task-oriented human-computer corpus on railway information. The results show that our proposals behave similarly and that they are a good starting point for incorporating confidence measures into the dialogue annotation process.
Pages: 264-275
Number of pages: 12