Direct and Wordgraph-Based Confidence Measures in Dialogue Annotation with N-Gram Transducers

被引:0
|
作者
Martinez-Hinarejos, Carlos-D. [1 ]
Tamarit, Vicent [1 ]
Benedi, Jose-Miguel [1 ]
机构
[1] Univ Politecn Valencia, PRHLT Res Ctr, Valencia 46022, Spain
来源
HUMAN LANGUAGE TECHNOLOGY CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS | 2014年 / 8387卷
关键词
Dialogue annotation; Confidence measures; N-gram transducers;
D O I
10.1007/978-3-319-08958-4_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dialogue annotation is a necessary step for the development of dialogue systems, specially for data-based dialogue strategies. Manual annotation is hard and time-consuming, and automatic techniques can be used to obtain a draft annotation and speed up the process. The presentation of the draft annotation with confidence levels on the correctness of every part of the hypothesis can make even faster the supervision process. In this paper we propose two methods to calculate confidence measures for an automatic dialogue annotation model, and test them for the annotation of a task-oriented human-computer corpus on railway information. The results show that our proposals have a similar behaviour and that they are a good starting point for incorporating confidence measures in the dialogue annotation process.
引用
收藏
页码:264 / 275
页数:12
相关论文
共 50 条
  • [41] Bangla Word Clustering Based on N-gram Language Model
    Ismail, Sabir
    Rahman, M. Shahidur
    2014 1ST INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION & COMMUNICATION TECHNOLOGY (ICEEICT 2014), 2014,
  • [42] Answering questions with an n-gram based passage retrieval engine
    Buscaldi, Davide
    Rosso, Paolo
    Manuel Gomez-Soriano, Jose
    Sanchis, Emilio
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2010, 34 (02) : 113 - 134
  • [43] Classifying Web Pages by Genre: An n-gram Based Approach
    Mason, Jane E.
    Shepherd, Michael
    Duffy, Jack
    2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2009, : 458 - 465
  • [44] A Short Text Classification Method Based on N-Gram and CNN
    WANG Haitao
    HE Jie
    ZHANG Xiaohong
    LIU Shufen
    ChineseJournalofElectronics, 2020, 29 (02) : 248 - 254
  • [45] Sentiment Analysis Approach Based on N-gram and KNN Classifier
    Kaur, Sumandeep
    Sikka, Geeta
    Awasthi, Lalit Kumar
    2018 FIRST INTERNATIONAL CONFERENCE ON SECURE CYBER COMPUTING AND COMMUNICATIONS (ICSCCC 2018), 2018, : 13 - 16
  • [46] Rich Morphology Based N-gram Language Models for Arabic
    Emami, Ahmad
    Zitouni, Imed
    Mangu, Lidia
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 829 - 832
  • [47] Detecting Malware Based on Opcode N-Gram and Machine Learning
    Li, Pengfei
    Chen, Zhouguo
    Cui, Baojiang
    ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING (3PGCIC-2017), 2018, 13 : 99 - 110
  • [48] N-gram based filler model for robust grammar authoring
    Yu, Dong
    Ju, Yun Cheng
    Wang, Ye-Yi
    Acero, Alex
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 565 - 568
  • [49] Application of N-Gram Based Distances to Genetic Texts Comparison
    Valery Kirzhner
    Zeev Volkovich
    Biosemiotics, 2021, 14 : 271 - 285
  • [50] A Short Text Classification Method Based on N-Gram and CNN
    Wang, Haitao
    He, Jie
    Zhang, Xiaohong
    Liu, Shufen
    CHINESE JOURNAL OF ELECTRONICS, 2020, 29 (02) : 248 - 254