ON THE USE OF N-GRAM TRANSDUCERS FOR DIALOGUE ANNOTATION

被引:2
|
作者
Tamarit, Vicent [1 ]
Martinez-Hinarejos, Carlos-D. [1 ]
Benedi, Jose-Miguel [1 ]
机构
[1] Univ Politecn Valencia, Inst Tecnol Informat, Valencia, Spain
来源
SPOKEN DIALOGUE SYSTEMS: TECHNOLOGY AND DESIGN | 2011年
关键词
Statistical models; Dialogue annotation;
D O I
10.1007/978-1-4419-7934-6_11
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The implementation of dialogue systems is one of the most interesting applications of language technologies. Statistical models can be used in this implementation, allowing for a more flexible approach than when using rules defined by a human expert. However, statistical models require large amounts of dialogues annotated with dialogue-function labels (usually Dialogue Acts), and the annotation process is hard and time-consuming. Consequently, the use of other statistical models to obtain faster annotations is really interesting for the development of dialogue systems. In this work we compare two statistical models for dialogue annotation, a more classical Hidden Markov Model (HMM) based model and the new N-gram Transducers (NGT) model. This comparison is performed on two corpora of different nature, the well-known SwitchBoard corpus and the DIHANA corpus. The results show that the NGT model produces a much more accurate annotation that the HMM-based model (even 11% less error in the SwitchBoard corpus).
引用
收藏
页码:255 / 276
页数:22
相关论文
共 50 条
  • [41] DERIN: A data extraction information and n-gram
    Lopes Figueiredo, Leandro Neiva
    de Assis, Guilherme Tavares
    Ferreira, Anderson A.
    INFORMATION PROCESSING & MANAGEMENT, 2017, 53 (05) : 1120 - 1138
  • [42] Web as a Corpus: Going Beyond the n-gram
    Nakov, Preslav
    INFORMATION RETRIEVAL, RUSSIR 2014, 2015, 505 : 185 - 228
  • [43] Research of Affective Recognize Based on N-gram
    Xue Weimin
    Lin Benjing
    Yu Bing
    2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 702 - +
  • [44] Applications of Boolean equations in n-gram analysis
    Marovac, Ulfeta
    ICIST '18: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES, 2018,
  • [45] Perplexity of n-Gram and Dependency Language Models
    Popel, Martin
    Marecek, David
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 173 - 180
  • [46] MIXTURE OF MIXTURE N-GRAM LANGUAGE MODELS
    Sak, Hasim
    Allauzen, Cyril
    Nakajima, Kaisuke
    Beaufays, Francoise
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 31 - 36
  • [47] Differentiable N-gram objective on abstractive summarization
    Zhu, Yunqi
    Yang, Xuebing
    Wu, Yuanyuan
    Zhu, Mingjin
    Zhang, Wensheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 215
  • [48] A variant of n-gram based language classification
    Tomovic, Andrija
    Janicic, Predrag
    AI(ASTERISK)IA 2007: ARTIFICIAL INTELLIGENCE AND HUMAN-ORIENTED COMPUTING, 2007, 4733 : 410 - +
  • [49] Twitter n-gram corpus with demographic metadata
    Amaç Herdağdelen
    Language Resources and Evaluation, 2013, 47 : 1127 - 1147
  • [50] ADtrees for sequential data and n-gram counting
    Van Dam, Rob
    Ventura, Dan
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 766 - 771