ON THE USE OF N-GRAM TRANSDUCERS FOR DIALOGUE ANNOTATION

Cited by: 2

Authors:

Tamarit, Vicent [1]

Martinez-Hinarejos, Carlos-D. [1]

Benedi, Jose-Miguel [1]

Affiliations:

[1] Univ Politecn Valencia, Inst Tecnol Informat, Valencia, Spain

Source:

SPOKEN DIALOGUE SYSTEMS: TECHNOLOGY AND DESIGN | 2011

Keywords:

Statistical models; Dialogue annotation;

DOI:

10.1007/978-1-4419-7934-6_11

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202;

Abstract:

The implementation of dialogue systems is one of the most interesting applications of language technologies. Statistical models can be used in this implementation, allowing for a more flexible approach than rules defined by a human expert. However, statistical models require large numbers of dialogues annotated with dialogue-function labels (usually Dialogue Acts), and the annotation process is hard and time-consuming. Consequently, using other statistical models to obtain annotations faster is of great interest for the development of dialogue systems. In this work we compare two statistical models for dialogue annotation: a more classical model based on Hidden Markov Models (HMM) and the newer N-gram Transducers (NGT) model. The comparison is performed on two corpora of different natures: the well-known SwitchBoard corpus and the DIHANA corpus. The results show that the NGT model produces a much more accurate annotation than the HMM-based model (with as much as 11% less error on the SwitchBoard corpus).


Pages: 255-276

Number of pages: 22
