Unsegmented Dialogue Act Annotation and Decoding With N-Gram Transducers

被引：5

作者：

Martinez-Hinarejos, Carlos-D. ^{[1
]}

Benedi, Jose-Miguel ^{[1
]}

Tamarit, Vicent ^{[1
]}

机构：

[1] Univ Politecn Valencia, Pattern Recognit & Human Language Technol Ctr, Valencia 46022, Spain

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2015年 / 23卷 / 01期

关键词：

Dialogue annotation; n-gram transducer; spoken dialogue systems; SEGMENTATION; FRAMEWORK; AGENDA; STATE;

D O I：

10.1109/TASLP.2014.2377595

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Most studies on dialogue corpora, as well as most dialogue systems, employ dialogue acts as the basic units for interpreting discourse structure, user input and system actions. The definition of the discourse structure and the dialogue strategy consequently require the tagging of dialogue corpora in terms of dialogue acts. The tagging problem presents two basic variants: a batch variant (annotation of whole dialogues, in order to define dialogue strategy or study discourse structure) and an online variant (decoding of the dialogue act sequence of a given turn, in order to interpret user intentions). In the two variants is unusual having the segmentation of each turn into the dialogue meaningful units (segments) to which a dialogue act is assigned. In this paper we present the use of the N-Gram Transducer technique for tagging dialogues, without needing to provide a prior segmentation, in these two different variants (dialogue annotation and turn decoding). Experiments were performed in two corpora of different nature and results show that N-Gram Transducer models are suitable for these tasks and provide good performance.

引用

页码：198 / 211

页数：14

共 50 条

[1] Improving unsegmented dialogue turns annotation with N-gram transducers
Instituto Tecnologico de Informatica, Universidad Politecnica de Valencia, Camino de Vera, s/n, 46022 Valencia, Spain
PACLIC 23 - Proc. 23rd Pacific Asia Conf. Lang. Inf. Comput., 2009, (335-344):
[2] ON THE USE OF N-GRAM TRANSDUCERS FOR DIALOGUE ANNOTATION
Tamarit, Vicent
Martinez-Hinarejos, Carlos-D.
Benedi, Jose-Miguel
SPOKEN DIALOGUE SYSTEMS: TECHNOLOGY AND DESIGN, 2011, : 255 - 276
[3] Direct and Wordgraph-Based Confidence Measures in Dialogue Annotation with N-Gram Transducers
Martinez-Hinarejos, Carlos-D.
Tamarit, Vicent
Benedi, Jose-Miguel
HUMAN LANGUAGE TECHNOLOGY CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2014, 8387 : 264 - 275
[4] N-gram language models for document image decoding
Kopec, GE
Said, MR
Popat, K
DOCUMENT RECOGNITION AND RETRIEVAL IX, 2002, 4670 : 191 - 202
[5] Inference of stochastic finite-state transducers using N-gram mixtures
Alabau, Vicente
Casacuberta, Francisco
Vidal, Enrique
Juan, Alfons
PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2007, 4478 : 282 - +
[6] High Order N-gram Model Construction and Application Based on Natural Annotation
Wang, Qibo
Rao, Gaoqi
Xun, Endong
CHINESE LEXICAL SEMANTICS (CLSW 2019), 2020, 11831 : 321 - 328
[7] N-gram Insight
Prans, George
AMERICAN SCIENTIST, 2011, 99 (05) : 356 - 357
[8] Evaluation of HMM-based models for the annotation of unsegmented dialogue turns
Martinez-Hinarejos, Carlos-D.
Tamarit, Vicent
Benedi, Jose-Miguel
LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1608 - 1613
[9] N-gram MalGAN: Evading machine learning detection via feature n-gram
Zhu, Enmin
Zhang, Jianjie
Yan, Jijie
Chen, Kongyang
Gao, Chongzhi
DIGITAL COMMUNICATIONS AND NETWORKS, 2022, 8 (04) : 485 - 491
[10] Pseudo-Conventional N-Gram Representation of the Discriminative N-Gram Model for LVCSR
Zhou, Zhengyu
Meng, Helen
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (06) : 943 - 952

← 1 2 3 4 5 →