Copy Mechanism and Tailored Training for Character-Based Data-to-Text Generation

Cited by: 1

Authors:

Roberti, Marco [1]

Bonetta, Giovanni [1]

Cancelliere, Rossella [1]

Gallinari, Patrick [2, 3]

Affiliations:

[1] Univ Turin, Comp Sci Dept, Via Pessinetto 12, I-12149 Turin, Italy

[2] Sorbonne Univ, 4 Pl Jussieu, F-75005 Paris, France

[3] Criteo AI Lab, 32 Rue Blanche, F-75009 Paris, France

Source:

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II | 2020 / Vol. 11907

Keywords:

Natural language processing; Data-to-text generation; Deep learning; Sequence-to-sequence; Dataset;

DOI:

10.1007/978-3-030-46147-8_39

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In the last few years, many methods have focused on using deep recurrent neural networks for natural language generation. The most widely used sequence-to-sequence neural methods are word-based: as such, they need a pre-processing step called delexicalization (and, conversely, a post-processing step called relexicalization) to deal with uncommon or unknown words. These forms of processing, however, give rise to models that depend on the vocabulary used and are not completely neural. In this work, we present an end-to-end sequence-to-sequence model with an attention mechanism which reads and generates at the character level, no longer requiring delexicalization, tokenization, or even lowercasing. Moreover, since characters constitute the common "building blocks" of every text, it also allows a more general approach to text generation, making it possible to exploit transfer learning for training. These capabilities are obtained thanks to two major features: (i) the ability to alternate between the standard generation mechanism and a copy mechanism, which allows input facts to be copied directly into the output, and (ii) the use of an original training pipeline that further improves the quality of the generated texts. We also introduce a new dataset called E2E+, a modified version of the well-known E2E dataset used in the E2E Challenge, designed to highlight the copying capabilities of character-based models. We tested our model according to five broadly accepted metrics (including the widely used BLEU), showing that it yields competitive performance with respect to both character-based and word-based approaches.
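As an illustration of feature (i), the following minimal sketch shows one common way to combine a generation distribution with a copy distribution at a single decoding step, in the spirit of pointer-generator networks. The abstract does not give the authors' exact formulation, so the function name `mix_copy_and_generate`, the switch probability `p_gen`, and the toy numbers are all assumptions made for illustration only.

```python
from typing import Dict, List


def mix_copy_and_generate(
    p_vocab: Dict[str, float],
    attention: List[float],
    source: str,
    p_gen: float,
) -> Dict[str, float]:
    """Blend a generation distribution with a copy distribution.

    Hypothetical helper: p_vocab is the decoder's distribution over
    characters, attention holds one weight per source character, and
    p_gen is the probability of generating rather than copying.
    """
    if len(attention) != len(source):
        raise ValueError("attention must have one weight per source character")
    # Generation branch: scale the character distribution by p_gen.
    mixed = {ch: p_gen * p for ch, p in p_vocab.items()}
    # Copy branch: each source position adds its attention weight,
    # scaled by (1 - p_gen), to the character found at that position.
    for ch, weight in zip(source, attention):
        mixed[ch] = mixed.get(ch, 0.0) + (1.0 - p_gen) * weight
    return mixed


# Toy step: 'Z' has no mass under the generation distribution,
# so it can only be produced through the copy branch.
p_vocab = {"a": 0.5, "b": 0.5}
attention = [0.25, 0.75]
source = "Zb"
mixed = mix_copy_and_generate(p_vocab, attention, source, p_gen=0.5)
print(mixed)
```

In this toy step the character `Z` receives probability only through the copy branch, which is the kind of behaviour that lets a model reproduce rare input strings such as names without delexicalization.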

Pages: 648-664

Number of pages: 17
