Copy Mechanism and Tailored Training for Character-Based Data-to-Text Generation

被引:1
|
作者
Roberti, Marco [1 ]
Bonetta, Giovanni [1 ]
Cancelliere, Rossella [1 ]
Gallinari, Patrick [2 ,3 ]
机构
[1] Univ Turin, Comp Sci Dept, Via Pessinetto 12, I-12149 Turin, Italy
[2] Sorbonne Univ, 4 Pl Jussieu, F-75005 Paris, France
[3] Criteo AI Lab, 32 Rue Blanche, F-75009 Paris, France
关键词
Natural language processing; Data-to-text generation; Deep learning; Sequence-to-sequence; Dataset;
D O I
10.1007/978-3-030-46147-8_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last few years, many different methods have been focusing on using deep recurrent neural networks for natural language generation. The most widely used sequence-to-sequence neural methods are word-based: as such, they need a pre-processing step called delexicalization (conversely, relexicalization) to deal with uncommon or unknown words. These forms of processing, however, give rise to models that depend on the vocabulary used and are not completely neural. In this work, we present an end-to-end sequence-to-sequence model with attention mechanism which reads and generates at a character level, no longer requiring delexicalization, tokenization, nor even lowercasing. Moreover, since characters constitute the common "building blocks" of every text, it also allows a more general approach to text generation, enabling the possibility to exploit transfer learning for training. These skills are obtained thanks to two major features: (i) the possibility to alternate between the standard generation mechanism and a copy one, which allows to directly copy input facts to produce outputs, and (ii) the use of an original training pipeline that further improves the quality of the generated texts. We also introduce a new dataset called E2E+, designed to highlight the copying capabilities of character-based models, that is a modified version of the well-known E2E dataset used in the E2E Challenge. We tested our model according to five broadly accepted metrics (including the widely used bleu), showing that it yields competitive performance with respect to both character-based and word-based approaches.
引用
收藏
页码:648 / 664
页数:17
相关论文
共 50 条
  • [21] Plot Generation with Character-Based Decisions
    Barbosa, Simone D. J.
    Guilherme da Silva, Fabio A.
    Furtado, Antonio L.
    Casanova, Marco A.
    COMPUTERS IN ENTERTAINMENT, 2014, 12 (03) : 1 - 21
  • [22] A Case-Based Approach for Content Planning in Data-to-Text Generation
    Upadhyay, Ashish
    Massie, Stewart
    CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2022, 2022, 13405 : 380 - 394
  • [23] Narrative context-based data-to-text generation for ambient intelligence
    Jang, Jungsun
    Noh, Hyungjong
    Lee, Yeonsoo
    Pantel, Soo-Min
    Rim, Haechang
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (04) : 1421 - 1429
  • [24] uFACT: Unfaithful Alien-Corpora Training for Semantically Consistent Data-to-Text Generation
    Anders, Tisha
    Coca, Alexandru
    Byrne, Bill
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 2836 - 2841
  • [25] Narrative context-based data-to-text generation for ambient intelligence
    Jungsun Jang
    Hyungjong Noh
    Yeonsoo Lee
    Soo-Min Pantel
    Haechang Rim
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 1421 - 1429
  • [26] Domain-independent Data-to-Text Generation for Open Data
    Burgdorf, Andreas
    Barkmann, Micaela
    Pomp, Andre
    Meisen, Tobias
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2022, : 95 - 106
  • [27] Character-based handwritten text transcription with attention networks
    Jason Poulos
    Rafael Valle
    Neural Computing and Applications, 2021, 33 : 10563 - 10573
  • [28] Character-based handwritten text transcription with attention networks
    Poulos, Jason
    Valle, Rafael
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (16): : 10563 - 10573
  • [29] Character-Based Handwritten Text Recognition of Multilingual Documents
    del Agua, Miguel A.
    Serrano, Nicolas
    Civera, Jorge
    Juan, Alfons
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, 2012, 328 : 187 - 196
  • [30] Neural data-to-text generation with dynamic content planning
    Chen, Kai
    Li, Fayuan
    Hu, Baotian
    Peng, Weihua
    Chen, Qingcai
    Yu, Hong
    Xiang, Yang
    KNOWLEDGE-BASED SYSTEMS, 2021, 215