Domain-independent Data-to-Text Generation for Open Data

被引:0
|
作者
Burgdorf, Andreas [1 ]
Barkmann, Micaela [1 ]
Pomp, Andre [1 ]
Meisen, Tobias [1 ]
机构
[1] Univ Wuppertal, Chair Technol & Management Digital Transformat, Wuppertal, Germany
来源
PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA) | 2022年
关键词
Open Data; Data to Text Generation; Natural Language Generation; Transformer; Semantic Data Management;
D O I
10.5220/0011272900003269
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a result of the efforts of the Open Data movements, the number of Open Data portals and the amount of data published in them is steadily increasing. An aspect that increases the utilizability of data enormously but is nevertheless often neglected is the enrichment of data with textual data documentation. However, the creation of descriptions of sufficient quality is time-consuming and thus cost-intensive. One approach to solving this problem is Data to text generation which creates descriptions to raw data. In the past, promising results were achieved on data from Wikipedia. Based on a seq2seq model developed for such purposes, we investigate whether this technique can also be applied in the Open Data domain and the associated challenges. In three studies, we reproduce the results obtained from a previous work and apply them to additional datasets with new challenges in terms of data nature and data volume. We can conclude that previous methods are not suitable to be applied in the Open Data sector without further modification, but the results still exceed our expectations and show the potential of applicability.
引用
收藏
页码:95 / 106
页数:12
相关论文
共 50 条
  • [21] Controlling hallucinations at word level in data-to-text generation
    Clement Rebuffel
    Marco Roberti
    Laure Soulier
    Geoffrey Scoutheeten
    Rossella Cancelliere
    Patrick Gallinari
    Data Mining and Knowledge Discovery, 2022, 36 : 318 - 354
  • [22] A Data-to-Text Generation Model with Deduplicated Content Planning
    Wang, Mengda
    Cao, Jianjun
    Yu, Xu
    Nie, Zibo
    BIG DATA, BIGDATA 2022, 2022, 1709 : 92 - 103
  • [23] Neural Data-to-Text Generation Guided by Predicted Plan
    Gao, Hanning
    Wei, Zhihua
    2022 IEEE 2ND INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SOFTWARE ENGINEERING (ICICSE 2022), 2022, : 53 - 59
  • [24] Controlling hallucinations at word level in data-to-text generation
    Rebuffel, Clement
    Roberti, Marco
    Soulier, Laure
    Scoutheeten, Geoffrey
    Cancelliere, Rossella
    Gallinari, Patrick
    DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (01) : 318 - 354
  • [25] Neural Data-to-Text Generation with LM-based Text Augmentation
    Chang, Ernie
    Shen, Xiaoyu
    Zhu, Dawei
    Demberg, Vera
    Su, Hui
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 758 - 768
  • [26] Partially-Aligned Data-to-Text Generation with Distant Supervision
    Fu, Zihao
    Shi, Bei
    Lam, Wai
    Bing, Lidong
    Liu, Zhiyuan
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 9183 - 9193
  • [27] A Logic Aware Neural Generation Method for Explainable Data-to-text
    Lin, Xiexiong
    Li, Huaisong
    Huang, Tao
    Wang, Feng
    Chao, Linlin
    Zhuang, Fuzhen
    Wang, Taifeng
    Zhang, Tianyi
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3318 - 3326
  • [28] Stochastic Data-to-Text Generation Using Syntactic Dependency Information
    Seifossadat, Elham
    Sameti, Hossein
    COMPUTER SPEECH AND LANGUAGE, 2022, 76
  • [29] Entity-Based Semantic Adequacy for Data-to-Text Generation
    Faille, Juliette
    Gatt, Albert
    Gardent, Claire
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1530 - 1540
  • [30] Exploring Abductive Reasoning in Language Models for Data-to-Text Generation
    Onderkova, Kristyna
    Nickles, Matthias
    2023 31ST IRISH CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COGNITIVE SCIENCE, AICS, 2023,