Controlled Hallucinations: Learning to Generate Faithfully from Noisy Data

被引:0
|
作者
Fillippova, Katja [1 ]
机构
[1] Google Res, Berlin, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural text generation (data- or text-to-text) demonstrates remarkable performance when training data is abundant which for many applications is not the case. To collect a large corpus of parallel data, heuristic rules are often used but they inevitably let noise into the data, such as phrases in the output which cannot be explained by the input. Consequently, models pick up on the noise and may hallucinategenerate fluent but unsupported text. Our contribution is a simple but powerful technique to treat such hallucinations as a controllable aspect of the generated text, without dismissing any input and without modifying the model architecture. On the WikiBio corpus (Lebret et al., 2016), a particularly noisy dataset, we demonstrate the efficacy of the technique both in an automatic and in a human evaluation.
引用
收藏
页码:864 / 870
页数:7
相关论文
共 50 条
  • [1] Learning to Rank from Noisy Data
    Ding, Wenkui
    Geng, Xiubo
    Zhang, Xu-Dong
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2015, 7 (01)
  • [2] Learning Programs from Noisy Data
    Raychev, Veselin
    Bielik, Pavol
    Vechev, Martin
    Krause, Andreas
    ACM SIGPLAN NOTICES, 2016, 51 (01) : 761 - 774
  • [3] Learning programs from noisy data
    Raychev V.
    Bielik P.
    Vechev M.
    Krause A.
    1600, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (51): : 761 - 774
  • [4] Learning to Generate Visual Questions with Noisy Supervision
    Shen, Kai
    Wu, Lingfei
    Tang, Siliang
    Zhuang, Yueting
    He, Zhen
    Ding, Zhuoye
    Xiao, Yun
    Long, Bo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Learning from Noisy Data with Robust Representation Learning
    Li, Junnan
    Xiong, Caiming
    Hoi, Steven C. H.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9465 - 9474
  • [6] MetaLabelNet: Learning to Generate Soft-Labels From Noisy-Labels
    Algan, Gorkem
    Ulusoy, Ilkay
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4352 - 4362
  • [7] Learning Explanatory Rules from Noisy Data
    Evans, Richard
    Grefenstette, Edward
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2018, 61 : 1 - 64
  • [8] Robust Graph Learning From Noisy Data
    Kang, Zhao
    Pan, Haiqi
    Hoi, Steven C. H.
    Xu, Zenglin
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (05) : 1833 - 1843
  • [9] Learning explanatory rules from noisy data
    1600, AI Access Foundation (61):
  • [10] Learning to Learn from Noisy Labeled Data
    Li, Junnan
    Wong, Yongkang
    Zhao, Qi
    Kankanhalli, Mohan S.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5046 - 5054