Causal Intervention for Abstractive Related Work Generation

Cited by: 0
Authors
Liu, Jiachang [1]
Zhang, Qi [2,5]
Shi, Chongyang [1]
Naseem, Usman [3]
Wang, Shoujin [4]
Hu, Liang [2,5]
Tsang, Ivor W. [6,7]
Affiliations
[1] Beijing Inst Technol, Beijing, Peoples R China
[2] Tongji Univ, Shanghai, Peoples R China
[3] James Cook Univ, Townsville, Qld, Australia
[4] Univ Technol Sydney, Sydney, NSW, Australia
[5] DeepBlue Acad Sci, Shanghai, Peoples R China
[6] Agcy Sci Technol & Res, CFAR, Singapore, Singapore
[7] Agcy Sci Technol & Res, IHPC, Singapore, Singapore
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Abstractive related work generation has attracted increasing attention in generating coherent related work that helps readers grasp the current research. However, most existing models ignore the inherent causality during related work generation, leading to spurious correlations which downgrade the models' generation quality and generalizability. In this study, we argue that causal intervention can address such limitations and improve the quality and coherence of generated related work. To this end, we propose a novel Causal Intervention Module for Related Work Generation (CaM) to effectively capture causalities in the generation process. Specifically, we first model the relations among the sentence order, document (reference) correlations, and transitional content in related work generation using a causal graph. Then, to implement causal interventions and mitigate the negative impact of spurious correlations, we use do-calculus to derive ordinary conditional probabilities and identify causal effects through CaM. Finally, we subtly fuse CaM with Transformer to obtain an end-to-end related work generation framework. Extensive experiments on two real-world datasets show that CaM can effectively promote the model to learn causal relations and thus produce related work of higher quality and coherence.
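As a generic illustration of the do-calculus step mentioned in the abstract, the sketch below applies the standard backdoor adjustment, P(Y | do(X)) = sum over z of P(Y | X, z) P(z), to a toy distribution with one confounder. It is not the CaM derivation, which this record does not give; the variables X, Y, Z and all probability values are hypothetical and chosen only to show how an interventional quantity is rewritten in terms of ordinary conditional probabilities.

```python
# Toy causal graph (hypothetical): Z -> X, Z -> Y, X -> Y, all variables binary.
# All numbers below are made up for illustration.
p_z = {0: 0.5, 1: 0.5}                      # P(Z=z)
p_x1_given_z = {0: 0.2, 1: 0.8}             # P(X=1 | Z=z)
p_y1_given_xz = {                           # P(Y=1 | X=x, Z=z), keyed by (x, z)
    (0, 0): 0.1, (1, 0): 0.3,
    (0, 1): 0.5, (1, 1): 0.7,
}


def p_x_given_z(x, z):
    """P(X=x | Z=z) for binary X."""
    return p_x1_given_z[z] if x == 1 else 1.0 - p_x1_given_z[z]


def observational(x):
    """P(Y=1 | X=x): weights each z by P(z | x), so the confounder leaks in."""
    joint = {z: p_x_given_z(x, z) * p_z[z] for z in p_z}   # P(X=x, Z=z)
    p_x = sum(joint.values())                               # P(X=x)
    return sum(p_y1_given_xz[(x, z)] * joint[z] / p_x for z in p_z)


def interventional(x):
    """P(Y=1 | do(X=x)) via backdoor adjustment: weights each z by P(z)."""
    return sum(p_y1_given_xz[(x, z)] * p_z[z] for z in p_z)


observed_gap = observational(1) - observational(0)
causal_effect = interventional(1) - interventional(0)
print(f"observed gap:  {observed_gap:.2f}")
print(f"causal effect: {causal_effect:.2f}")
```

In this toy setting the observed gap is larger than the causal effect because Z raises both the chance of X=1 and the chance of Y=1; the adjustment removes that spurious part.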
Pages: 2148-2159
Page count: 12
Related papers
50 in total
  • [21] Varieties of causal intervention
    Korb, KB
    Hope, LR
    Nicholson, AE
    Axnick, K
    PRICAI 2004: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3157 : 322 - 331
  • [22] Abstractive headline generation using WIDL-expressions
    Soricut, R.
    Marcu, D.
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) : 1536 - 1548
  • [23] Causal inference of the effectiveness of a return to work intervention from observational data: the role of selection
    Joling, C.
    Groot, W.
    EUROPEAN JOURNAL OF PUBLIC HEALTH, 2006, 16 : 57 - 57
  • [24] Intervention research, theoretical mechanisms, and causal processes related to externalizing behavior patterns
    Hinshaw, SP
    DEVELOPMENT AND PSYCHOPATHOLOGY, 2002, 14 (04) : 789 - 818
  • [25] Paraphrastic Fusion for Abstractive Multi-Sentence Compression Generation
    Nayeem, Mir Tafseer
    Chali, Yllias
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2223 - 2226
  • [26] Work related asthma. A causal analysis controlling the healthy worker effect
    Dumas, Orianne
    Le Moual, Nicole
    Siroux, Valerie
    Heederik, Dick
    Garcia-Aymerich, Judith
    Varraso, Raphaelle
    Kauffmann, Francine
    Basagana, Xavier
    OCCUPATIONAL AND ENVIRONMENTAL MEDICINE, 2013, 70 (09) : 603 - 610
  • [27] Automatic generation of related work through summarizing citations
    Chen, Jingqiang
    Zhuge, Hai
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (03):
  • [28] Multicomponent intervention for work-related upper extremity disorders
    Feuerstein, M
    Marshall, L
    Shaw, WS
    Burrell, LM
    JOURNAL OF OCCUPATIONAL REHABILITATION, 2000, 10 (01) : 71 - 83
  • [29] Improving Faithfulness in Abstractive Summarization with Contrast Candidate Generation and Selection
    Chen, Sihao
    Zhang, Fan
    Sone, Kazoo
    Roth, Dan
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5935 - 5941
  • [30] Work-related low back pain: secondary intervention
    Snook, SH
    JOURNAL OF ELECTROMYOGRAPHY AND KINESIOLOGY, 2004, 14 (01) : 153 - 160