Improving Transformer with Sequential Context Representations for Abstractive Text Summarization

被引:20
|
作者
Cai, Tian [1 ,2 ]
Shen, Mengjun [1 ,2 ]
Peng, Huailiang [1 ,2 ]
Jiang, Lei [1 ]
Dai, Qiong [1 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
Transformer; Abstractive summarization;
D O I
10.1007/978-3-030-32233-5_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent dominant approaches for abstractive text summarization are mainly RNN-based encoder-decoder framework, these methods usually suffer from the poor semantic representations for long sequences. In this paper, we propose a new abstractive summarization model, called RC-Transformer (RCT). The model is not only capable of learning longterm dependencies, but also addresses the inherent shortcoming of Transformer on insensitivity to word order information. We extend the Transformer with an additional RNN-based encoder to capture the sequential context representations. In order to extract salient information effectively, we further construct a convolution module to filter the sequential context with local importance. The experimental results on Gigaword and DUC-2004 datasets show that our proposed model achieves the state-of-the-art performance, even without introducing external information. In addition, our model also owns an advantage in speed over the RNN-based models.
引用
下载
收藏
页码:512 / 524
页数:13
相关论文
共 50 条
  • [21] Unsupervised Abstractive Summarization of Bengali Text Documents
    Chowdhury, Radia Rayan
    Nayeem, Mir Tafseer
    Mim, Tahsin Tasnim
    Chowdhury, Md Saifur Rahman
    Jannat, Taufiqul
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2612 - 2619
  • [22] Variational Neural Decoder for Abstractive Text Summarization
    Zhao, Huan
    Cao, Jie
    Xu, Mingquan
    Lu, Jian
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2020, 17 (02) : 537 - 552
  • [23] Reinforcement Learning Models for Abstractive Text Summarization
    Buciumas, Sergiu
    PROCEEDINGS OF THE 2019 ANNUAL ACM SOUTHEAST CONFERENCE (ACMSE 2019), 2019, : 270 - 271
  • [24] Improving ROUGE-1 by 6%: A novel multilingual transformer for abstractive news summarization
    Kumar, Sandeep
    Solanki, Arun
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (20):
  • [25] Abstractive Text Summarization Using Multimodal Information
    Rafi, Shaik
    Das, Ranjita
    2023 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI, 2023, : 141 - 145
  • [26] Abstractive Text Summarization via Stacked LSTM
    Siddhartha, Ireddy
    Zhan, Huixin
    Sheng, Victor S.
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 437 - 442
  • [27] Highlighted Word Encoding for Abstractive Text Summarization
    Lal, Daisy Monika
    Singh, Krishna Pratap
    Tiwary, Uma Shanker
    INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2019), 2020, 11886 : 77 - 86
  • [28] Generative Adversarial Network for Abstractive Text Summarization
    Liu, Linqing
    Lu, Yao
    Yang, Min
    Qu, Qiang
    Zhu, Jia
    Li, Hongyan
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8109 - 8110
  • [29] Evaluating the Factual Consistency of Abstractive Text Summarization
    Kryscinski, Wojciech
    McCann, Bryan
    Xiong, Caiming
    Socher, Richard
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 9332 - 9346
  • [30] Abstractive Text Summarization by Incorporating Reader Comments
    Gao, Shen
    Chen, Xiuying
    Li, Piji
    Ren, Zhaochun
    Bing, Lidong
    Zhao, Dongyan
    Yan, Rui
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6399 - 6406