Plausibility-promoting generative adversarial network for abstractive text summarization with multi-task constraint

被引:24
|
作者
Yang, Min [1 ]
Wang, Xintong [2 ]
Lu, Yao [3 ]
Lv, Jianming [2 ]
Shen, Ying [4 ]
Li, Chengming [1 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Beijing, Peoples R China
[2] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Guangdong, Peoples R China
[3] Univ Waterloo, Sch Comp Sci, Waterloo, ON, Canada
[4] Peking Univ, Shenzhen Grad Sch, Sch Elect & Comp Engn, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Abstractive text summarization; Generative adversarial network; Multi-task learning;
D O I
10.1016/j.ins.2020.02.040
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text summarization is an essential task in natural language processing, which aims to generate concise and condensed summaries retaining the salient information of the input document. Despite the progress of previous work, generating summaries, which are informative, grammatically correct and diverse, remains challenging in practice. In this paper, we present a Plausibility-promoting Generative Adversarial Network for Abstractive Text Summarization with Multi-Task constraint (PGAN-ATSMT), which shows promising performance for generating informative, grammatically correct, and novel summaries. First, PGAN-ATSMT adopts a plausibility-promoting generative adversarial network, which jointly trains a discriminative model D and a generative model G via adversarial learning. The generative model G employs the sequence-to-sequence architecture as its backbone, taking as input the original text and generating a corresponding summary. A novel language model based discriminator D is proposed to distinguish the generated summaries by G from the ground truth summaries without the saturation issue in the previous binary classifier discriminator. The generative model G and the discriminative model D are learned with a minimax two-player game, thus this adversarial process can eventually adjust G to produce high-quality and plausible summaries. Second, we propose two extended regularizations for the generative model G using the multi-task learning, sharing its LSTM encoder and LSTM decoder with text categorization task and syntax annotation task, respectively. The auxiliary tasks help to improve the quality of locating salient information of a document and generate high-quality summaries from language modeling perspective alleviating the issues of incomplete sentences and duplicated words. Experimental results on two benchmark datasets illustrate that PGAN-ATSMT achieves better performance than the state-of-the-art baseline methods in terms of both quantitative and qualitative evaluations. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:46 / 61
页数:16
相关论文
共 50 条
  • [1] Generative Adversarial Network for Abstractive Text Summarization
    Liu, Linqing
    Lu, Yao
    Yang, Min
    Qu, Qiang
    Zhu, Jia
    Li, Hongyan
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8109 - 8110
  • [2] A Multi-Task Learning Framework for Abstractive Text Summarization
    Lu, Yao
    Liu, Linqing
    Jiang, Zhile
    Yang, Min
    Goebel, Randy
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9987 - 9988
  • [3] Multi-task learning for abstractive text summarization with key information guide network
    Weiran Xu
    Chenliang Li
    Minghao Lee
    Chi Zhang
    [J]. EURASIP Journal on Advances in Signal Processing, 2020
  • [4] Multi-task learning for abstractive text summarization with key information guide network
    Xu, Weiran
    Li, Chenliang
    Lee, Minghao
    Zhang, Chi
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2020, 2020 (01)
  • [5] Abstractive Text Summarization Using Generative Adversarial Network and Relation Extraction
    Jing, Liwei
    Yang, Lina
    Li, Xichun
    Meng, Zuqiang
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 203 - 206
  • [6] A novel semantic-enhanced generative adversarial network for abstractive text summarization
    Vo, Tham
    [J]. SOFT COMPUTING, 2023, 27 (10) : 6267 - 6280
  • [7] A novel semantic-enhanced generative adversarial network for abstractive text summarization
    Tham Vo
    [J]. Soft Computing, 2023, 27 : 6267 - 6280
  • [8] Multi-Task Learning for Abstractive and Extractive Summarization
    Chen, Yangbin
    Ma, Yun
    Mao, Xudong
    Li, Qing
    [J]. DATA SCIENCE AND ENGINEERING, 2019, 4 (01) : 14 - 23
  • [9] Multi-Task Learning for Abstractive and Extractive Summarization
    Yangbin Chen
    Yun Ma
    Xudong Mao
    Qing Li
    [J]. Data Science and Engineering, 2019, 4 (1) : 14 - 23
  • [10] Multi-task and Generative Adversarial Learning for Robust and Sustainable Text Classification
    Breazzano, Claudia
    Croce, Danilo
    Basili, Roberto
    [J]. AIXIA 2021 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13196 : 228 - 244