Skeleton to Abstraction: An Attentive Information Extraction Schema for Enhancing the Saliency of Text Summarization

被引:6
|
作者
Xiang, Xiujuan [1 ,2 ,3 ]
Xu, Guangluan [1 ,2 ,3 ]
Fu, Xingyu [1 ,2 ,3 ]
Wei, Yang [2 ,3 ]
Jin, Li [2 ,3 ]
Wang, Lei [2 ,3 ]
机构
[1] Univ Chinese Acad Sci, 19 A,Yuquan Rd, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Inst Elect, 19 North Fourth Ring West Rd, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Key Lab Spatial Informat Proc & Appl Syst Technol, 19 North Fourth Ring West Rd, Beijing 100190, Peoples R China
关键词
recurrent neural network (RNN); abstractive text summarization; information extraction; attention mechanism; semantic relevance; saliency of summarization;
D O I
10.3390/info9090217
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current popular abstractive summarization is based on an attentional encoder-decoder framework. Based on the architecture, the decoder generates a summary according to the full text that often results in the decoder being interfered by some irrelevant information, thereby causing the generated summaries to suffer from low saliency. Besides, we have observed the process of people writing summaries and find that they write a summary based on the necessary information rather than the full text. Thus, in order to enhance the saliency of the abstractive summarization, we propose an attentive information extraction model. It consists of a multi-layer perceptron (MLP) gated unit that pays more attention to the important information of the source text and a similarity module to encourage high similarity between the reference summary and the important information. Before the summary decoder, the MLP and the similarity module work together to extract the important information for the decoder, thus obtaining the skeleton of the source text. This effectively reduces the interference of irrelevant information to the decoder, therefore improving the saliency of the summary. Our proposed model was tested on CNN/Daily Mail and DUC-2004 datasets, and achieved a 42.01 ROUGE-1 f-score and 33.94 ROUGE-1, recall respectively. The result outperforms the state-of-the-art abstractive model on the same dataset. In addition, by subjective human evaluation, the saliency of the generated summaries was further enhanced.
引用
收藏
页数:19
相关论文
共 16 条
  • [1] Text Summarization towards Scientific Information Extraction
    Keller, Abigail
    Furst, Jacob
    Raicu, Daniela
    Hastings, Peter
    Tchoua, Roselyne
    [J]. 2022 IEEE 18TH INTERNATIONAL CONFERENCE ON E-SCIENCE (ESCIENCE 2022), 2022, : 225 - 235
  • [2] Enhancing Biomedical Text Summarization Using Semantic Relation Extraction
    Shang, Yue
    Li, Yanpeng
    Lin, Hongfei
    Yang, Zhihao
    [J]. PLOS ONE, 2011, 6 (08):
  • [3] Structured Text Summarization via Open Domain Information Extraction
    Hao, Zengguang
    Xu, Binxia
    Zheng, Shiyuan
    Gao, Yang
    [J]. PROCEEDINGS OF THE 2018 IEEE 22ND INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN ((CSCWD)), 2018, : 701 - 706
  • [4] Information-content based sentence extraction for text summarization
    Mallett, D
    Elding, J
    Nascimento, MA
    [J]. ITCC 2004: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, VOL 2, PROCEEDINGS, 2004, : 214 - 218
  • [5] INFORMATION EXTRACTION AND TEXT SUMMARIZATION USING LINGUISTIC KNOWLEDGE ACQUISITION
    RAU, LF
    JACOBS, PS
    ZERNIK, U
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1989, 25 (04) : 419 - 428
  • [6] Open information extraction as an intermediate semantic structure for Persian text summarization
    Rahat, Mahmoud
    Talebpour, Alireza
    [J]. INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2018, 19 (04) : 339 - 352
  • [7] An approach for transgender population information extraction and summarization from clinical trial text
    Chen, Boyu
    Jin, Hao
    Yang, Zhiwen
    Qu, Yingying
    Weng, Heng
    Hao, Tianyong
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (Suppl 2)
  • [8] An approach for transgender population information extraction and summarization from clinical trial text
    Boyu Chen
    Hao Jin
    Zhiwen Yang
    Yingying Qu
    Heng Weng
    Tianyong Hao
    [J]. BMC Medical Informatics and Decision Making, 19
  • [9] A global and local information extraction model incorporating selection mechanism for abstractive text summarization
    Li, Yuanyuan
    Huang, Yuan
    Huang, Weijian
    Wang, Wei
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (2) : 4859 - 4886
  • [10] Word Embedding-based Text Processing for Comprehensive Summarization and Distinct Information Extraction
    Wan, Xiangpeng
    Ghazzai, Hakim
    Massoud, Yehia
    [J]. 2020 IEEE TECHNOLOGY & ENGINEERING MANAGEMENT CONFERENCE (TEMSCON 2020), 2020,