Joint Hierarchical Semantic Clipping and Sentence Extraction for Document Summarization

Cited by: 2
Authors
Yan, Wanying [1]
Guo, Junjun [1]
Affiliations
[1] Kunming Univ Sci & Technol, Coll Informat Engn & Automat, Kunming, Yunnan, Peoples R China
Source
Funding
National Natural Science Foundation of China
Keywords
Extractive Summarization; Hierarchical Selective Encoding; Redundant Information Clipping;
DOI
10.3745/JIPS.04.0181
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Extractive document summarization aims to select a few sentences from a given document while preserving its main information, but current extractive methods do not address the problem of repeated information across the selected sentences, which is especially pronounced in news document summarization. Considering both the importance and the redundancy of information in news text, we propose a neural extractive summarization approach that jointly performs sentence semantic clipping and sentence selection, which effectively addresses the problem of sentence repetition in news summaries. Specifically, a hierarchical selective encoding network builds both sentence-level and document-level representations and extracts the important information from the news text; a sentence extractor then performs joint scoring and redundant information clipping. In this way, the model strikes a balance between important information extraction and redundant information filtering. Experimental results on the CNN/Daily Mail dataset and on a Court Public Opinion News dataset that we built show the effectiveness of the proposed approach in terms of ROUGE metrics, especially for redundant information filtering.
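As a rough illustration of the trade-off the abstract describes (rewarding important sentences while filtering redundant ones), the sketch below greedily selects sentences with a redundancy penalty. It is not the paper's neural model: it swaps in bag-of-words cosine similarity and a greedy rule in the style of maximal marginal relevance, and the names `bow`, `cosine`, `select_sentences`, and `redundancy_weight` are hypothetical choices made only for this example.

```python
import math
import re
from collections import Counter


def bow(text):
    """Lowercased bag-of-words vector, stored as a Counter."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two Counter vectors (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_sentences(sentences, k=2, redundancy_weight=0.7):
    """Greedily pick up to k sentence indices.

    Each candidate is rewarded for similarity to the whole document
    (a stand-in for importance) and penalized for similarity to the
    sentences already selected (a stand-in for redundancy clipping).
    """
    vectors = [bow(s) for s in sentences]
    doc = sum(vectors, Counter())  # document vector = sum of sentence vectors
    selected = []
    while len(selected) < min(k, len(sentences)):
        best, best_score = None, -math.inf
        for i, vec in enumerate(vectors):
            if i in selected:
                continue
            importance = cosine(vec, doc)
            redundancy = max(
                (cosine(vec, vectors[j]) for j in selected), default=0.0
            )
            score = (1 - redundancy_weight) * importance \
                - redundancy_weight * redundancy
            if score > best_score:
                best, best_score = i, score
        selected.append(best)
    return sorted(selected)  # restore document order
```

With a document whose first two sentences are identical, this rule picks the first copy for its importance and then skips the second copy in favor of a different sentence, because the duplicate's redundancy penalty outweighs its importance score. The weight of 0.7 is an arbitrary illustrative value, not one taken from the paper.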
Pages: 820-831
Number of pages: 12