Joint Hierarchical Semantic Clipping and Sentence Extraction for Document Summarization

被引:2
|
作者
Yan, Wanying [1 ]
Guo, Junjun [1 ]
机构
[1] Kunming Univ Sci & Technol, Coll Informat Engn & Automat, Kunming, Yunnan, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Extractive Summarization; Hierarchical Selective Encoding; Redundant Information Clipping;
D O I
10.3745/JIPS.04.0181
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extractive document summarization aims to select a few sentences while preserving its main information on a given document, but the current extractive methods do not consider the sentence-information repeat problem especially for news document summarization. In view of the importance and redundancy of news text information, in this paper, we propose a neural extractive summarization approach with joint sentence semantic clipping and selection, which can effectively solve the problem of news text summary sentence repetition. Specifically, a hierarchical selective encoding network is constructed for both sentence-level and document-level document representations, and data containing important information is extracted on news text; a sentence extractor strategy is then adopted for joint scoring and redundant information clipping. This way, our model strikes a balance between important information extraction and redundant information filtering. Experimental results on both CNN/Daily Mail dataset and Court Public Opinion News dataset we built are presented to show the effectiveness of our proposed approach in terms of ROUGE metrics, especially for redundant information filtering.
引用
收藏
页码:820 / 831
页数:12
相关论文
共 50 条
  • [1] MRS for multi-document summarization by sentence extraction
    Xu, Yong-Dong
    Zhang, Xiao-Dong
    Quan, Guang-Ri
    Wang, Ya-Dong
    TELECOMMUNICATION SYSTEMS, 2013, 53 (01) : 91 - 98
  • [2] MRS for multi-document summarization by sentence extraction
    Yong-Dong Xu
    Xiao-Dong Zhang
    Guang-Ri Quan
    Ya-Dong Wang
    Telecommunication Systems, 2013, 53 : 91 - 98
  • [3] Semantic Hierarchical Document Signature For Determining Sentence Similarity
    Manna, Sukanya
    Gedeon, Tom
    2010 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2010), 2010,
  • [4] Multi-document Text Summarization Using Sentence Extraction
    Ahuja, Ravinder
    Anand, Willson
    ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, ICAIECES 2016, 2017, 517 : 235 - 242
  • [5] A Joint Sentence Scoring and Selection Framework for Neural Extractive Document Summarization
    Zhou, Qingyu
    Yang, Nan
    Wei, Furu
    Huang, Shaohan
    Zhou, Ming
    Zhao, Tiejun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 671 - 681
  • [6] Sentence extraction using time features in multi-document summarization
    Lim, JM
    Kang, IS
    Bae, JHJ
    Lee, JH
    INFORMATION RETRIEVAL TECHNOLOGY, 2005, 3411 : 82 - 93
  • [7] Document Summarization Using Sentence-Level Semantic Based on Word Embeddings
    Al-Sabahi, Kamal
    Zhang Zuping
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2019, 29 (02) : 177 - 196
  • [8] Sentence Similarity Using Syntactic and Semantic Features for Multi-document Summarization
    Anjaneyulu, M.
    Sarma, S. S. V. N.
    Reddy, P. Vijaya Pal
    Chander, K. Prem
    Nagaprasad, S.
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, VOL 2, 2019, 56 : 471 - 485
  • [9] Multi-document summarization sentence ordering algorithm using semantic analysis
    Ji, Min
    Liao, Junbi
    Lei, Jingfa
    Yuan, Zhongfan
    Advances in Information Sciences and Service Sciences, 2012, 4 (14): : 125 - 131
  • [10] DOCUMENT SUMMARIZATION IN MALAYALAM WITH SENTENCE FRAMING
    Kishore, Kavya
    Gopal, Greeshma N.
    Neethu, P. H.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE (ICIS), 2016, : 194 - 200