Joint Hierarchical Semantic Clipping and Sentence Extraction for Document Summarization

Cited by: 2
Authors
Yan, Wanying [1]
Guo, Junjun [1]
Affiliations
[1] Kunming Univ Sci & Technol, Coll Informat Engn & Automat, Kunming, Yunnan, Peoples R China
Source
Funding
National Natural Science Foundation of China;
Keywords
Extractive Summarization; Hierarchical Selective Encoding; Redundant Information Clipping;
DOI
10.3745/JIPS.04.0181
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline Classification Code
0812 ;
Abstract
Extractive document summarization aims to select a few sentences from a given document while preserving its main information, but current extractive methods do not address the problem of repeated information across sentences, especially in news document summarization. Considering both the importance and the redundancy of information in news text, we propose a neural extractive summarization approach that jointly performs sentence semantic clipping and selection, which can effectively address sentence repetition in news summaries. Specifically, a hierarchical selective encoding network is constructed for both sentence-level and document-level representations, and the important information in the news text is extracted; a sentence extractor strategy is then adopted for joint scoring and redundant information clipping. In this way, our model strikes a balance between important information extraction and redundant information filtering. Experimental results on both the CNN/Daily Mail dataset and a Court Public Opinion News dataset that we built show the effectiveness of the proposed approach in terms of ROUGE metrics, especially for redundant information filtering.
Pages: 820-831
Number of pages: 12