Optimizing Sentence Modeling and Selection for Document Summarization

被引:0
|
作者
Yin, Wenpeng [1 ]
Pei, Yulong [2 ]
机构
[1] Univ Munich, Ctr Informat & Language Proc, Munich, Germany
[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extractive document summarization aims to conclude given documents by extracting some salient sentences. Often, it faces two challenges: 1) how to model the information redundancy among candidate sentences; 2) how to select the most appropriate sentences. This paper attempts to build a strong summarizer DivSelect+CNNLM by presenting new algorithms to optimize each of them. Concretely, it proposes CNNLM, a novel neural network language model (NNLM) based on convolutional neural network (CNN), to project sentences into dense distributed representations, then models sentence redundancy by cosine similarity. Afterwards, it formulates the selection process as an optimization problem, constructing a diversified selection process (DivSelect) with the aim of selecting some sentences which have high prestige, meantime, are dis-similar with each other. Experimental results on DUC2002 and DUC2004 benchmark data sets demonstrate the effectiveness of our approach.
引用
收藏
页码:1383 / 1389
页数:7
相关论文
共 50 条
  • [1] A progressive sentence selection strategy for document summarization
    Ouyang, You
    Li, Wenjie
    Zhang, Renxian
    Li, Sujian
    Lu, Qin
    INFORMATION PROCESSING & MANAGEMENT, 2013, 49 (01) : 213 - 221
  • [2] Comparative Document Summarization via Discriminative Sentence Selection
    Wang, Dingding
    Zhu, Shenghuo
    Li, Tao
    Gong, Yihong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2012, 6 (03)
  • [3] Comparative Document Summarization via Discriminative Sentence Selection
    Wang, Dingding
    Zhu, Shenghuo
    Li, Tao
    Gong, Yihong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2013, 7 (01)
  • [4] Research on sentence optimum selection algorithm for multi-document summarization
    Zhang, Shu
    Zhao, Tie-Jun
    Yao, Chao
    Zheng, De-Quan
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2008, 30 (12): : 2921 - 2925
  • [5] A Joint Sentence Scoring and Selection Framework for Neural Extractive Document Summarization
    Zhou, Qingyu
    Yang, Nan
    Wei, Furu
    Huang, Shaohan
    Zhou, Ming
    Zhao, Tiejun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 671 - 681
  • [6] DOCUMENT SUMMARIZATION IN MALAYALAM WITH SENTENCE FRAMING
    Kishore, Kavya
    Gopal, Greeshma N.
    Neethu, P. H.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE (ICIS), 2016, : 194 - 200
  • [7] Document Summarization Using Sentence Features
    Rautray, Rasmita
    Balabantaray, Rakesh Chandra
    Bhardwaj, Anisha
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2015, 5 (01) : 36 - 47
  • [8] TWO-STAGE SENTENCE SELECTION APPROACH FOR MULTI-DOCUMENT SUMMARIZATION
    Zhang Shu Zhao Tiejun Zheng Dequan Zhao Hua (Department of Computer Science and Technology
    Journal of Electronics(China), 2008, (04) : 562 - 567
  • [9] Sentence selection for generic document summarization using an adaptive differential evolution algorithm
    Alguliev, Rasim M.
    Aliguliyev, Ramiz M.
    Mehdiyev, Chingiz A.
    SWARM AND EVOLUTIONARY COMPUTATION, 2011, 1 (04) : 213 - 222
  • [10] Estimating Risk of Picking a Sentence for Document Summarization
    Kumar, Chandan
    Pingali, Prasad
    Varma, Vasudeva
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2009, 5449 : 571 - 581