Single document summarization using the information from documents with the same topic

被引:5
|
作者
Mao, Xiangke [1 ,2 ,3 ]
Huang, Shaobin [1 ]
Shen, Linshan [1 ]
Li, Rongsheng [1 ]
Yang, Hui [2 ,3 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Peoples R China
[2] CETC Big Data Res Inst Co Ltd, Guiyang 550022, Peoples R China
[3] Big Data Applicat Improving Govt Governance Capab, Guiyang 550022, Peoples R China
关键词
Extractive summarization; Neighborhood documents; Graph model; Biased LexRank; SENTENCE SCORING TECHNIQUES;
D O I
10.1016/j.knosys.2021.107265
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The essence of extractive summarization is to measure the importance of sentences in the document. When extracting summary from a single document, it is difficult to comprehensively and effectively evaluate the importance of sentences due to the lack of information. In this paper, we propose a kind of single document summarization method using information from documents under the same topic. This method integrates the topic information from neighborhood documents and statistical information from the target document to calculate the score of sentences. Then the scoring results are used as a prior scores for each sentence in the target document. After the target document is represented by the sentence graph, the final score of the sentences are obtained by the biased random walk algorithm. Finally, the Maximal Marginal Relevance (MMR) algorithm is used to select the sentences to form summary. The experimental results on the DUC2001 and DUC2002 datasets show that the effect of extracting summary is improved by incorporating information from the documents under the same topic. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Multi-document Summarization using Probabilistic Topic-based Network Models
    Yang, Cheng-Zen
    Fan, Jhih-Shang
    Liu, Yu-Fan
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2016, 32 (06) : 1613 - 1634
  • [32] Single Document Extractive Text Summarization Using Genetic Algorithms
    Chatterjee, Niladri
    Mittal, Amol
    Goyal, Shubham
    2012 THIRD INTERNATIONAL CONFERENCE ON EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2012, : 19 - 23
  • [33] A topic modeling based approach to novel document automatic summarization
    Wu, Zongda
    Lei, Li
    Li, Guiling
    Huang, Hui
    Zheng, Chengren
    Chen, Enhong
    Xu, Guandong
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 84 : 12 - 23
  • [34] Topic-Sensitive Multi-document Summarization Algorithm
    Liu Na
    Tang Xiao-jun
    Lu Ying
    Li Ming-xia
    Wang Hai-wen
    Xiao Peng
    2014 SIXTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2014, : 69 - 74
  • [35] A novel contextual topic model for multi-document summarization
    Yang, Guangbing
    Wen, Dunwei
    Kinshuk
    Chen, Nian-Shing
    Sutinen, Erkki
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (03) : 1340 - 1352
  • [36] A topic Approach to Sentence Ordering for Multi-document Summarization
    Na, Liu
    Peng, Xiao
    Ying, Lu
    Tang Xiao-jun
    Wang Hai-wen
    Li Ming-xia
    2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 1390 - 1395
  • [37] Topic-Sensitive Multi-document Summarization Algorithm
    Liu Na
    Di Tang
    Lu Ying
    Tang Xiao-jun
    Wang Hai-wen
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2015, 12 (04) : 1375 - 1389
  • [38] Topic-Guided Abstractive Multi-Document Summarization
    Cui, Peng
    Hu, Le
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1463 - 1472
  • [39] Chinese spoken document summarization using probabilistic latent topical information
    Chen, Berlin
    Yeh, Yao-Ming
    Huang, Yao-Min
    Chen, Yi-Ting
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 969 - 972
  • [40] Mining Both Commonality and Specificity From Multiple Documents for Multi-Document Summarization
    Ma, Bing
    IEEE ACCESS, 2024, 12 : 54371 - 54381