Single document summarization using the information from documents with the same topic

被引:5
|
作者
Mao, Xiangke [1 ,2 ,3 ]
Huang, Shaobin [1 ]
Shen, Linshan [1 ]
Li, Rongsheng [1 ]
Yang, Hui [2 ,3 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Peoples R China
[2] CETC Big Data Res Inst Co Ltd, Guiyang 550022, Peoples R China
[3] Big Data Applicat Improving Govt Governance Capab, Guiyang 550022, Peoples R China
关键词
Extractive summarization; Neighborhood documents; Graph model; Biased LexRank; SENTENCE SCORING TECHNIQUES;
D O I
10.1016/j.knosys.2021.107265
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The essence of extractive summarization is to measure the importance of sentences in the document. When extracting summary from a single document, it is difficult to comprehensively and effectively evaluate the importance of sentences due to the lack of information. In this paper, we propose a kind of single document summarization method using information from documents under the same topic. This method integrates the topic information from neighborhood documents and statistical information from the target document to calculate the score of sentences. Then the scoring results are used as a prior scores for each sentence in the target document. After the target document is represented by the sentence graph, the final score of the sentences are obtained by the biased random walk algorithm. Finally, the Maximal Marginal Relevance (MMR) algorithm is used to select the sentences to form summary. The experimental results on the DUC2001 and DUC2002 datasets show that the effect of extracting summary is improved by incorporating information from the documents under the same topic. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Automatic Single Document Text Summarization Using Key Concepts in Documents
    Sarkar, Kamal
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2013, 9 (04): : 602 - 620
  • [2] Using Topic Themes for Multi-Document Summarization
    Harabagiu, Sanda
    Lacatusu, Finley
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2010, 28 (03)
  • [3] Single Document Summarization Based on Local Topic Identification and Word Frequency
    Teng, Zhi
    Liu, Ye
    Ren, Fuji
    Tsuchiya, Seiji
    Ren, Fuji
    PROCEEDINGS OF THE SPECIAL SESSION OF THE SEVENTH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE - MICAI 2008, 2008, : 37 - +
  • [4] A topic modeled unsupervised approach to single document extractive text summarization
    Srivastava, Ridam
    Singh, Prabhav
    Rana, K. P. S.
    Kumar, Vineet
    KNOWLEDGE-BASED SYSTEMS, 2022, 246
  • [5] Topic Generation for Web Document Summarization
    Hsu, Heng-Yao
    Tsai, Chun-Wei
    Chiang, Ming-Chao
    Yang, Chu-Sing
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 3701 - +
  • [6] A study of extractive summarization of long documents incorporating local topic and hierarchical information
    Wang, Ting
    Yang, Chuan
    Zou, Maoyang
    Liang, Jiaying
    Xiang, Dong
    Yang, Wenjie
    Wang, Hongyang
    Li, Jia
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [7] Organization of Documents for Multiple Document Summarization
    Wang, Fu Lee
    Wong, Tak-Lam
    Mak, Aston Nai Hong
    SHORT PAPER PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON WEB-BASED LEARNING, 2009, : 98 - +
  • [8] Information extraction and summarization from medical documents
    Spyropoulos, CD
    Karkatetsis, V
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 33 (02) : 107 - 110
  • [9] Spoken document summarization using relevant information
    Chen, Yi-Ting
    Lin, Shih-Hsiang
    Wang, Hsin-Min
    Chen, Berlin
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 189 - +
  • [10] Summarization of Multi-Document Topic Hierarchies using Submodular Mixtures
    Bairi, Ramakrishna B.
    Iyer, Rishabh
    Ramakrishnan, Ganesh
    Bilmes, Jeff
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 553 - 563