Single document summarization using the information from documents with the same topic

被引:5
|
作者
Mao, Xiangke [1 ,2 ,3 ]
Huang, Shaobin [1 ]
Shen, Linshan [1 ]
Li, Rongsheng [1 ]
Yang, Hui [2 ,3 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Peoples R China
[2] CETC Big Data Res Inst Co Ltd, Guiyang 550022, Peoples R China
[3] Big Data Applicat Improving Govt Governance Capab, Guiyang 550022, Peoples R China
关键词
Extractive summarization; Neighborhood documents; Graph model; Biased LexRank; SENTENCE SCORING TECHNIQUES;
D O I
10.1016/j.knosys.2021.107265
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The essence of extractive summarization is to measure the importance of sentences in the document. When extracting summary from a single document, it is difficult to comprehensively and effectively evaluate the importance of sentences due to the lack of information. In this paper, we propose a kind of single document summarization method using information from documents under the same topic. This method integrates the topic information from neighborhood documents and statistical information from the target document to calculate the score of sentences. Then the scoring results are used as a prior scores for each sentence in the target document. After the target document is represented by the sentence graph, the final score of the sentences are obtained by the biased random walk algorithm. Finally, the Maximal Marginal Relevance (MMR) algorithm is used to select the sentences to form summary. The experimental results on the DUC2001 and DUC2002 datasets show that the effect of extracting summary is improved by incorporating information from the documents under the same topic. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Single Document Summarization as Tree Induction
    Liu, Yang
    Titov, Ivan
    Lapata, Mirella
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1745 - 1755
  • [42] Unsupervised neural networks for automatic Arabic text summarization using document clustering and topic modeling
    Alami, Nabil
    Meknassi, Mohammed
    En-nahnahi, Noureddine
    El Adlouni, Yassine
    Ammor, Ouafae
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 172
  • [43] Using Topic in Summarization for Vietnamese Paragraph
    Dieu, Dat Tien
    Dinh, Dien
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 735 - 740
  • [44] Image Summarization using Topic Modelling
    Sharma, Vasu
    Agrawal, Nishant
    Singh, Puneet
    Kulshreshtha, Rajat
    Kumar, Akshay
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (ICSIPA), 2015, : 226 - 231
  • [45] Using Topic Labels for Text Summarization
    Kou, Wanqiu
    Li, Fang
    Ye, Zhe
    ADVANCES IN ARTIFICIAL INTELLIGENCE: FROM THEORY TO PRACTICE (IEA/AIE 2017), PT II, 2017, 10351 : 448 - 457
  • [46] Leveraging Information Bottleneck for Scientific Document Summarization
    Ju, Jiaxin
    Liu, Ming
    Koh, Huan Yee
    Jin, Yuan
    Du, Lan
    Pan, Shirui
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4091 - 4098
  • [47] Extractive spoken document summarization for information retrieval
    Chen, Berlin
    Chen, Yi-Ting
    PATTERN RECOGNITION LETTERS, 2008, 29 (04) : 426 - 437
  • [48] SINGLE DOCUMENT TEXT SUMMARIZATION USING RANDOM INDEXING AND NEURAL NETWORKS
    Chatterjee, Niladri
    Bhardwaj, Avikant
    KEOD 2010: Proceedings of the International Conference on Knowledge Engineering and Ontology Development, 2010, : 171 - 176
  • [49] Empirical Analysis of Single and Multi Document Summarization using Clustering Algorithms
    Bewoor, Mrunal S.
    Patil, Suhas H.
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2018, 8 (01) : 2562 - 2567
  • [50] Single document extractive text summarization using cuckoo search algorithm
    Pati, Siba Prasad
    Rautray, Rasmita
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2022, 43 (05): : 1089 - 1097