Comparison of Graph Based Document Summarization Method

Cited by: 0
Authors
Kaynar, Oguz [1]
Gormez, Yasin [1]
Isik, Yunus Emre [1]
Demirkoparan, Ferhan [1]
Affiliations
[1] Cumhuriyet Univ, Yonetim Bilisim Sistemleri, Sivas, Turkey
Keywords
document summarization; LexRank; TextRank; longest common subsequence
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Today, with the growth of the internet, documents such as articles, news items, and web pages are produced and stored digitally. The growing number of platforms where people can add new content, such as social media, Twitter, and blogs, has increased the amount of information on the internet to an enormous size. As a result, it is difficult and time-consuming to determine whether a given document contains the information being sought. Automated document summarization systems can shorten a text while keeping its important parts, making it quick to see whether the text contains the desired information. In this study, graph-based document summarization methods are discussed. Besides the LexRank method, the TextRank algorithm is used with four different similarity methods. Unlike other studies, Longest Common Subsequence (LCS) is used as the measure of similarity between nodes in the TextRank algorithm. Among the similarity measures used, LCS achieved the best results, with ROUGE-1 and ROUGE-2 scores of 0.510 and 0.266 on the English dataset. Similarly, the same method yields ROUGE-1 and ROUGE-2 scores of 0.742 and 0.676 on the Turkish dataset, which are better than those of the other methods.
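To make the idea concrete, the following is a minimal sketch of TextRank-style sentence ranking with an LCS-based similarity as the edge weight. It is not the authors' implementation: the abstract does not give the exact formulation, so the whitespace tokenization, the normalization of LCS length by the longer sentence, the damping factor of 0.85, the fixed iteration count, and all function names are assumptions made for illustration.

```python
from typing import List


def lcs_length(a: List[str], b: List[str]) -> int:
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            # Extend the diagonal on a match, otherwise carry the best neighbor.
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def lcs_similarity(a: List[str], b: List[str]) -> float:
    """LCS length divided by the longer sentence length (assumed normalization)."""
    if not a or not b:
        return 0.0
    return lcs_length(a, b) / max(len(a), len(b))


def textrank(sentences: List[str], damping: float = 0.85,
             iterations: int = 50) -> List[float]:
    """Score sentences with weighted PageRank over an LCS-similarity graph."""
    n = len(sentences)
    if n == 0:
        return []
    tokens = [s.lower().split() for s in sentences]  # assumed tokenization
    # Symmetric edge weights between sentence nodes; no self-loops.
    w = [[lcs_similarity(tokens[i], tokens[j]) if i != j else 0.0
          for j in range(n)] for i in range(n)]
    out_sum = [sum(row) for row in w]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        scores = [
            (1 - damping) / n
            + damping * sum(w[j][i] / out_sum[j] * scores[j]
                            for j in range(n) if out_sum[j] > 0)
            for i in range(n)
        ]
    return scores


def summarize(sentences: List[str], k: int = 2) -> List[str]:
    """Return the k highest-scoring sentences in their original order."""
    scores = textrank(sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

In this sketch a sentence that shares no tokens with any other sentence receives only the baseline score `(1 - damping) / n`, so it is the last to be selected for the summary.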
Pages: 598-603
Number of pages: 6
Related papers
50 in total
  • [31] Multi-document extractive summarization using semantic graph
    del Camino Valle, Oleyda
    Simon-Cuevas, Alfredo
    Valladares-Valdes, Eduardo
    Olivas, Jose A.
    Romero, Francisco P.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2019, (63): : 103 - 110
  • [32] StarSum: A Simple Star Graph for Multi-document Summarization
    Al-Dhelaan, Mohammed
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 715 - 718
  • [33] PSG: a two-layer graph model for document summarization
    Heng CHEN
    Hai JIN
    Feng ZHAO
    Frontiers of Computer Science, 2014, 8 (01) : 119 - 130
  • [34] PSG: a two-layer graph model for document summarization
    Heng Chen
    Hai Jin
    Feng Zhao
    Frontiers of Computer Science, 2014, 8 : 119 - 130
  • [35] An Entailment-based Scoring Method for Content Selection in Document Summarization
    Dang Hoang Long
    Minh-Tien Nguyen
    Ngo Xuan Bach
    Le-Minh Nguyen
    Tu Minh Phuong
    PROCEEDINGS OF THE NINTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2018), 2018, : 122 - 129
  • [36] Compressed Heterogeneous Graph for Abstractive Multi-Document Summarization
    Li, Miao
    Qi, Jianzhong
    Lau, Jey Han
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13085 - 13093
  • [37] Unsupervised Document Summarization Using Clusters of Dependency Graph Nodes
    El-Kilany, Ayman
    Saleh, Iman
    2012 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2012, : 557 - 561
  • [38] MULTI-DOCUMENT SUMMARIZATION SYSTEMS COMPARISON
    Li, Lei
    Heng, Wei
    Liu, Ping'an
    2012 IEEE 2nd International Conference on Cloud Computing and Intelligent Systems (CCIS) Vols 1-3, 2012, : 1409 - 1413
  • [39] PSG: a two-layer graph model for document summarization
    Chen, Heng
    Jin, Hai
    Zhao, Feng
    FRONTIERS OF COMPUTER SCIENCE, 2014, 8 (01) : 119 - 130
  • [40] Parallel Relationship Graph to Improve Multi-Document Summarization
    Lu, Menghua
    Liang, Lijia
    Liu, Gongshen
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 630 - 642