Comparison of Graph Based Document Summarization Method

Authors
Kaynar, Oguz [1 ]
Gormez, Yasin [1 ]
Isik, Yunus Emre [1 ]
Demirkoparan, Ferhan [1 ]
Affiliations
[1] Cumhuriyet Univ, Yonetim Bilisim Sistemleri, Sivas, Turkey
Keywords
document summarization; LexRank; TextRank; longest common subsequence
DOI
Not available
Chinese Library Classification number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
With the growth of the internet, documents such as articles, news items, and web pages are now produced and stored digitally. The spread of media where people can add new content, such as social media, Twitter, and blogs, has increased the amount of information on the internet enormously. As a result, it is difficult and time-consuming to determine whether a given text contains the information being sought. Automated document summarization systems reduce the size of a text while keeping its important parts, and so show quickly whether the text contains the desired information. This study discusses graph-based document summarization methods. Besides the LexRank method, the TextRank algorithm is used with 4 different similarity methods. Unlike other studies, the Longest Common Subsequence (LCS) is used as the measure of similarity between nodes in the TextRank algorithm. Among the similarity measures used, LCS achieved the best results on the English dataset, with a ROUGE-1 score of 0.510 and a ROUGE-2 score of 0.266. Similarly, the same method yields a ROUGE-1 score of 0.742 and a ROUGE-2 score of 0.676 on the Turkish dataset, which are better than the scores of the other methods.
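The idea of using LCS as the edge weight in TextRank can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how the LCS length is normalized, how sentences are tokenized, or which damping factor is used, so the word-level tokenizer, the normalization by the longer sentence's length, the damping value of 0.85, and every function name below are assumptions made for illustration only.

```python
import re


def tokenize(sentence):
    # Assumed tokenizer: lowercase word tokens (the abstract does not specify one).
    return re.findall(r"\w+", sentence.lower())


def lcs_length(a, b):
    # Classic dynamic-programming LCS length, keeping only one previous row.
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def lcs_similarity(a, b):
    # Assumed normalization: LCS length divided by the longer token list.
    if not a or not b:
        return 0.0
    return lcs_length(a, b) / max(len(a), len(b))


def textrank(sentences, damping=0.85, iterations=50):
    # Weighted PageRank over a sentence graph whose edge weights are LCS similarities.
    n = len(sentences)
    if n == 0:
        return []
    tokens = [tokenize(s) for s in sentences]
    weights = [
        [0.0 if i == j else lcs_similarity(tokens[i], tokens[j]) for j in range(n)]
        for i in range(n)
    ]
    out_sum = [sum(row) for row in weights]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        new_scores = []
        for i in range(n):
            # Each neighbour j passes on its score in proportion to the edge weight.
            rank = sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n)
                if out_sum[j] > 0
            )
            new_scores.append((1 - damping) / n + damping * rank)
        scores = new_scores
    return scores


def summarize(sentences, k=2):
    # Keep the k highest-scoring sentences, returned in their original order.
    scores = textrank(sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling summarize(sentences, k) on a list of sentence strings returns the k sentences that are most central in the similarity graph. Because LCS respects word order, two sentences that share the same words in a different order score lower than they would under a bag-of-words measure such as cosine similarity.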
Pages: 598-603
Number of pages: 6