Comparison of Graph Based Document Summarization Method

被引:0
|
作者
Kaynar, Oguz [1 ]
Gormez, Yasin [1 ]
Isik, Yunus Emre [1 ]
Demirkoparan, Ferhan [1 ]
机构
[1] Cumuriyet Univ, Yonetim Bilisim Sistcmleri, Sivas, Turkey
关键词
document Summarization; lexRank; textRank; longest Common Subsequence;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Today, with the development of the internet, documents containing information such as articles, news, web pages are produced and stored in digital environment. However, the increase in the number of media where people are able to add new contents such as social media, Twitter, and blog has increased the amount of information on the internet to enormous size. However, it is very difficult and time-consuming to determine whether or not information under research is reached. Automated document summarization systems can reduce the size of the text while keeping the important part of the text and present quickly whether the text contains the desired information. In this study, graph based document summarization methods are discussed. Besides the LexRank method, TextRank algorithm is used with 4 different similarity methods. Unlike other studies, Longest Common Subsequence (LCS), a similarity measure method, is used as a measure of similarity between nodes in the TextRank algorithm. Among the similarity measurement methods used, the longest subset achieved the best success by taking 0,510 Rogue1 and 0,266 Rouge-2 scores in English dataset. Similarly, the same method yields 0,742 Rouge-1 and 0,676 Rouge-2 scores in Turkish data set, which are better than other methods.
引用
收藏
页码:598 / 603
页数:6
相关论文
共 50 条
  • [1] Tamil Document Summarization using semantic graph method
    Banu, M.
    Karthika, C.
    Sudarmani, P.
    Geetha, T., V
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 128 - 134
  • [2] Enhanced Graph Based Approach for Multi Document Summarization
    Hariharan, Shanmugasundaram
    Ramkumar, Thirunavukarasu
    Srinivasan, Rengaramanujam
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2013, 10 (04) : 334 - 341
  • [3] Comparison of Document Index Graph Using TextRank and HITS Weighting Method in Automatic Text Summarization
    Hadyan, Fadhlil
    Shaufiah
    Bijaksana, Moch. Arif
    1ST INTERNATIONAL CONFERENCE ON COMPUTING AND APPLIED INFORMATICS 2016 : APPLIED INFORMATICS TOWARD SMART ENVIRONMENT, PEOPLE, AND SOCIETY, 2017, 801
  • [4] Graph-based extractive text summarization based on single document
    Avaneesh Kumar Yadav
    Rama Shankar Ranvijay
    Ashish Kumar Yadav
    Multimedia Tools and Applications, 2024, 83 : 18987 - 19013
  • [5] Extractive Document Summarization Model Based on Heterogeneous Graph and Keywords
    Zhu Q.
    Wang Y.
    Xu J.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2024, 53 (02): : 259 - 270
  • [6] Graph Based Automatic Document Summarization with Different Similarity Methods
    Kaynar, Oguz
    Isik, Yunus Emre
    Gormez, Yasin
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [7] A Graph Based Query Focused Multi-Document Summarization
    Balaji, J.
    Geetha, T.
    Parthasarathi, Ranjani
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2014, 10 (01) : 16 - 41
  • [8] Graph-based extractive text summarization based on single document
    Yadav, Avaneesh Kumar
    Ranvijay, Rama Shankar
    Yadav, Rama Shankar
    Maurya, Ashish Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 18987 - 19013
  • [9] Learning Similarity Functions in Graph-Based Document Summarization
    Ouyang, You
    Li, Wenjie
    Wei, Furu
    Lu, Qin
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES: LANGUAGE TECHNOLOGY FOR THE KNOWLEDGE-BASED ECONOMY, 2009, 5459 : 189 - 200
  • [10] An Extractive Malayalam Document Summarization Based on Graph Theoretic Approach
    Ajmal, E. B.
    Haroon, Rosna P.
    PROCEEDINGS 2015 FIFTH INTERNATIONAL CONFERENCE ON E-LEARNING (ECONF 2015), 2015, : 237 - 240