Comparison of Graph Based Document Summarization Method

被引:0
|
作者
Kaynar, Oguz [1 ]
Gormez, Yasin [1 ]
Isik, Yunus Emre [1 ]
Demirkoparan, Ferhan [1 ]
机构
[1] Cumuriyet Univ, Yonetim Bilisim Sistcmleri, Sivas, Turkey
关键词
document Summarization; lexRank; textRank; longest Common Subsequence;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Today, with the development of the internet, documents containing information such as articles, news, web pages are produced and stored in digital environment. However, the increase in the number of media where people are able to add new contents such as social media, Twitter, and blog has increased the amount of information on the internet to enormous size. However, it is very difficult and time-consuming to determine whether or not information under research is reached. Automated document summarization systems can reduce the size of the text while keeping the important part of the text and present quickly whether the text contains the desired information. In this study, graph based document summarization methods are discussed. Besides the LexRank method, TextRank algorithm is used with 4 different similarity methods. Unlike other studies, Longest Common Subsequence (LCS), a similarity measure method, is used as a measure of similarity between nodes in the TextRank algorithm. Among the similarity measurement methods used, the longest subset achieved the best success by taking 0,510 Rogue1 and 0,266 Rouge-2 scores in English dataset. Similarly, the same method yields 0,742 Rouge-1 and 0,676 Rouge-2 scores in Turkish data set, which are better than other methods.
引用
收藏
页码:598 / 603
页数:6
相关论文
共 50 条
  • [21] Grapharizer: A Graph-Based Technique for Extractive Multi-Document Summarization
    Jalil, Zakia
    Nasir, Muhammad
    Alazab, Moutaz
    Nasir, Jamal
    Amjad, Tehmina
    Alqammaz, Abdullah
    ELECTRONICS, 2023, 12 (08)
  • [22] Extractive multi-document text summarization based on graph independent sets
    Uckan, Taner
    Karci, Ali
    EGYPTIAN INFORMATICS JOURNAL, 2020, 21 (03) : 145 - 157
  • [23] Summarizing learning materials using graph based multi-document summarization
    Krishnaveni P.
    Balasundaram S.R.
    International Journal of Web-Based Learning and Teaching Technologies, 2021, 16 (05) : 39 - 57
  • [24] A Proposed Textual Graph Based Model for Arabic Multi-document Summarization
    Alwan, Muneer A.
    Onsi, Hoda M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (06) : 435 - 439
  • [25] Enhancing Multi-Document Summarization with Cross-Document Graph-based Information Extraction
    Zhang, Zixuan
    Elfardy, Heba
    Dreyer, Markus
    Small, Kevin
    Ji, Heng
    Bansal, Mohit
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1696 - 1707
  • [26] Comparison of Multi Document Summarization Techniques
    Nedunchelian, R.
    Muthucumarasamy, R.
    Saranathan, E.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2011, 11 (03): : 155 - 160
  • [27] Automatic Text Document Summarization Using Graph Based Centrality Measures on Lexical Network
    Yadav, Chandra Shakhar
    Sharan, Aditi
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2018, 8 (03) : 14 - 32
  • [28] Graph-Based Query-Focused Multi-document Summarization Using Improved Affinity Graph
    Hu, Po
    He, Jiacong
    Zhang, Yong
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2015, 2015, 9403 : 336 - 347
  • [29] Incorporating External Knowledge into Unsupervised Graph Model for Document Summarization
    Tang, Tiancheng
    Yuan, Tianyi
    Tang, Xinhuai
    Chen, Delai
    ELECTRONICS, 2020, 9 (09) : 1 - 13
  • [30] HHGraphSum: Hierarchical heterogeneous graph learning for extractive document summarization
    Hao, Pengyi
    Wu, Cunqi
    Bai, Cong
    DISPLAYS, 2025, 86