An Extractive Malayalam Document Summarization Based on Graph Theoretic Approach

被引:2
|
作者
Ajmal, E. B. [1 ]
Haroon, Rosna P. [1 ]
机构
[1] Ilahia Coll Engn & Technol, Dept CSE, Muvattupuzha, India
关键词
Undirected Graph; Node; Edge; Common word; Threshold; Sub graph;
D O I
10.1109/ECONF.2015.41
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Text summarization is a way to condense the large amount of information into a concise form by the process of selection of important information and discarding unimportant and redundant information. The need for Text summarization has increased much due to the abundance of documents in the internet. Even though a lot of text summarization systems have been developed for summarizing documents in various languages, there is no such well performing system for Malayalam. In this paper, we propose the use of Graph theoretic approach for summarizing Malayalam documents that is motivated by the method of identification of themes. After the common preprocessing steps, namely, stop word removal and stemming, sentences in the documents are represented as nodes in an undirected graph. There is a node for every sentence. Two sentences are connected with an edge if the two sentences share some common words, or in other words, their (cosine, or such) similarity is above some threshold. This representation yields two results: The partitions contained in the graph (that is those sub-graphs that are unconnected to the other sub graphs), form distinct topics covered in the documents. The second result yielded by the graph-theoretic method is the identification of the important sentences in the document. We apply graph theoretic approach on Malayalam text summarization task and achieve comparable results to the state of the art.
引用
收藏
页码:237 / 240
页数:4
相关论文
共 50 条
  • [1] Malayalam Text Summarization: An Extractive Approach
    Krishnaprasad, P.
    Sooryanarayanan, A.
    Ramanujan, Ajeesh
    [J]. 2016 INTERNATIONAL CONFERENCE ON NEXT GENERATION INTELLIGENT SYSTEMS (ICNGIS), 2016, : 40 - 43
  • [2] Graph-based extractive text summarization based on single document
    Avaneesh Kumar Yadav
    Rama Shankar Ranvijay
    Ashish Kumar Yadav
    [J]. Multimedia Tools and Applications, 2024, 83 : 18987 - 19013
  • [3] Graph-based extractive text summarization based on single document
    Yadav, Avaneesh Kumar
    Ranvijay, Rama Shankar
    Yadav, Rama Shankar
    Maurya, Ashish Kumar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 18987 - 19013
  • [4] Attention based Abstractive Summarization of Malayalam Document
    Nambiar, Sindhya K.
    Peter, David S.
    Idicula, Sumam Mary
    [J]. AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 250 - 257
  • [5] Extractive Arabic Text Summarization-Graph-Based Approach
    AL-Khassawneh, Yazan Alaya
    Hanandeh, Essam Said
    [J]. ELECTRONICS, 2023, 12 (02)
  • [6] Malayalam Text Summarization: Minimum Spanning Tree Based Graph Reduction Approach
    Raj, Rahul M.
    Haroon, Rosna P.
    [J]. 2016 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION, & AUTOMATION (ICACCA) (FALL), 2016, : 246 - +
  • [7] Grapharizer: A Graph-Based Technique for Extractive Multi-Document Summarization
    Jalil, Zakia
    Nasir, Muhammad
    Alazab, Moutaz
    Nasir, Jamal
    Amjad, Tehmina
    Alqammaz, Abdullah
    [J]. ELECTRONICS, 2023, 12 (08)
  • [8] Extractive multi-document text summarization based on graph independent sets
    Uckan, Taner
    Karci, Ali
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2020, 21 (03) : 145 - 157
  • [9] Enhanced Graph Based Approach for Multi Document Summarization
    Hariharan, Shanmugasundaram
    Ramkumar, Thirunavukarasu
    Srinivasan, Rengaramanujam
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2013, 10 (04) : 334 - 341
  • [10] Multi-document extractive summarization using semantic graph
    del Camino Valle, Oleyda
    Simon-Cuevas, Alfredo
    Valladares-Valdes, Eduardo
    Olivas, Jose A.
    Romero, Francisco P.
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2019, (63): : 103 - 110