SGATS: Semantic Graph-based Automatic Text Summarization from Hindi Text Documents

Cited by: 2
Authors
Joshi, Manju Lata [1, 3]
Joshi, Nisheeth [1, 3]
Mittal, Namita [2]
Affiliations
[1] Banasthali Vidyapith, Banasthali, Niwai, India
[2] Malaviya Natl Inst Technol, Dept Comp Engn, Jaipur, Rajasthan, India
[3] Banasthali Univ, Dept Comp Sci, Niwai, Rajasthan, India
Keywords
Semantic network; graphical measures; ROUGE; correlation coefficient; RECOGNITION; ONTOLOGY; RANKING
DOI
10.1145/3464381
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Creating a coherent summary of a text is a challenging task in Natural Language Processing (NLP). Various Automatic Text Summarization techniques have been developed for both abstractive and extractive summarization. This study focuses on extractive summarization, which selects representative paragraphs or sentences from the original text and combines them into a summary shorter than the source document(s). Methods used for extractive summarization include graph-theoretic approaches, machine learning, Latent Semantic Analysis (LSA), neural networks, clustering, and fuzzy logic. In this paper, a semantic graph-based approach, SGATS (Semantic Graph-based approach for Automatic Text Summarization), is proposed to generate an extractive summary. The approach constructs a semantic graph of the original Hindi text document by establishing semantic relationships between the sentences of the document, using the Hindi Wordnet ontology as a background knowledge source. Once the semantic graph is constructed, fourteen different graph-theoretical measures are applied to rank the document sentences according to their semantic scores. The approach is applied to two data sets from different domains, Tourism and Health. Its performance is compared with the state-of-the-art TextRank algorithm and with human-annotated summaries, and is evaluated using the widely accepted ROUGE measures. The results show that the proposed system produces better results than TextRank on the health-domain corpus and comparable results on the tourism corpus. Further, correlation coefficient methods are applied to find the correlation between eight different graphical measures, and it is observed that most of the graphical measures are highly correlated.
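The pipeline described in the abstract (build a sentence graph, score sentences with a graph measure, keep the top-ranked ones) can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a hypothetical `related` set of word pairs standing in for Hindi Wordnet relations, uses weighted degree as a single example measure rather than the fourteen measures evaluated in the paper, and uses English toy sentences for readability.

```python
from itertools import combinations


def build_graph(sentences, related):
    """Build a weighted, undirected sentence graph as an adjacency dict.

    `related` is a hypothetical set of frozenset word pairs; it stands in
    for semantic relations that would come from a lexical ontology.
    """
    tokens = [set(s.lower().split()) for s in sentences]
    graph = {i: {} for i in range(len(sentences))}
    for i, j in combinations(range(len(sentences)), 2):
        # Edge weight: number of word pairs that are identical or related.
        weight = sum(
            1
            for a in tokens[i]
            for b in tokens[j]
            if a == b or frozenset((a, b)) in related
        )
        if weight:
            graph[i][j] = weight
            graph[j][i] = weight
    return graph


def weighted_degree(graph):
    """Score each sentence by the sum of its incident edge weights."""
    return {node: sum(nbrs.values()) for node, nbrs in graph.items()}


def summarize(sentences, related, k=2):
    """Return the k highest-scoring sentences in their original order."""
    scores = weighted_degree(build_graph(sentences, related))
    top = sorted(scores, key=lambda i: (-scores[i], i))[:k]
    return [sentences[i] for i in sorted(top)]


# Toy data (assumed for illustration only).
sentences = [
    "the temple attracts tourists",
    "tourists visit the fort",
    "rain fell yesterday",
]
related = {frozenset(("temple", "fort"))}

print(summarize(sentences, related, k=2))
```

Swapping `weighted_degree` for another centrality function changes the ranking criterion without touching graph construction, which is the kind of comparison across measures that the abstract describes.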
Pages: 32