An extractive text summarization approach using tagged-LDA based topic modeling

被引:0
|
作者
Ruby Rani
D. K. Lobiyal
机构
[1] Jawaharlal Nehru University,School of Computer & Systems Sciences
来源
关键词
Topic modeling; Hindi novel; Topic diversity; Retention ratio; Tagged-LDA;
D O I
暂无
中图分类号
学科分类号
摘要
Automatic text summarization is an exertion of contriving the abridged form of a text document covering salient knowledge. Numerous statistical, linguistic, rule-based, and position-based text summarization approaches have been explored for different rich-resourced languages. For under-resourced languages such as Hindi, automatic text summarization is a challenging task and still an unsolved problem. Another issue with such languages is the unavailability of corpus and the inadequacy of the processing tools. In this paper, we proposed an extractive lexical knowledge-rich topic modeling text summarization approach for Hindi novels and stories in which we implemented four independent variants using different sentence weighting schemes. We prepared a corpus of Hindi Novels and stories since the absence of a corpus. We used a smoothing technique for edifying and variety summaries followed by evaluating the efficacy of generated summaries against three metrics (gist diversity, retention ratio, and ROUGE score). The results manifest that the proposed model produces abridge, articulate and coherent summaries. To investigate the performance of the proposed model, we simulate the experiments on the English dataset as well. Further, we compare our models with the baselines and traditional topic modeling approach, where we show that the proposed model has confessed optimal results.
引用
收藏
页码:3275 / 3305
页数:30
相关论文
共 50 条
  • [31] Extractive Text Summarization Using Topological Features
    Kumar, Ankit
    Sarkar, Apurba
    [J]. COMBINATORIAL IMAGE ANALYSIS, IWCIA 2022, 2023, 13348 : 105 - 121
  • [32] Extractive Text Summarization using Deep Learning
    Shirwandkar, Nikhil S.
    Kulkarni, Samidha
    [J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [33] A Frequency-Driven Approach for Extractive Text Summarization
    Zadgaonkar, Ashwini, V
    [J]. INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01): : 37 - 43
  • [34] Extractive Text Summarization Using Ontology and Graph-Based Method
    Yongkiatpanich, Chuleepohn
    Wichadakul, Duangdao
    [J]. 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), 2019, : 105 - 110
  • [35] DeepSumm: Exploiting topic models and sequence to sequence networks for extractive text summarization
    Joshi, Akanksha
    Fidalgo, Eduardo
    Alegre, Enrique
    Fernandez-Robles, Laura
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 211
  • [36] Automatic Extractive Text Summarization Based on Fuzzy Logic: A Sentence Oriented Approach
    Hannah, M. Esther
    Geetha, T. V.
    Mukherjee, Saswati
    [J]. SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, PT I, 2011, 7076 : 530 - +
  • [37] Random Indexing and Modified Random Indexing based approach for extractive text summarization
    Chatterjee, Niladri
    Sahoo, Pramod Kumar
    [J]. COMPUTER SPEECH AND LANGUAGE, 2015, 29 (01): : 32 - 44
  • [38] A topic modeling based approach to novel document automatic summarization
    Wu, Zongda
    Lei, Li
    Li, Guiling
    Huang, Hui
    Zheng, Chengren
    Chen, Enhong
    Xu, Guandong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 84 : 12 - 23
  • [39] Hierarchical Summarization of Text Documents Using Topic Modeling and Formal Concept Analysis
    Akhtar, Nadeem
    Javed, Hira
    Ahmad, Tameem
    [J]. DATA MANAGEMENT, ANALYTICS AND INNOVATION, ICDMAI 2018, VOL 2, 2019, 839 : 21 - 33
  • [40] A Statistical Language Modeling Framework for Extractive Summarization of Text Documents
    Gupta P.
    Nigam S.
    Singh R.
    [J]. SN Computer Science, 4 (6)