An extractive text summarization approach using tagged-LDA based topic modeling

被引:0
|
作者
Ruby Rani
D. K. Lobiyal
机构
[1] Jawaharlal Nehru University,School of Computer & Systems Sciences
来源
关键词
Topic modeling; Hindi novel; Topic diversity; Retention ratio; Tagged-LDA;
D O I
暂无
中图分类号
学科分类号
摘要
Automatic text summarization is an exertion of contriving the abridged form of a text document covering salient knowledge. Numerous statistical, linguistic, rule-based, and position-based text summarization approaches have been explored for different rich-resourced languages. For under-resourced languages such as Hindi, automatic text summarization is a challenging task and still an unsolved problem. Another issue with such languages is the unavailability of corpus and the inadequacy of the processing tools. In this paper, we proposed an extractive lexical knowledge-rich topic modeling text summarization approach for Hindi novels and stories in which we implemented four independent variants using different sentence weighting schemes. We prepared a corpus of Hindi Novels and stories since the absence of a corpus. We used a smoothing technique for edifying and variety summaries followed by evaluating the efficacy of generated summaries against three metrics (gist diversity, retention ratio, and ROUGE score). The results manifest that the proposed model produces abridge, articulate and coherent summaries. To investigate the performance of the proposed model, we simulate the experiments on the English dataset as well. Further, we compare our models with the baselines and traditional topic modeling approach, where we show that the proposed model has confessed optimal results.
引用
收藏
页码:3275 / 3305
页数:30
相关论文
共 50 条
  • [1] An extractive text summarization approach using tagged-LDA based topic modeling
    Rani, Ruby
    Lobiyal, D. K.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (03) : 3275 - 3305
  • [2] Extractive text summarization using clustering-based topic modeling
    Belwal, Ramesh Chandra
    Rai, Sawan
    Gupta, Atul
    [J]. SOFT COMPUTING, 2023, 27 (07) : 3965 - 3982
  • [3] Extractive text summarization using clustering-based topic modeling
    Ramesh Chandra Belwal
    Sawan Rai
    Atul Gupta
    [J]. Soft Computing, 2023, 27 : 3965 - 3982
  • [4] Topic Modeling Based Text Summarization Approach
    Yu, Shusi
    Wang, Wei
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING APPLICATIONS (CSEA 2015), 2015, : 203 - 207
  • [5] A new graph-based extractive text summarization using keywords or topic modeling
    Ramesh Chandra Belwal
    Sawan Rai
    Atul Gupta
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 8975 - 8990
  • [6] A new graph-based extractive text summarization using keywords or topic modeling
    Belwal, Ramesh Chandra
    Rai, Sawan
    Gupta, Atul
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (10) : 8975 - 8990
  • [7] A topic modeled unsupervised approach to single document extractive text summarization
    Srivastava, Ridam
    Singh, Prabhav
    Rana, K. P. S.
    Kumar, Vineet
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 246
  • [8] A Framework for Extractive Text Summarization using Semantic Graph Based Approach
    Ullah, Shofi
    Al Islam, A. B. M. Alim
    [J]. 2019 6TH INTERNATIONAL CONFERENCE ON NETWORKING, SYSTEMS AND SECURITY (NSYSS 2019), 2019, : 48 - 55
  • [9] Extractive text summarization using deep learning approach
    Yadav A.K.
    Singh A.
    Dhiman M.
    Vineet
    Kaundal R.
    Verma A.
    Yadav D.
    [J]. International Journal of Information Technology, 2022, 14 (5) : 2407 - 2415
  • [10] ANALYSING FUZZY BASED APPROACH FOR EXTRACTIVE TEXT SUMMARIZATION
    Sharaff, Aakanksha
    Khaire, Amit Siddharth
    Sharma, Dimple
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 906 - 910