Unsupervised Abstractive Summarization of Bengali Text Documents

被引:0
|
作者
Chowdhury, Radia Rayan [1 ]
Nayeem, Mir Tafseer [1 ]
Mim, Tahsin Tasnim [1 ]
Chowdhury, Md Saifur Rahman [1 ]
Jannat, Taufiqul [1 ]
机构
[1] Ahsanullah Univ Sci & Tech, Dhaka, Bangladesh
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Abstractive summarization systems generally rely on large collections of documentsummary pairs. However, the performance of abstractive systems remains a challenge due to the unavailability of the parallel data for low-resource languages like Bengali. To overcome this problem, we propose a graph-based unsupervised abstractive summarization system in the single-document setting for Bengali text documents, which requires only a PartOf-Speech (POS) tagger and a pre-trained language model trained on Bengali texts. We also provide a human-annotated dataset with document-summary pairs to evaluate our abstractive model and to support the comparison of future abstractive summarization systems of the Bengali Language. We conduct experiments on this dataset and compare our system with several well-established unsupervised extractive summarization systems. Our unsupervised abstractive summarization model outperforms the baselines without being exposed to any human-annotated reference summaries.(1)
引用
收藏
页码:2612 / 2619
页数:8
相关论文
共 50 条
  • [1] Sentence Similarity Measurement for Bengali Abstractive Text Summarization
    Masum, Abu Kaisar Mohammad
    Abujar, Sheikh
    Tusher, Raja Tariqul Hasan
    Faisal, Fahad
    Hossain, Syed Akhter
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [2] Bengali abstractive text summarization using sequence to sequence RNNs
    Talukder, Md Ashraful Islam
    Abujar, Sheikh
    Masum, Abu Kaisar Mohammad
    Faisal, Fahad
    Hossain, Syed Akhter
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [3] Unsupervised Abstractive Text Summarization with Length Controlled Autoencoder
    Dugar, Abhinav
    Singh, Gaurav
    Navyasree, B.
    Kumar, Anand M.
    [J]. 2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [4] Domain-Aware Abstractive Text Summarization for Medical Documents
    Gigioli, Paul
    Sagar, Nikhita
    Voyles, Joseph
    Rao, Anand
    [J]. PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 1155 - 1162
  • [5] Domain-Aware Abstractive Text Summarization for Medical Documents
    Gigioli, Paul
    Sagar, Nikhita
    Voyles, Joseph
    Rao, Anand
    [J]. PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 2338 - 2343
  • [6] Review on Abstractive Text Summarization Techniques (ATST) for single and multi documents
    Modi, Shivangi
    Oza, Rachana
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTING, POWER AND COMMUNICATION TECHNOLOGIES (GUCON), 2018, : 1173 - 1176
  • [7] An approach to Abstractive Text Summarization
    Huong Thanh Le
    Tien Manh Le
    [J]. 2013 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2013, : 371 - 376
  • [8] Abstractive text summarization for Hungarian
    Yang, Zijian Gyozo
    Agocs, Adam
    Kusper, Gabor
    Varadi, Tamas
    [J]. ANNALES MATHEMATICAE ET INFORMATICAE, 2021, 53 : 299 - 316
  • [9] A Survey on Abstractive Text Summarization
    Moratanch, N.
    Chitrakala, S.
    [J]. PROCEEDINGS OF IEEE INTERNATIONAL CONFERENCE ON CIRCUIT, POWER AND COMPUTING TECHNOLOGIES (ICCPCT 2016), 2016,
  • [10] Survey on Abstractive Text Summarization
    Raphal, Nithin
    Duwarah, Hemanta
    Daniel, Philemon
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 513 - 517