Automated Bengali Document Summarization By Collaborating Individual Word & Sentence Scoring

Cited by: 0
Authors
Chandro, Porimol [1]
Arif, Md Faizul Huq [1]
Rahman, Md Mahbubur [2]
Siddik, Md Saeed [2]
Rahman, Mohammad Sayeedur [2]
Rahman, Md Abdur [3]
Affiliations
[1] WUB, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] IIT, Dhaka, Bangladesh
[3] Univ Dhaka, CARS, Dhaka, Bangladesh
Keywords
Bengali Document Summarization; Text Extraction; Information Retrieval; Word Tokenization; Word Stemming; Sentence Scoring; Sentence Ranking;
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The number of Bengali documents on the World Wide Web is increasing, and reviewing and condensing this information is becoming an overwhelming problem for the growing number of web users. Much research in Natural Language Processing has been conducted for English documents, with satisfactory accuracy. This research work proposes a simple and powerful extraction-based method for summarizing Bengali text documents. The system summarizes a single document at a time. The ultimate objective of the proposed methodology is to help readers get a summary of, and insight into, Bengali documents without reading the in-depth details. The proposed Bengali summary generation method has four components: Preprocessing, Sentence Ranking and Summarization, Combining Parameters for Sentence Ranking, and Summary Generator. The performance evaluation shows average Precision, Recall, and final scores of 0.80, 0.67, and 0.72, respectively.
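As a rough illustration of what extraction-based summarization by word and sentence scoring can look like, the sketch below scores each sentence by the summed frequency of its words and keeps the top-ranked sentences in their original order. It is not the authors' method: the abstract does not give the scoring formula, so the frequency-sum score, the tiny stop-word list, and the function names (`split_sentences`, `tokenize`, `summarize`) are all assumptions made for this example, and stemming is omitted.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (illustration only).
STOP_WORDS = {"এবং", "ও", "and", "the"}


def split_sentences(text):
    """Split after the Bengali danda or ?, !, . when followed by whitespace."""
    parts = re.split(r"(?<=[।?!.])\s+", text.strip())
    return [p for p in parts if p]


def tokenize(sentence):
    """Replace punctuation with spaces, split on whitespace, drop stop words."""
    cleaned = re.sub(r"[।?!.,;:()]", " ", sentence)
    return [w for w in cleaned.split() if w not in STOP_WORDS]


def summarize(text, max_sentences=2):
    """Return the highest-scoring sentences, kept in document order."""
    sentences = split_sentences(text)
    tokens = [tokenize(s) for s in sentences]
    # Word score (assumed): how often the word occurs in the whole document.
    freq = Counter(w for toks in tokens for w in toks)
    # Sentence score (assumed): sum of the scores of its words.
    scores = [sum(freq[w] for w in toks) for toks in tokens]
    # Rank by score, breaking ties in favour of earlier sentences.
    ranked = sorted(range(len(sentences)), key=lambda i: (-scores[i], i))
    chosen = sorted(ranked[:max_sentences])
    return [sentences[i] for i in chosen]
```

Calling `summarize(text, max_sentences=1)` returns a one-sentence extract. A plain frequency sum favours longer sentences, so a real system would likely normalize by length or combine further parameters.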
Pages: 6