Automated Bengali Document Summarization By Collaborating Individual Word & Sentence Scoring

被引:0
|
作者
Chandro, Porimol [1 ]
Arif, Md Faizul Huq [1 ]
Rahman, Md Mahbubur [2 ]
Siddik, Md Saeed [2 ]
Rahman, Mohammad Sayeedur [2 ]
Rahman, Md Abdur [3 ]
机构
[1] WUB, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] IIT, Dhaka, Bangladesh
[3] Univ Dhaka, CARS, Dhaka, Bangladesh
关键词
Bengali Document Summarization; Text Extraction; Information Retrieval; Word Tokenization; Word Stemming; Sentence Scoring; Sentence Ranking;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bengali documents are increasing on the World Wide Web and it is becoming a overwhelming problem for the increasing large number of web users to reviewing and reduce the information. Many researches have been conducted in the field of Natural Language Processing for English documents and in order to serve with satisfactory accuracy. This research work proposed a simple and powerful extraction based method for summarizing of the Bengali text documents. The system could summarize a single document at a time. The ultimate objective of the proposed methodology helps readers to get summary and insight of the Bengali documents without reading revealing the in-depth details. In the proposed Bengali documents summary generation method there are four features: Preprocessing, Sentence Ranking and Summarization, Combining Parameters for Sentence Ranking, Summary Generator. The results of performance evaluation show that the average scores of Precision, Recall and final scores are 0.80, 0.67, and 0.72 respectively.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A fusion of variants of sentence scoring methods and collaborative word rankings for document summarization
    Verma, Pradeepika
    Verma, Anshul
    Pal, Sukomal
    EXPERT SYSTEMS, 2022, 39 (06)
  • [2] Single document summarization using word and sentence embeddings
    Ayana
    PROCEEDINGS OF THE 2015 JOINT INTERNATIONAL MECHANICAL, ELECTRONIC AND INFORMATION TECHNOLOGY CONFERENCE (JIMET 2015), 2015, 10 : 523 - 526
  • [3] Automated Bangla Text Summarization by Sentence Scoring and Ranking
    Efat, Md. Iftekharul Alam
    Ibrahim, Mohammad
    Kayesh, Humayun
    2013 INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV), 2013,
  • [4] Subtopic-focused sentence scoring in multi-document summarization
    Li Sujian
    Qu Weiguang
    ALPIT 2007: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, 2007, : 98 - +
  • [5] A Joint Sentence Scoring and Selection Framework for Neural Extractive Document Summarization
    Zhou, Qingyu
    Yang, Nan
    Wei, Furu
    Huang, Shaohan
    Zhou, Ming
    Zhao, Tiejun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 671 - 681
  • [6] Document Summarization Using Sentence-Level Semantic Based on Word Embeddings
    Al-Sabahi, Kamal
    Zhang Zuping
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2019, 29 (02) : 177 - 196
  • [7] Sentence Similarity Measurement for Bengali Abstractive Text Summarization
    Masum, Abu Kaisar Mohammad
    Abujar, Sheikh
    Tusher, Raja Tariqul Hasan
    Faisal, Fahad
    Hossain, Syed Akhter
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [8] Assessing shallow sentence scoring techniques and combinations for single and multi-document summarization
    Oliveira, Hilario
    Ferreira, Rafael
    Lima, Rinaldo
    Lins, Rafael Dueire
    Freitas, Fred
    Riss, Marcelo
    Simske, Steven J.
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 65 : 68 - 86
  • [9] DOCUMENT SUMMARIZATION IN MALAYALAM WITH SENTENCE FRAMING
    Kishore, Kavya
    Gopal, Greeshma N.
    Neethu, P. H.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE (ICIS), 2016, : 194 - 200
  • [10] Document Summarization Using Sentence Features
    Rautray, Rasmita
    Balabantaray, Rakesh Chandra
    Bhardwaj, Anisha
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2015, 5 (01) : 36 - 47