Automated Bengali Document Summarization By Collaborating Individual Word & Sentence Scoring

被引:0
|
作者
Chandro, Porimol [1 ]
Arif, Md Faizul Huq [1 ]
Rahman, Md Mahbubur [2 ]
Siddik, Md Saeed [2 ]
Rahman, Mohammad Sayeedur [2 ]
Rahman, Md Abdur [3 ]
机构
[1] WUB, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] IIT, Dhaka, Bangladesh
[3] Univ Dhaka, CARS, Dhaka, Bangladesh
关键词
Bengali Document Summarization; Text Extraction; Information Retrieval; Word Tokenization; Word Stemming; Sentence Scoring; Sentence Ranking;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bengali documents are increasing on the World Wide Web and it is becoming a overwhelming problem for the increasing large number of web users to reviewing and reduce the information. Many researches have been conducted in the field of Natural Language Processing for English documents and in order to serve with satisfactory accuracy. This research work proposed a simple and powerful extraction based method for summarizing of the Bengali text documents. The system could summarize a single document at a time. The ultimate objective of the proposed methodology helps readers to get summary and insight of the Bengali documents without reading revealing the in-depth details. In the proposed Bengali documents summary generation method there are four features: Preprocessing, Sentence Ranking and Summarization, Combining Parameters for Sentence Ranking, Summary Generator. The results of performance evaluation show that the average scores of Precision, Recall and final scores are 0.80, 0.67, and 0.72 respectively.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Assessing sentence scoring techniques for extractive text summarization
    Ferreira, Rafael
    Cabral, Luciano de Souza
    Lins, Rafael Dueire
    Pereira e Silva, Gabriel
    Freitas, Fred
    Cavalcanti, George D. C.
    Lima, Rinaldo
    Simske, Steven J.
    Favaro, Luciano
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (14) : 5755 - 5764
  • [22] Automated Text Summarization: Sentence Refinement Approach
    Jusoh, Shaidah
    Masoud, Abdulsalam M.
    Alfawareh, Hejab M.
    DIGITAL INFORMATION PROCESSING AND COMMUNICATIONS, PT 2, 2011, 189 : 207 - +
  • [23] Jointly Learning Topics in Sentence Embedding for Document Summarization
    Gao, Yang
    Xu, Yue
    Huang, Heyan
    Liu, Qian
    Wei, Linjing
    Liu, Luyang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (04) : 688 - 699
  • [24] MRS for multi-document summarization by sentence extraction
    Xu, Yong-Dong
    Zhang, Xiao-Dong
    Quan, Guang-Ri
    Wang, Ya-Dong
    TELECOMMUNICATION SYSTEMS, 2013, 53 (01) : 91 - 98
  • [25] Comparative Document Summarization via Discriminative Sentence Selection
    Wang, Dingding
    Zhu, Shenghuo
    Li, Tao
    Gong, Yihong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2012, 6 (03)
  • [26] MRS for multi-document summarization by sentence extraction
    Yong-Dong Xu
    Xiao-Dong Zhang
    Guang-Ri Quan
    Ya-Dong Wang
    Telecommunication Systems, 2013, 53 : 91 - 98
  • [27] Comparative Document Summarization via Discriminative Sentence Selection
    Wang, Dingding
    Zhu, Shenghuo
    Li, Tao
    Gong, Yihong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2013, 7 (01)
  • [28] Multi-document Summarization Based on Sentence Clustering
    Zheng, Hai-Tao
    Gong, Shu-Qin
    Chen, Hao
    Jiang, Yong
    Xia, Shu-Tao
    NEURAL INFORMATION PROCESSING (ICONIP 2014), PT II, 2014, 8835 : 429 - 436
  • [29] Multi-Document Summarization Using Sentence Clustering
    Gupta, Virendra Kumar
    Siddiqui, Tanveer J.
    4TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2012), 2012,
  • [30] Extractive summarization based on word information and sentence position
    Cruz, CM
    Urrea, AM
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 653 - 656