Automated Bengali Document Summarization By Collaborating Individual Word & Sentence Scoring

Cited by: 0
Authors
Chandro, Porimol [1]
Arif, Md Faizul Huq [1]
Rahman, Md Mahbubur [2]
Siddik, Md Saeed [2]
Rahman, Mohammad Sayeedur [2]
Rahman, Md Abdur [3]
Affiliations
[1] WUB, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] IIT, Dhaka, Bangladesh
[3] Univ Dhaka, CARS, Dhaka, Bangladesh
Keywords
Bengali Document Summarization; Text Extraction; Information Retrieval; Word Tokenization; Word Stemming; Sentence Scoring; Sentence Ranking
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The number of Bengali documents on the World Wide Web is increasing, and reviewing and condensing this information is becoming an overwhelming problem for the growing number of web users. Much research in Natural Language Processing has been conducted on English documents, with satisfactory accuracy. This research work proposes a simple and powerful extraction-based method for summarizing Bengali text documents. The system summarizes a single document at a time. The ultimate objective of the proposed methodology is to help readers get a summary of, and insight into, Bengali documents without reading the in-depth details. The proposed Bengali document summary generation method has four features: Preprocessing, Sentence Ranking and Summarization, Combining Parameters for Sentence Ranking, and Summary Generator. The performance evaluation shows that the average Precision, Recall, and final scores are 0.80, 0.67, and 0.72, respectively.
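The abstract does not give the scoring formulas, so the following is only a minimal sketch of a generic extraction-based summarizer in the spirit of the steps it names (preprocessing, sentence scoring, ranking, summary generation). It is not the authors' method: the stop-word list, the average-word-frequency score, and the function names are all assumptions made for illustration, and word stemming is omitted.

```python
import re
from collections import Counter

# Hypothetical, tiny stop-word list for illustration only (not from the paper).
STOP_WORDS = {"এবং", "ও", "এই", "যে", "একটি"}


def split_sentences(text):
    """Split on the Bengali danda (।) as well as '?' and '!'."""
    parts = re.split(r"[।?!]", text)
    return [p.strip() for p in parts if p.strip()]


def tokenize(sentence):
    """Whitespace tokenization with surrounding punctuation stripped (no stemming)."""
    words = (w.strip(",;:\"'()-") for w in sentence.split())
    return [w for w in words if w and w not in STOP_WORDS]


def summarize(text, max_sentences=2):
    """Return the top-scoring sentences, kept in their original document order."""
    sentences = split_sentences(text)
    # Word score (assumed): raw frequency of the word across the document.
    freq = Counter(w for s in sentences for w in tokenize(s))

    def score(sentence):
        words = tokenize(sentence)
        # Sentence score (assumed): average word frequency, so that
        # long sentences are not favored automatically.
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]),
                    reverse=True)
    chosen = sorted(ranked[:max_sentences])  # restore document order
    return [sentences[i] for i in chosen]
```

Calling `summarize(document_text, max_sentences=3)` on a single document returns up to three sentences. A fuller pipeline would add a Bengali stemmer and combine several scoring parameters, as the keywords and abstract suggest.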
Number of pages: 6