A hybrid machine learning model for multi-document summarization

Citations: 0
Authors
Mohamed Abdel Fattah
Affiliations
[1] Department of Computer Sciences, CCSE, Taibah University, KSA
[2] Department of Electronics Technology, FIE, Helwan University
Source
Applied Intelligence | 2014 / Vol. 40
Keywords
Multi-document automatic summarization; Maximum entropy; Naive-Bayes; Support vector machine
DOI
Not available
Abstract
This work proposes an approach that uses statistical tools to improve content selection in multi-document automatic text summarization. The method uses a trainable summarizer, which takes into account several features: the similarity of words among sentences, the similarity of words among paragraphs, the text format, cue-phrases, a score related to the frequency of terms in the whole document, the title, sentence location and the occurrence of non-essential information. The effect of each of these sentence features on the summarization task is investigated. These features are then used in combination to construct text summarizer models based on a maximum entropy model, a naive-Bayes classifier, and a support vector machine. To produce the final summary, the three models are combined into a hybrid model that ranks the sentences in order of importance. The performance of this new method has been tested using the DUC 2002 data corpus. The effectiveness of this technique is measured using the ROUGE score, and the results are promising when compared with some existing techniques.
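The abstract does not say how the three models are combined, so the following is only a minimal sketch of one plausible scheme: each model assigns every sentence a score in [0, 1], the scores are averaged, and the top-ranked sentences are returned in document order. The function name `hybrid_rank`, the averaging rule, and the example scores are all assumptions made for illustration, not details taken from the paper.

```python
from statistics import mean


def hybrid_rank(scores_by_model, top_k):
    """Return indices of the top_k sentences, in document order.

    scores_by_model maps a model name to a list of per-sentence scores.
    Averaging the scores is an assumed combination rule, not the paper's.
    """
    score_lists = list(scores_by_model.values())
    n_sentences = len(score_lists[0])
    if any(len(scores) != n_sentences for scores in score_lists):
        raise ValueError("every model must score the same number of sentences")

    # Combine the models by taking the mean score for each sentence.
    combined = [mean(scores[i] for scores in score_lists)
                for i in range(n_sentences)]

    # Rank sentences from most to least important.
    ranked = sorted(range(n_sentences), key=lambda i: combined[i], reverse=True)

    # Keep the best top_k and restore their original order for readability.
    return sorted(ranked[:top_k])


# Hypothetical scores for four sentences from the three classifiers.
scores = {
    "maxent": [0.9, 0.2, 0.6, 0.1],
    "naive_bayes": [0.8, 0.3, 0.7, 0.2],
    "svm": [0.7, 0.1, 0.9, 0.3],
}
selected = hybrid_rank(scores, top_k=2)
print(selected)
```

In practice the per-sentence scores would come from classifiers trained on the sentence features listed in the abstract; a weighted average or a voting scheme would be equally reasonable alternatives to the plain mean used here.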
Pages: 592-600
Page count: 8