A hybrid machine learning model for multi-document summarization

Author
Mohamed Abdel Fattah
Affiliations
[1] Department of Computer Sciences, CCSE, Taibah University, KSA
[2] Department of Electronics Technology, FIE, Helwan University
Source
Applied Intelligence | 2014 / Vol. 40
Keywords
Multi-document automatic summarization; Maximum entropy; Naive-Bayes; Support vector machine
DOI
Not available
Abstract
This work proposes an approach that uses statistical tools to improve content selection in multi-document automatic text summarization. The method uses a trainable summarizer, which takes into account several features: the similarity of words among sentences, the similarity of words among paragraphs, the text format, cue-phrases, a score related to the frequency of terms in the whole document, the title, sentence location and the occurrence of non-essential information. The effect of each of these sentence features on the summarization task is investigated. These features are then used in combination to construct text summarizer models based on a maximum entropy model, a naive-Bayes classifier, and a support vector machine. To produce the final summary, the three models are combined into a hybrid model that ranks the sentences in order of importance. The performance of this new method has been tested using the DUC 2002 data corpus. The effectiveness of this technique is measured using the ROUGE score, and the results are promising when compared with some existing techniques.
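The abstract does not say how the three models are combined or which ROUGE variant is reported, so the following is only a minimal sketch under stated assumptions: each model is assumed to output one importance score per sentence, the hybrid score is assumed to be the plain average of the three, and evaluation is illustrated with unigram ROUGE recall. The function names `hybrid_rank` and `rouge_1_recall` and all score values are hypothetical and do not come from the paper.

```python
from collections import Counter


def hybrid_rank(model_scores, top_k):
    """Average per-sentence scores across models and return the indices of
    the top_k sentences, highest combined score first.

    Assumption: simple averaging; the paper's actual combination rule is
    not given in the abstract.
    """
    n_sentences = len(model_scores[0])
    combined = [
        sum(scores[i] for scores in model_scores) / len(model_scores)
        for i in range(n_sentences)
    ]
    ranked = sorted(range(n_sentences), key=lambda i: combined[i], reverse=True)
    return ranked[:top_k]


def rouge_1_recall(candidate, reference):
    """Unigram recall: clipped word overlap divided by reference length."""
    cand_counts = Counter(candidate.lower().split())
    ref_counts = Counter(reference.lower().split())
    overlap = sum(min(cand_counts[w], c) for w, c in ref_counts.items())
    total = sum(ref_counts.values())
    return overlap / total if total else 0.0


# Hypothetical per-sentence scores from the three classifiers.
maxent_scores = [0.9, 0.2, 0.6, 0.4]
naive_bayes_scores = [0.8, 0.3, 0.5, 0.7]
svm_scores = [0.7, 0.1, 0.9, 0.3]

selected = hybrid_rank([maxent_scores, naive_bayes_scores, svm_scores], top_k=2)
recall = rouge_1_recall("the cat sat on the mat", "the cat lay on the mat")
```

With these illustrative inputs, `selected` holds the indices of the two sentences with the highest averaged score, and `recall` is the fraction of reference unigrams that also appear in the candidate summary.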
Pages: 592 - 600
Page count: 8
Related papers
  • [1] A hybrid machine learning model for multi-document summarization
    Fattah, Mohamed Abdel
    APPLIED INTELLIGENCE, 2014, 40 (04) : 592 - 600
  • [2] A Hybrid Topic Model for Multi-Document Summarization
    Xu, JinAn
    Liu, JiangMing
    Araki, Kenji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (05) : 1089 - 1094
  • [3] A Hybrid Hierarchical Model for Multi-Document Summarization
    Celikyilmaz, Asli
    Hakkani-Tur, Dilek
    ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010 : 815 - 824
  • [4] A hybrid model for sentence ordering in extractive multi-document summarization
    Liu, Dexi
    Zhang, Zengchang
    He, Yanxiang
    Ji, Donghong
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 588 - 592
  • [5] BHLM: Bayesian theory-based hybrid learning model for multi-document summarization
    Suneetha, S.
    Reddy, A. Venugopal
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2018, 9 (02)
  • [6] Multi-document Summarization for E-Learning
    Wang, Fu Lee
    Kwan, Reggie
    Hung, Sheung Lun
    HYBRID LEARNING AND EDUCATION, PROCEEDINGS, 2009, 5685 : 353 - +
  • [7] Mixture of Topic Model for Multi-document Summarization
    Liu Na
    Li Ming-xia
    Lu Ying
    Tang Xiao-jun
    Wang Hai-wen
    Xiao Peng
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014 : 5168 - 5172
  • [8] A Multi-Document Coverage Reward for RELAXed Multi-Document Summarization
    Parnell, Jacob
    Unanue, Inigo Jauregi
    Piccardi, Massimo
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022 : 5112 - 5128
  • [9] A Hybrid Solution To Abstractive Multi-Document Summarization Using Supervised and Unsupervised Learning
    Bhagchandani, Gaurav
    Bodra, Deep
    Gangan, Abhishek
    Mulla, Nikahat
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019 : 566 - 570
  • [10] A hybrid sentence ordering strategy in multi-document summarization
    He, Yanxiang
    Liu, Dexi
    Yang, Hua
    Ji, Donghong
    Teng, Chong
    Qi, Wenqing
    WEB INFORMATION SYSTEMS - WISE 2006, PROCEEDINGS, 2006, 4255 : 339 - 349