A hybrid machine learning model for multi-document summarization

Cited by: 0
Author
Mohamed Abdel Fattah
Affiliations
[1] Department of Computer Sciences, CCSE, Taibah University, KSA
[2] Department of Electronics Technology, FIE, Helwan University
Source
Applied Intelligence | 2014 / Vol. 40
Keywords
Multi-document automatic summarization; Maximum entropy; Naive-Bayes; Support vector machine;
DOI
Not available
Abstract
This work proposes an approach that uses statistical tools to improve content selection in multi-document automatic text summarization. The method uses a trainable summarizer, which takes into account several features: the similarity of words among sentences, the similarity of words among paragraphs, the text format, cue-phrases, a score related to the frequency of terms in the whole document, the title, sentence location and the occurrence of non-essential information. The effect of each of these sentence features on the summarization task is investigated. These features are then used in combination to construct text summarizer models based on a maximum entropy model, a naive-Bayes classifier, and a support vector machine. To produce the final summary, the three models are combined into a hybrid model that ranks the sentences in order of importance. The performance of this new method has been tested using the DUC 2002 data corpus. The effectiveness of this technique is measured using the ROUGE score, and the results are promising when compared with some existing techniques.
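The abstract does not say how the three models are combined, so the following is only a minimal sketch of one possible scheme: it assumes each model has already produced a per-sentence importance score in [0, 1] and averages those scores to rank the sentences. The function name `rank_sentences`, the averaging rule, and the score values are illustrative assumptions, not the author's method or data.

```python
from statistics import mean


def rank_sentences(scores_by_model):
    """Rank sentence indices by the mean of the per-model scores.

    scores_by_model maps a model name to a list of per-sentence scores.
    Averaging is an ASSUMED combination rule; the abstract does not
    specify how the hybrid model merges its three components.
    """
    per_model = list(scores_by_model.values())
    n = len(per_model[0])
    if any(len(s) != n for s in per_model):
        raise ValueError("every model must score the same number of sentences")
    # One combined score per sentence: the mean across the models.
    combined = [mean(column) for column in zip(*per_model)]
    # Highest combined score first; ties keep original document order.
    order = sorted(range(n), key=lambda i: -combined[i])
    return order, combined


# Hypothetical scores for four sentences (made up for illustration).
scores = {
    "maxent": [0.9, 0.2, 0.6, 0.4],
    "naive_bayes": [0.8, 0.3, 0.7, 0.1],
    "svm": [0.7, 0.1, 0.8, 0.3],
}
order, combined = rank_sentences(scores)

# Keep the two top-ranked sentences and restore document order.
summary_ids = sorted(order[:2])
print(order, summary_ids)
```

Other combination rules (weighted averaging, majority voting, rank fusion) would fit the same interface; only the line that computes `combined` would change.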
Pages: 592-600
Page count: 8
Related papers
50 in total
  • [21] Learning to Estimate the Importance of Sentences for Multi-Document Summarization
    Minh-Tien Nguyen
    Thi-Hai-Nang Nguyen
    Hoang-Diep Nguyen
    Van-Hau Nguyen
    PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2018, : 31 - 36
  • [22] Automated Multi-Document Biomedical Text Summarization Using Deep Learning Model
    Almasoud, Ahmed S.
    Hassine, Siwar Ben Haj
    Al-Wesabi, Fahd N.
    Nour, Mohamed K.
    Hilal, Anwer Mustafa
    Al Duhayyim, Mesfer
    Hamza, Manar Ahmed
    Motwakel, Abdelwahed
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (03): : 5799 - 5815
  • [23] A Scoring Model Assisted by Frequency for Multi-Document Summarization
    Yu, Yue
    Wu, Mutong
    Su, Weifeng
    Cheung, Yiu-ming
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 309 - 320
  • [24] A novel contextual topic model for multi-document summarization
    Yang, Guangbing
    Wen, Dunwei
    Kinshuk
    Chen, Nian-Shing
    Sutinen, Erkki
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (03) : 1340 - 1352
  • [25] Nutri-bullets Hybrid: Consensual Multi-document Summarization
    Shah, Darsh J.
    Yu, Lili
    Lei, Tao
    Barzilay, Regina
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5213 - 5222
  • [26] An adjacency model for sentence ordering in multi-document summarization
    Nie, Yu
    Ji, Donghong
    Yang, Lingpeng
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 313 - 322
  • [27] A NEW MODEL FOR ARABIC MULTI-DOCUMENT TEXT SUMMARIZATION
    Abu Maria, Khulood
    Jaber, Khalid Mohammad
    Ibrahim, Mossab Nabil
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2018, 14 (04): : 1443 - 1452
  • [28] Multi-document Summarization via Deep Learning Techniques: A Survey
    Ma, Congbo
    Zhang, Wei Emma
    Guo, Mingyu
    Wang, Hu
    Sheng, Quan Z.
    ACM COMPUTING SURVEYS, 2023, 55 (05)
  • [29] Abstractive Multi-document Summarization Using Deep Learning Approaches
    Poornima, Murkute
    Pulipati, Venkateswara Rao
    Kumar, T. Sunil
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND COMMUNICATION SYSTEMS, ICACECS 2021, 2022, : 57 - 68
  • [30] A preference learning approach to sentence ordering for multi-document summarization
    Bollegala, Danushka
    Okazaki, Naoaki
    Ishizuka, Mitsuru
    INFORMATION SCIENCES, 2012, 217 : 78 - 95