A hybrid machine learning model for multi-document summarization

被引：0

作者：

Mohamed Abdel Fattah

机构：

[1] KSA,Department of Computer Sciences, CCSE Taibah University

[2] FIE Helwan University,Department of Electronics Technology

来源：

Applied Intelligence | 2014年 / 40卷

关键词：

Multi-document automatic summarization; Maximum entropy; Naive-Bayes; Support vector machine;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This work proposes an approach that uses statistical tools to improve content selection in multi-document automatic text summarization. The method uses a trainable summarizer, which takes into account several features: the similarity of words among sentences, the similarity of words among paragraphs, the text format, cue-phrases, a score related to the frequency of terms in the whole document, the title, sentence location and the occurrence of non-essential information. The effect of each of these sentence features on the summarization task is investigated. These features are then used in combination to construct text summarizer models based on a maximum entropy model, a naive-Bayes classifier, and a support vector machine. To produce the final summary, the three models are combined into a hybrid model that ranks the sentences in order of importance. The performance of this new method has been tested using the DUC 2002 data corpus. The effectiveness of this technique is measured using the ROUGE score, and the results are promising when compared with some existing techniques.

引用

页码：592 / 600

页数：8

共 50 条

[41] Aspect Based Multi-Document Summarization
Sahoo, Deepak
Balabantaray, Rakesh
Phukon, Mridumoni
Saikia, Saibali
2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2016, : 873 - 877
[42] Hierarchical Summarization: Scaling Up Multi-Document Summarization
Christensen, Janara
Soderland, Stephen
Bansal, Gagan
Mausam
PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 902 - 912
[43] A novel approach to multi-document summarization
Qiu, Li-Qing
Pang, Bin
Lin, Sai-Qun
Chen, Peng
DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 187 - +
[44] Hierarchical Transformers for Multi-Document Summarization
Liu, Yang
Lapata, Mirella
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5070 - 5081
[45] Query-oriented unsupervised multi-document summarization via deep learning model
Zhong, Sheng-hua
Liu, Yan
Li, Bin
Long, Jing
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (21) : 8146 - 8155
[46] Multi-document summarization using a clustering-based hybrid strategy
Nie, Yu
Ji, Donghong
Yang, Lingpeng
Niu, Zhengyu
He, Tingting
INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 608 - 614
[47] Tiered sentence based topic model for multi-document summarization
Akhtar, Nadeem
Beg, M. M. Sufyan
Javed, Hira
Hussain, Md Muzakkir
JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2022, 43 (08): : 2131 - 2141
[48] An Improved LDA Multi-Document Summarization Model Based on TensorFlow
Zhong, Ying
Tang, Zhuo
Ding, Xiaofei
Zhu, Li
Le, Yuquan
Li, Kenli
Li, Keqin
2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 255 - 259
[49] A Fuzzy-Rough Hybrid Approach to Multi-document Extractive Summarization
Huang, Hsun-Hui
Yang, Horng-Chang
Kuo, Yau-Hwang
HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 1, PROCEEDINGS, 2009, : 168 - +
[50] Research On Multi-document Summarization Based On LDA Topic Model
Bian, Jinqiang
Jiang, Zengru
Chen, Qian
2014 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL 2, 2014, : 113 - 116

← 1 2 3 4 5 →