Unsupervised extractive multi-document summarization method based on transfer learning from BERT multi-task fine-tuning

被引：21

作者：

Lamsiyah, Salima ^{[1
]}

El Mahdaouy, Abdelkader ^{[3
]}

Ouatik, Said El Alaoui ^{[1
,2
]}

Espinasse, Bernard ^{[4
]}

机构：

[1] Sidi Mohamed Ben Abdellah Univ, FSDM, Lab Informat Signals Automat & Cognitivism, BP 1796, Fez Atlas 30003, Morocco

[2] Ibn Tofail Univ, Natl Sch Appl Sci, Lab Engn Sci, Kenitra, Morocco

[3] Mohammed VI Polytech Univ UM6P, Sch Comp Sci UM6P CS, Ben Guerir, Morocco

[4] Univ Toulon & Var, Aix Marseille Univ, CNRS, LIS,UMR 7020, Toulon, France

来源：

JOURNAL OF INFORMATION SCIENCE | 2023年 / 49卷 / 01期

关键词：

BERT fine-tuning; multi-document summarization; multi-task learning; sentence representation learning; transfer learning; SENTENCE SCORING TECHNIQUES;

D O I：

10.1177/0165551521990616

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Text representation is a fundamental cornerstone that impacts the effectiveness of several text summarization methods. Transfer learning using pre-trained word embedding models has shown promising results. However, most of these representations do not consider the order and the semantic relationships between words in a sentence, and thus they do not carry the meaning of a full sentence. To overcome this issue, the current study proposes an unsupervised method for extractive multi-document summarization based on transfer learning from BERT sentence embedding model. Moreover, to improve sentence representation learning, we fine-tune BERT model on supervised intermediate tasks from GLUE benchmark datasets using single-task and multi-task fine-tuning methods. Experiments are performed on the standard DUC'2002-2004 datasets. The obtained results show that our method has significantly outperformed several baseline methods and achieves a comparable and sometimes better performance than the recent state-of-the-art deep learning-based methods. Furthermore, the results show that fine-tuning BERT using multi-task learning has considerably improved the performance.

引用

页码：164 / 182

页数：19

共 50 条

[31] Query-oriented unsupervised multi-document summarization via deep learning model
Zhong, Sheng-hua
Liu, Yan
Li, Bin
Long, Jing
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (21) : 8146 - 8155
[32] Heuristic Initialization And Similarity Integration Based Model for Improving Extractive Multi-Document Summarization
Kadhim, Nasreen J.
Mohammed, Dheyaa Abdulameer
JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2019, 14 (05): : 330 - 350
[33] A Dialogues Summarization Algorithm Based on Multi-task Learning
Chen, Haowei
Li, Chen
Liang, Jiajing
Tian, Lihua
NEURAL PROCESSING LETTERS, 2024, 56 (03)
[34] Unsupervised Query-Focused Multi-Document Summarization using the Cross Entropy Method
Feigenblat, Guy
Roitman, Haggai
Boni, Odellia
Konopnicki, David
SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 961 - 964
[35] A Comparative Study of Deep Learning Approaches for Query-Focused Extractive Multi-Document Summarization
Yuliska
Sakai, Tetsuya
2019 IEEE 2ND INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTER TECHNOLOGIES (ICICT), 2019, : 153 - 157
[36] From coarse to fine: Enhancing multi-document summarization with multi-granularity relationship-based extractor
Zhang, Ming
Lu, Jiyu
Yang, Jiahao
Zhou, Jun
Wan, Meilin
Zhang, Xuejun
INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (03)
[37] Unsupervised domain adaptation: A multi-task learning-based method
Zhang, Jing
Li, Wanqing
Ogunbona, Philip
KNOWLEDGE-BASED SYSTEMS, 2019, 186
[38] Decomposition-based multi-objective differential evolution for extractive multi-document automatic text summarization
Wahab, Muhammad Hafizul Hazmi
Hamid, Nor Asilah Wati Abdul
Subramaniam, Shamala
Latip, Rohaya
Othman, Mohamed
APPLIED SOFT COMPUTING, 2024, 151
[39] Summarizing learning materials using graph based multi-document summarization
Krishnaveni P.
Balasundaram S.R.
International Journal of Web-Based Learning and Teaching Technologies, 2021, 16 (05) : 39 - 57
[40] An Indicator-based Multi-Objective Optimization Approach Applied to Extractive Multi-Document Text Summarization
Sanchez-Gomez, J.
Vega-Rodriguez, M.
Perez, C.
IEEE LATIN AMERICA TRANSACTIONS, 2019, 17 (08) : 1291 - 1299

← 1 2 3 4 5 →