Unsupervised extractive multi-document summarization method based on transfer learning from BERT multi-task fine-tuning

被引:21
|
作者
Lamsiyah, Salima [1 ]
El Mahdaouy, Abdelkader [3 ]
Ouatik, Said El Alaoui [1 ,2 ]
Espinasse, Bernard [4 ]
机构
[1] Sidi Mohamed Ben Abdellah Univ, FSDM, Lab Informat Signals Automat & Cognitivism, BP 1796, Fez Atlas 30003, Morocco
[2] Ibn Tofail Univ, Natl Sch Appl Sci, Lab Engn Sci, Kenitra, Morocco
[3] Mohammed VI Polytech Univ UM6P, Sch Comp Sci UM6P CS, Ben Guerir, Morocco
[4] Univ Toulon & Var, Aix Marseille Univ, CNRS, LIS,UMR 7020, Toulon, France
关键词
BERT fine-tuning; multi-document summarization; multi-task learning; sentence representation learning; transfer learning; SENTENCE SCORING TECHNIQUES;
D O I
10.1177/0165551521990616
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text representation is a fundamental cornerstone that impacts the effectiveness of several text summarization methods. Transfer learning using pre-trained word embedding models has shown promising results. However, most of these representations do not consider the order and the semantic relationships between words in a sentence, and thus they do not carry the meaning of a full sentence. To overcome this issue, the current study proposes an unsupervised method for extractive multi-document summarization based on transfer learning from BERT sentence embedding model. Moreover, to improve sentence representation learning, we fine-tune BERT model on supervised intermediate tasks from GLUE benchmark datasets using single-task and multi-task fine-tuning methods. Experiments are performed on the standard DUC'2002-2004 datasets. The obtained results show that our method has significantly outperformed several baseline methods and achieves a comparable and sometimes better performance than the recent state-of-the-art deep learning-based methods. Furthermore, the results show that fine-tuning BERT using multi-task learning has considerably improved the performance.
引用
收藏
页码:164 / 182
页数:19
相关论文
共 50 条
  • [31] Query-oriented unsupervised multi-document summarization via deep learning model
    Zhong, Sheng-hua
    Liu, Yan
    Li, Bin
    Long, Jing
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (21) : 8146 - 8155
  • [32] Heuristic Initialization And Similarity Integration Based Model for Improving Extractive Multi-Document Summarization
    Kadhim, Nasreen J.
    Mohammed, Dheyaa Abdulameer
    JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2019, 14 (05): : 330 - 350
  • [33] A Dialogues Summarization Algorithm Based on Multi-task Learning
    Chen, Haowei
    Li, Chen
    Liang, Jiajing
    Tian, Lihua
    NEURAL PROCESSING LETTERS, 2024, 56 (03)
  • [34] Unsupervised Query-Focused Multi-Document Summarization using the Cross Entropy Method
    Feigenblat, Guy
    Roitman, Haggai
    Boni, Odellia
    Konopnicki, David
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 961 - 964
  • [35] A Comparative Study of Deep Learning Approaches for Query-Focused Extractive Multi-Document Summarization
    Yuliska
    Sakai, Tetsuya
    2019 IEEE 2ND INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTER TECHNOLOGIES (ICICT), 2019, : 153 - 157
  • [36] From coarse to fine: Enhancing multi-document summarization with multi-granularity relationship-based extractor
    Zhang, Ming
    Lu, Jiyu
    Yang, Jiahao
    Zhou, Jun
    Wan, Meilin
    Zhang, Xuejun
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (03)
  • [37] Unsupervised domain adaptation: A multi-task learning-based method
    Zhang, Jing
    Li, Wanqing
    Ogunbona, Philip
    KNOWLEDGE-BASED SYSTEMS, 2019, 186
  • [38] Decomposition-based multi-objective differential evolution for extractive multi-document automatic text summarization
    Wahab, Muhammad Hafizul Hazmi
    Hamid, Nor Asilah Wati Abdul
    Subramaniam, Shamala
    Latip, Rohaya
    Othman, Mohamed
    APPLIED SOFT COMPUTING, 2024, 151
  • [39] Summarizing learning materials using graph based multi-document summarization
    Krishnaveni P.
    Balasundaram S.R.
    International Journal of Web-Based Learning and Teaching Technologies, 2021, 16 (05) : 39 - 57
  • [40] An Indicator-based Multi-Objective Optimization Approach Applied to Extractive Multi-Document Text Summarization
    Sanchez-Gomez, J.
    Vega-Rodriguez, M.
    Perez, C.
    IEEE LATIN AMERICA TRANSACTIONS, 2019, 17 (08) : 1291 - 1299