Data Selection Curriculum for Abstractive Text Summarization

被引:0
|
作者
Sun, Shichao [1 ]
Yuan, Ruifeng [1 ]
He, Jianfei [2 ]
Cao, Ziqiang [3 ]
Li, Wenjie [1 ]
Jia, Xiaohua [2 ]
机构
[1] Hong Kong Polytech Univ, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Hong Kong, Peoples R China
[3] Soochow Univ, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Abstractive Text Summarization (ATS) models are commonly trained using large-scale data that is randomly shuffled. However, the impact of data selection and data ordering on ATS models remains a relatively unexplored research area, where a significant challenge lies in accurately assessing the learning difficulty of each training instance. This study introduces a Data Selection Curriculum (DSC) scoring system that incorporates both the difficulty of improving ATS model via an instance and the expected performance on this instance. By selectively excluding excessively simple and overly complex instances, the training efficiency can be optimized. Furthermore, curriculum learning is integrated to accelerate convergence and improve performance by gradually increasing the learning difficulty, inspired by human learners. Experimental results on the CNN/DailyMail dataset demonstrate that our approach surpasses potent baselines, utilizing a mere 20% of the available instances.
引用
收藏
页码:7990 / 7995
页数:6
相关论文
共 50 条
  • [21] Highlighted Word Encoding for Abstractive Text Summarization
    Lal, Daisy Monika
    Singh, Krishna Pratap
    Tiwary, Uma Shanker
    INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2019), 2020, 11886 : 77 - 86
  • [22] Abstractive Text Summarization by Incorporating Reader Comments
    Gao, Shen
    Chen, Xiuying
    Li, Piji
    Ren, Zhaochun
    Bing, Lidong
    Zhao, Dongyan
    Yan, Rui
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6399 - 6406
  • [23] Improving Abstractive Text Summarization with History Aggregation
    Liao, Pengcheng
    Zhang, Chuang
    Chen, Xiaojun
    Zhou, Xiaofei
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [24] A global and local information extraction model incorporating selection mechanism for abstractive text summarization
    Li, Yuanyuan
    Huang, Yuan
    Huang, Weijian
    Wang, Wei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (2) : 4859 - 4886
  • [25] A global and local information extraction model incorporating selection mechanism for abstractive text summarization
    Yuanyuan Li
    Yuan Huang
    Weijian Huang
    Wei Wang
    Multimedia Tools and Applications, 2024, 83 : 4859 - 4886
  • [26] Exploring Explainable Selection to Control Abstractive Summarization
    Wang Haonan
    Gao Yang
    Bai Yu
    Lapata, Mirella
    Huang Heyan
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 13933 - 13941
  • [27] ATSSI: Abstractive Text Summarization using Sentiment Infusion
    Bhargava, Rupal
    Sharma, Yashvardhan
    Sharma, Gargi
    TWELFTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2016 / TWELFTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2016 / TWELFTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2016, 2016, 89 : 404 - 411
  • [28] Unsupervised Abstractive Text Summarization with Length Controlled Autoencoder
    Dugar, Abhinav
    Singh, Gaurav
    Navyasree, B.
    Kumar, Anand M.
    2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [29] Abstractive Text Summarization Based on Semantic Alignment Network
    Wu S.
    Huang D.
    Li J.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2021, 57 (01): : 1 - 6
  • [30] Abstractive Text Summarization Using Enhanced Attention Model
    Roul, Rajendra Kumar
    Joshi, Pratik Madhav
    Sahoo, Jajati Keshari
    INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2019), 2020, 11886 : 63 - 76