Data Selection Curriculum for Abstractive Text Summarization

Citations: 0
Authors
Sun, Shichao [1 ]
Yuan, Ruifeng [1 ]
He, Jianfei [2 ]
Cao, Ziqiang [3 ]
Li, Wenjie [1 ]
Jia, Xiaohua [2 ]
Affiliations
[1] Hong Kong Polytech Univ, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Hong Kong, Peoples R China
[3] Soochow Univ, Suzhou, Peoples R China
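Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract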
Abstractive Text Summarization (ATS) models are commonly trained on large-scale data that is randomly shuffled. However, the impact of data selection and data ordering on ATS models remains relatively unexplored, and a significant challenge lies in accurately assessing the learning difficulty of each training instance. This study introduces a Data Selection Curriculum (DSC) scoring system that incorporates both the difficulty of improving the ATS model via an instance and the expected performance on that instance. Selectively excluding excessively simple and overly complex instances improves training efficiency. Furthermore, curriculum learning, inspired by human learners, is integrated to accelerate convergence and improve performance by gradually increasing the learning difficulty. Experimental results on the CNN/DailyMail dataset show that the approach surpasses strong baselines while using only 20% of the available instances.
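The selection-then-ordering idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the DSC scoring formula, so the code assumes a hypothetical per-instance difficulty score is already available (higher means harder). The function names `select_and_order` and `curriculum_stages`, the cut-off fractions, and the cumulative staging scheme are all assumptions made for illustration. The default fractions were picked only so that 20% of the instances remain.

```python
from typing import List, Sequence


def select_and_order(
    scores: Sequence[float],
    low_frac: float = 0.4,
    high_frac: float = 0.4,
) -> List[int]:
    """Drop the easiest and hardest instances, return the rest easiest-first.

    scores    -- hypothetical difficulty per instance (higher = harder);
                 a stand-in for the DSC score, whose formula is not given here.
    low_frac  -- assumed fraction of easiest instances to discard.
    high_frac -- assumed fraction of hardest instances to discard.
    """
    if low_frac < 0 or high_frac < 0 or low_frac + high_frac > 1:
        raise ValueError("fractions must be non-negative and sum to at most 1")
    n = len(scores)
    # Indices sorted by ascending difficulty (easiest first).
    order = sorted(range(n), key=lambda i: scores[i])
    start = int(n * low_frac)
    stop = n - int(n * high_frac)
    return order[start:stop]


def curriculum_stages(ordered: List[int], num_stages: int) -> List[List[int]]:
    """Build cumulative stages: each stage adds harder instances to the last."""
    if num_stages < 1:
        raise ValueError("num_stages must be at least 1")
    stages = []
    for k in range(1, num_stages + 1):
        cutoff = max(1, len(ordered) * k // num_stages)
        stages.append(ordered[:cutoff])
    return stages


if __name__ == "__main__":
    toy_scores = [0.9, 0.1, 0.5, 0.3, 0.7, 0.2, 0.8, 0.4, 0.6, 1.0]
    kept = select_and_order(toy_scores)
    for stage, indices in enumerate(curriculum_stages(kept, 2), start=1):
        print(f"stage {stage}: train on instances {indices}")
```

In a real training loop, each stage's index list would be used to build the data subset for that phase. How many stages to use, and whether earlier instances are revisited, are design choices the abstract does not specify.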
Pages: 7990-7995
Page count: 6