Universal Language Model Fine-tuning for Text Classification

被引:0
|
作者
Howard, Jeremy [1 ]
Ruder, Sebastian [2 ,3 ]
机构
[1] Univ San Francisco, Fast Ai, San Francisco, CA 94117 USA
[2] NUI Galway, Insight Ctr, Galway, Ireland
[3] Aylien Ltd, Dublin, Ireland
基金
爱尔兰科学基金会;
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Universal Language Model Fine-tuning (ULMFiT), an effective transfer learning method that can be applied to any task in NLP, and introduce techniques that are key for fine-tuning a language model. Our method significantly outperforms the state-of-the-art on six text classification tasks, reducing the error by 18-24% on the majority of datasets. Furthermore, with only 100 labeled examples, it matches the performance of training from scratch on 100x more data. We opensource our pretrained models and code(1).
引用
收藏
页码:328 / 339
页数:12
相关论文
共 50 条
  • [41] Fine-tuning Convolutional Neural Networks for fine art classification
    Cetinic, Eva
    Lipic, Tomislav
    Grgic, Sonja
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 114 : 107 - 118
  • [42] Guided Recommendation for Model Fine-Tuning
    Li, Hao
    Fowlkes, Charless
    Yang, Hao
    Dabeer, Onkar
    Tu, Zhuowen
    Soatto, Stefano
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 3633 - 3642
  • [43] Model Editing by Standard Fine-Tuning
    Gangadhar, Govind
    Stratos, Karl
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 5907 - 5913
  • [44] Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model
    Zhang, Hengyuan
    Wu, Yanru
    Li, Dawei
    Yang, Sak
    Zhao, Rui
    Jiang, Yong
    Tan, Fei
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 7467 - 7509
  • [45] Fine-tuning
    不详
    AVIATION WEEK & SPACE TECHNOLOGY, 2001, 155 (02): : 21 - 21
  • [46] Fine-Tuning a Large Language Model with Reinforcement Learning for Educational Question Generation
    Lamsiyah, Salima
    El Mahdaouy, Abdelkader
    Nourbakhsh, Aria
    Schommer, Christoph
    ARTIFICIAL INTELLIGENCE IN EDUCATION, PT I, AIED 2024, 2024, 14829 : 424 - 438
  • [47] WalkLM: A Uniform Language Model Fine-tuning Framework for Attributed Graph Embedding
    Tan, Yanchao
    Zhou, Zihao
    Lv, Hang
    Liu, Weiming
    Yang, Carl
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [48] Fine-tuning
    Rachel Smallridge
    Nature Reviews Molecular Cell Biology, 2004, 5 (2) : 79 - 79
  • [49] Fine-Tuning
    Manson, Neil A.
    TPM-THE PHILOSOPHERS MAGAZINE, 2019, (86): : 99 - 105
  • [50] Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning
    Xu, Runxin
    Luo, Fuli
    Zhang, Zhiyuan
    Tan, Chuanqi
    Chang, Baobao
    Huang, Songfang
    Huang, Fei
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 9514 - 9528