Universal Language Model Fine-tuning for Text Classification

Cited by: 0
Authors
Howard, Jeremy [1]
Ruder, Sebastian [2, 3]
Affiliations
[1] Univ San Francisco, Fast Ai, San Francisco, CA 94117 USA
[2] NUI Galway, Insight Ctr, Galway, Ireland
[3] Aylien Ltd, Dublin, Ireland
Funding
Science Foundation Ireland
Keywords
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Universal Language Model Fine-tuning (ULMFiT), an effective transfer learning method that can be applied to any task in NLP, and introduce techniques that are key for fine-tuning a language model. Our method significantly outperforms the state-of-the-art on six text classification tasks, reducing the error by 18-24% on the majority of datasets. Furthermore, with only 100 labeled examples, it matches the performance of training from scratch on 100x more data. We open-source our pretrained models and code.
Pages: 328-339
Page count: 12
Related papers
50 in total
  • [1] Efficient fine-tuning of short text classification based on large language model
    Wang, Likun
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MODELING, NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING, CMNM 2024, 2024, : 33 - 38
  • [2] Patent classification by fine-tuning BERT language model
    Lee, Jieh-Sheng
    Hsiang, Jieh
    WORLD PATENT INFORMATION, 2020, 61
  • [3] Extreme Fine-tuning: A Novel and Fast Fine-tuning Approach for Text Classification
    Jiaramaneepinit, Boonnithi
    Chay-intr, Thodsaporn
    Funakoshi, Kotaro
    Okumura, Manabu
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2: SHORT PAPERS, 2024, : 368 - 379
  • [4] Enhanced Discriminative Fine-Tuning of Large Language Models for Chinese Text Classification
    Song, Jinwang
    Zan, Hongying
    Zhang, Kunli
    2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 168 - 174
  • [5] Improving Universal Language Model Fine-Tuning using Attention Mechanism
    Santos, Flavio A. O.
    Ponce-Guevara, K. L.
    Macedo, David
    Zanchettin, Cleber
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [6] A Comparative Analysis of Instruction Fine-Tuning Large Language Models for Financial Text Classification
    Fatemi, Sorouralsadat
    Hu, Yuheng
    Mousavi, Maryam
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2025, 16 (01)
  • [7] BERT MODEL FINE-TUNING FOR TEXT CLASSIFICATION IN KNEE OA RADIOLOGY REPORTS
    Chen, L.
    Shah, R.
    Link, T.
    Bucknor, M.
    Majumdar, S.
    Pedoia, V.
    OSTEOARTHRITIS AND CARTILAGE, 2020, 28 : S315 - S316
  • [8] Fine-tuning large language models for chemical text mining
    Zhang, Wei
    Wang, Qinggong
    Kong, Xiangtai
    Xiong, Jiacheng
    Ni, Shengkun
    Cao, Duanhua
    Niu, Buying
    Chen, Mingan
    Li, Yameng
    Zhang, Runze
    Wang, Yitian
    Zhang, Lehan
    Li, Xutong
    Xiong, Zhaoping
    Shi, Qian
    Huang, Ziming
    Fu, Zunyun
    Zheng, Mingyue
    CHEMICAL SCIENCE, 2024, 15 (27) : 10600 - 10611
  • [9] Better Fine-Tuning via Instance Weighting for Text Classification
    Wang, Zhi
    Bi, Wei
    Wang, Yan
    Liu, Xiaojiang
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 7241 - 7248
  • [10] Lazy fine-tuning algorithms for naive Bayesian text classification
    El Hindi, Khalil M.
    Aljulaidan, Reem R.
    AlSalman, Hussien
    APPLIED SOFT COMPUTING, 2020, 96