TextBrewer: An Open-Source Knowledge Distillation Toolkit for Natural Language Processing

Cited by: 0
Authors
Yang, Ziqing [1]
Cui, Yiming [1,2]
Chen, Zhipeng [1]
Che, Wanxiang [2]
Liu, Ting [2]
Wang, Shijin [1,3]
Hu, Guoping [1]
Affiliations
[1] iFLYTEK Res, State Key Lab Cognit Intelligence, Hefei, Anhui, Peoples R China
[2] Harbin Inst Technol, Res Ctr Social Comp & Informat Retrieval SCIR, Harbin, Peoples R China
[3] iFLYTEK AI Res Hebei, Langfang, Peoples R China
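Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract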
In this paper, we introduce TextBrewer, an open-source knowledge distillation toolkit designed for natural language processing. It works with different neural network models and supports various kinds of supervised learning tasks, such as text classification, reading comprehension, and sequence labeling. TextBrewer provides a simple and uniform workflow that enables quick setup of distillation experiments with highly flexible configurations. It offers a set of predefined distillation methods and can be extended with custom code. As a case study, we use TextBrewer to distill BERT on several typical NLP tasks. With simple configurations, we achieve results that are comparable with or even higher than those of the public distilled BERT models with similar numbers of parameters.
Pages: 9-16
Page count: 8