TextBrewer: An Open-Source Knowledge Distillation Toolkit for Natural Language Processing

被引:0
|
作者
Yang, Ziqing [1 ]
Cui, Yiming [1 ,2 ]
Chen, Zhipeng [1 ]
Che, Wanxiang [2 ]
Liu, Ting [2 ]
Wang, Shijin [1 ,3 ]
Hu, Guoping [1 ]
机构
[1] iFLYTEK Res, State Key Lab Cognit Intelligence, Hefei, Anhui, Peoples R China
[2] Harbin Inst Technol, Res Ctr Social Comp & Informat Retrieval SCIR, Harbin, Peoples R China
[3] iFLYTEK AI Res Hebei, Langfang, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we introduce TextBrewer, an open-source knowledge distillation toolkit designed for natural language processing. It works with different neural network models and supports various kinds of supervised learning tasks, such as text classification, reading comprehension, sequence labeling. TextBrewer provides a simple and uniform workflow that enables quick setting up of distillation experiments with highly flexible configurations. It offers a set of predefined distillation methods and can be extended with custom code. As a case study, we use TextBrewer to distill BERT on several typical NLP tasks. With simple configurations, we achieve results that are comparable with or even higher than the public distilled BERT models with similar numbers of parameters.(1)
引用
收藏
页码:9 / 16
页数:8
相关论文
共 50 条
  • [31] Towards Zero-Shot Knowledge Distillation for Natural Language Processing
    Rashid, Ahmad
    Lioutas, Vasileios
    Ghaddar, Abbas
    Rezagholizadeh, Mehdi
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 6551 - 6561
  • [32] Microsoft ICECAPS: An Open-Source Toolkit for Conversation Modeling
    Shiv, Vighnesh Leonardo
    Quirk, Chris
    Suri, Anshuman
    Gao, Xiang
    Shahid, Khuram
    Govindarajan, Nithya
    Zhang, Yizhe
    Gao, Jianfeng
    Galley, Michel
    Brockett, Chris
    Menon, Tulasi
    Dolan, Bill
    [J]. PROCEEDINGS OF THE 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, (ACL 2019), 2019, : 123 - 128
  • [33] OpenNMT: Open-Source Toolkit for Neural Machine Translation
    Klein, Guillaume
    Kim, Yoon
    Deng, Yuntian
    Senellart, Jean
    Rush, Alexander M.
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017): SYSTEM DEMONSTRATIONS, 2017, : 67 - 72
  • [34] KIPET - AN OPEN-SOURCE KINETIC PARAMETER ESTIMATION TOOLKIT
    Short, Michael
    Schenk, Christina
    Thierry, David
    Rodriguez, Jose Santiago
    Biegler, Lorenz T.
    Garcia-Munoz, Salvador
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON FOUNDATIONS OF COMPUTER-AIDED PROCESS DESIGN, 2019, 47 : 299 - 304
  • [35] GDP: an open-source GNSS data preprocessing toolkit
    Zhengsheng Chen
    Yang Cui
    Linyang Li
    Qinghua Zhang
    Zhiping Lu
    Xuerui Li
    Yingcai Kuang
    Kaichun Yang
    Fengjuan Rong
    [J]. GPS Solutions, 2020, 24
  • [36] DadmaTools: a Natural Language Processing Toolkit for the Persian Language
    Etezadi, Romina
    Karrabi, Mohammad
    Maduyieh, Najmeh Zare
    Sajadi, Mohammad Bagher
    Pilehvar, Mohammad Taher
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE DEMONSTRATIONS SESSION, 2022, : 124 - 130
  • [37] Open-source solutions for SPIMage processing
    Schmied, Christopher
    Stamataki, Evangelia
    Tomancak, Pavel
    [J]. QUANTITATIVE IMAGING IN CELL BIOLOGY, 2014, 123 : 505 - 529
  • [38] Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing
    Brierley, Claire
    Sawalha, Majdi
    Atwell, Eric
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1011 - 1016
  • [39] Methods and open-source toolkit for analyzing and visualizing challenge results
    Wiesenfarth, Manuel
    Reinke, Annika
    Landman, Bennett A.
    Eisenmann, Matthias
    Saiz, Laura Aguilera
    Cardoso, M. Jorge
    Maier-Hein, Lena
    Kopp-Schneider, Annette
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [40] SymCog: An open-source toolkit for assessing human symbolic cognition
    Flurie, Maurice
    Kelly, Alexandra
    Olson, Ingrid R.
    Reilly, Jamie
    [J]. BEHAVIOR RESEARCH METHODS, 2023, 55 (02) : 807 - 823