A Data-Efficient Method for One-Shot Text Classification

被引:0
|
作者
Wang, Hsin-Yang [1 ]
Liu, Mu [1 ]
Yamashita, Katsushi [1 ]
Okamoto, Yasuhiro [2 ]
Yamada, Satoshi [2 ]
机构
[1] SoftBank Corp, R&D Promot Off, Tokyo, Japan
[2] SoftBank Corp, AI Engn Dept, Tokyo, Japan
关键词
natural language processing; few-shot learning; text classification;
D O I
10.1109/CCAI55564.2022.9807798
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose BiGBERT (Binary Grouping BERT), a data-efficient training method for one-shot text classification. With the idea of One-vs-Rest method, we designed an extensible output layer for BERT, which can increase the usability of the training data. To evaluate our approach, we conducted extensive experiments on four celebrated text classification datasets, and reform these datasets into one-shot training scenario, which is approximately equal to the situation of our commercial datasets. The experiment result shows our approach achieves 54.9% in 5AbstractsGroup dataset, 40.2% in 20NewsGroup dataset, 57.0% in IMDB dataset, and 33.6% in TREC dataset. Overall, compare to the baseline BERT, our proposed method achieves 2.3% similar to 28.6% improved in accuracy. This result shows BiGBERT is stable and have significantly improved on one-shot text classification.
引用
收藏
页码:76 / 80
页数:5
相关论文
共 50 条
  • [1] An efficient compression method for one-shot multispectral camera
    Shinoda, Kazuma
    Murakami, Yuri
    Yamaguchi, Masahiro
    Ortega, Antonio
    2012 PICTURE CODING SYMPOSIUM (PCS), 2012, : 269 - 272
  • [2] Data-Efficient Language Shaped Few-shot Image Classification
    Liang, Zhenwen
    Zhang, Xiangliang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4680 - 4686
  • [3] Towards One-Shot Learning for Text Classification using Inductive Logic Programming
    Milani, Ghazal Afroozi
    Cyrus, Daniel
    Tamaddoni-Nezhad, Alireza
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2023, (385): : 69 - 79
  • [4] Latent-Variable Generative Models for Data-Efficient Text Classification
    Ding, Xiaoan
    Gimpel, Kevin
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 507 - 517
  • [5] One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition
    Souibgui, Mohamed Ali
    Biten, Ali Furkan
    Dey, Sounak
    Fornes, Alicia
    Kessentini, Yousri
    Gomez, Lluis
    Karatzas, Dimosthenis
    Llados, Josep
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2563 - 2571
  • [6] A one-shot method for measurement of diffusion
    Song, YQ
    Tang, XP
    JOURNAL OF MAGNETIC RESONANCE, 2004, 170 (01) : 136 - 148
  • [7] Efficient methods for one-shot quantum communication
    Anurag Anshu
    Rahul Jain
    npj Quantum Information, 8
  • [8] Efficient methods for one-shot quantum communication
    Anshu, Anurag
    Jain, Rahul
    NPJ QUANTUM INFORMATION, 2022, 8 (01)
  • [9] Data-efficient classification of radio galaxies
    Samudre, Ashwin
    George, Lijo T.
    Bansal, Mahak
    Wadadekar, Yogesh
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2022, 509 (02) : 2269 - 2280
  • [10] One-shot Learning Approach for Unknown Malware Classification
    Tran, Trung Kien
    Sato, Hiroshi
    Kubo, Masao
    PROCEEDINGS OF THE 2018 5TH ASIAN CONFERENCE ON DEFENSE TECHNOLOGY (ACDT 2018), 2018, : 8 - 13