A weakly supervised textual entailment approach to zero-shot text classification

Cited by: 0
Authors
Pamies, Marc [1]
Llop, Joan [1]
Multari, Francesco [2]
Duran-Silva, Nicolau [2]
Parra-Rojas, Cesar [2]
Gonzalez-Agirre, Aitor [1]
Massucci, Francesco Alessandro [2]
Villegas, Marta [1]
Affiliations
[1] Barcelona Supercomputing Center, Barcelona, Spain
[2] SIRIS Academic, Barcelona, Spain
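Funding
EU Horizon 2020
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract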
Zero-shot text classification is a widely studied task that deals with a lack of annotated data. The most common approach is to reformulate it as a textual entailment problem, enabling classification into unseen classes. This work explores an effective approach that trains on a weakly supervised dataset generated from traditional classification data. We empirically study the relation between performance on the entailment task, which is used as a proxy, and performance on the target zero-shot text classification task. Our findings reveal that there is no linear correlation between the two tasks, to the extent that lengthening the fine-tuning process can be detrimental even when the model is still learning, and we propose a straightforward method to stop training in time. As a proof of concept, we introduce a domain-specific zero-shot text classifier that was trained on Microsoft Academic Graph data. The model, called SCIroShot, achieves state-of-the-art performance in the scientific domain and competitive results in other areas. Both the model and the evaluation benchmark are publicly available on HuggingFace and GitHub.
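The entailment reformulation mentioned in the abstract can be illustrated with a minimal sketch. It turns labeled classification examples into premise/hypothesis pairs, with the true label giving an "entailment" pair and a randomly sampled wrong label giving a "not entailment" pair. The hypothesis template, the function name `build_entailment_pairs`, the negative-sampling scheme, and the toy examples are all assumptions made for illustration; they are not taken from the paper and may differ from how the authors built their dataset.

```python
import random

# Hypothetical hypothesis template; the wording used in the paper is not stated here.
TEMPLATE = "This text is about {}."


def build_entailment_pairs(examples, labels, negatives_per_example=1, seed=0):
    """Convert (text, label) examples into (premise, hypothesis, target) triples.

    target is 1 for "entailment" (hypothesis built from the true label) and
    0 for "not entailment" (hypothesis built from a sampled wrong label).
    """
    rng = random.Random(seed)
    pairs = []
    for text, label in examples:
        # Positive pair: the premise entails the hypothesis for its own label.
        pairs.append((text, TEMPLATE.format(label), 1))
        # Negative pairs: sample labels that differ from the true one.
        wrong_labels = [candidate for candidate in labels if candidate != label]
        k = min(negatives_per_example, len(wrong_labels))
        for negative in rng.sample(wrong_labels, k):
            pairs.append((text, TEMPLATE.format(negative), 0))
    return pairs


# Toy data, invented for this sketch.
examples = [
    ("We study protein folding with molecular dynamics.", "biology"),
    ("A new bound for sorting networks is proved.", "computer science"),
]
labels = ["biology", "computer science", "economics"]
pairs = build_entailment_pairs(examples, labels)
```

At inference time, the same template would be filled with each candidate class, including classes never seen in training, and the class whose hypothesis receives the highest entailment score would be predicted.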
Pages: 286-296
Page count: 11
Related papers
50 in total
  • [1] Zero-Shot Text Classification with Semantically Extended Textual Entailment
    Liu, Tengfei
    Hu, Yongli
    Chen, Puman
    Sun, Yanfeng
    Yin, Baocai
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [2] Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach
    Yin, Wenpeng
    Hay, Jamaal
    Roth, Dan
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3914 - 3923
  • [3] Issues with Entailment-based Zero-shot Text Classification
    Ma, Tingting
    Yao, Jin-Ge
    Lin, Chin-Yew
    Zhao, Tiejun
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 786 - 796
  • [4] Weakly supervised classification model for zero-shot semantic segmentation
    Shen, Fengli
    Wang, Zong-Hui
    Lu, Zhe-Ming
    [J]. ELECTRONICS LETTERS, 2020, 56 (23) : 1247 - 1249
  • [5] Generalised Zero-shot Learning for Entailment-based Text Classification with External Knowledge
    Wang, Yuqi
    Wang, Wei
    Chen, Qi
    Huang, Kaizhu
    Anh Nguyen
    De, Suparna
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2022), 2022, : 19 - 25
  • [6] Zero-Shot Turkish Text Classification
    Birim, Ahmet
    Erden, Mustafa
    Arslan, Levent M.
    [J]. 29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [7] Retrieval Augmented Zero-Shot Text Classification
    Abdullahi, Tassallah
    Singh, Ritambhara
    Eickhoff, Carsten
    [J]. PROCEEDINGS OF THE 2024 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2024, 2024, : 195 - 203
  • [8] Weakly-Supervised Questions for Zero-Shot Relation Extraction
    Najafi, Saeed
    Fyshe, Alona
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 3075 - 3087
  • [9] Unified benchmark for zero-shot Turkish text classification
    Çelik, Emrecan
    Dalyan, Tugba
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [10] Extreme Zero-Shot Learning for Extreme Text Classification
    Xiong, Yuanhao
    Chang, Wei-Cheng
    Hsieh, Cho-Jui
    Yu, Hsiang-Fu
    Dhillon, Inderjit
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5455 - 5468