Zero-Shot Learners for Natural Language Understanding via a Unified Multiple-Choice Perspective

被引:1
|
作者
Wang, Junjie [1 ]
Yang, Ping [2 ]
Gan, Ruyi [2 ]
Zhang, Yuxiang [1 ]
Zhang, Jiaxing [2 ]
Sakai, Tetsuya [1 ]
机构
[1] Waseda Univ, Shinjuku Ku, Tokyo 1698555, Japan
[2] Int Digital Econ Acad IDEA, Futian 518045, Shenzhen, Peoples R China
关键词
Multitasking; Multi-task learning; natural language understanding; zero-shot learning; KNOWLEDGE;
D O I
10.1109/ACCESS.2023.3343123
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Zero-shot learning is an approach where models generalize to unseen tasks without direct training on them. We introduce the Unified Multiple-Choice (UniMC) framework, which is format-independent, compatible with various formats, and applicable to tasks like text classification and sentiment analysis. Furthermore, we design a two-stage tuning method, initially training on multiple-choice formats to develop format-agnostic capabilities, and subsequently enabling direct predictions on unseen tasks for zero-shot learning. Our methodology avoids issues in large-scale models like FLAN, enhancing generalization and reducing parameters. In experiments, UniMC shows State-of-the-Art (SOTA) performance across out-of-domain and in-domain benchmarks, with only 235M parameters, far fewer than previous methods. Moreover, the UniMC-Chinese model excels beyond human performance on benchmarks like EPRSTMT and CHID-FC, underscoring its generalization capacity across languages. Additionally, ablation experiments demonstrate the effectiveness of our design. The code and model weights are available at https://github.com/IDEA-CCNL/Fengshenbang-LM/tree/main/fengshen/examples/unimc.
引用
收藏
页码:142829 / 142845
页数:17
相关论文
共 50 条
  • [31] Zero-shot stance detection via multi-perspective contrastive with unlabeled data
    Jiang, Yan
    Gao, Jinhua
    Shen, Huawei
    Cheng, Xueqi
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (04)
  • [32] ONLINE ADAPTATIVE ZERO-SHOT LEARNING SPOKEN LANGUAGE UNDERSTANDING USING WORD-EMBEDDING
    Ferreira, Emmanuel
    Jabaian, Bassam
    Lefevre, Fabrice
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5321 - 5325
  • [33] CANZSL: Cycle-Consistent Adversarial Networks for Zero-Shot Learning from Natural Language
    Chen, Zhi
    Li, Jingjing
    Luo, Yadan
    Huang, Zi
    Yang, Yang
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 863 - 872
  • [34] Multiple Prompt Fusion for Zero-Shot Lesion Detection Using Vision-Language Models
    Guo, Miaotian
    Yi, Huahui
    Qin, Ziyuan
    Wang, Haiying
    Men, Aidong
    Lao, Qicheng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT V, 2023, 14224 : 283 - 292
  • [35] Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks
    Wang, Zhenhailong
    Pan, Xiaoman
    Yu, Dian
    Yu, Dong
    Chen, Jianshu
    Ji, Heng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 3978 - 4004
  • [36] End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation
    Wu, Mingrui
    Gu, Jiaxin
    Shen, Yunhang
    Lin, Mingbao
    Chen, Chao
    Sun, Xiaoshuai
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2839 - 2846
  • [37] Towards Cognition-Aligned Visual Language Models via Zero-Shot Instance Retrieval
    Ma, Teng
    Organisciak, Daniel
    Ma, Wenbao
    Long, Yang
    ELECTRONICS, 2024, 13 (09)
  • [38] Zero-Shot Nuclei Detection via Visual-Language Pre-trained Models
    Wu, Yongjian
    Zhou, Yang
    Saiyin, Jiya
    Wei, Bingzheng
    lai, Maode
    Shou, Jianzhong
    Fan, Yubo
    Xu, Yan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 693 - 703
  • [39] Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding
    Cheng, Xuxin
    Zhu, Zhihong
    Yang, Bang
    Zhuang, Xianwei
    Li, Hongxiang
    Zou, Yuexian
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 1806 - 1816
  • [40] Validation of a Zero-shot Learning Natural Language Processing Tool to Facilitate Data Abstraction for Urologic Research
    Kaufmann, Basil
    Busby, Dallin
    Das, Chandan Krushna
    Tillu, Neeraja
    Menon, Mani
    Tewari, Ashutosh K.
    Gorin, Michael A.
    EUROPEAN UROLOGY FOCUS, 2024, 10 (02): : 279 - 287