Zero-Shot Learners for Natural Language Understanding via a Unified Multiple-Choice Perspective

被引:1
|
作者
Wang, Junjie [1 ]
Yang, Ping [2 ]
Gan, Ruyi [2 ]
Zhang, Yuxiang [1 ]
Zhang, Jiaxing [2 ]
Sakai, Tetsuya [1 ]
机构
[1] Waseda Univ, Shinjuku Ku, Tokyo 1698555, Japan
[2] Int Digital Econ Acad IDEA, Futian 518045, Shenzhen, Peoples R China
关键词
Multitasking; Multi-task learning; natural language understanding; zero-shot learning; KNOWLEDGE;
D O I
10.1109/ACCESS.2023.3343123
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Zero-shot learning is an approach where models generalize to unseen tasks without direct training on them. We introduce the Unified Multiple-Choice (UniMC) framework, which is format-independent, compatible with various formats, and applicable to tasks like text classification and sentiment analysis. Furthermore, we design a two-stage tuning method, initially training on multiple-choice formats to develop format-agnostic capabilities, and subsequently enabling direct predictions on unseen tasks for zero-shot learning. Our methodology avoids issues in large-scale models like FLAN, enhancing generalization and reducing parameters. In experiments, UniMC shows State-of-the-Art (SOTA) performance across out-of-domain and in-domain benchmarks, with only 235M parameters, far fewer than previous methods. Moreover, the UniMC-Chinese model excels beyond human performance on benchmarks like EPRSTMT and CHID-FC, underscoring its generalization capacity across languages. Additionally, ablation experiments demonstrate the effectiveness of our design. The code and model weights are available at https://github.com/IDEA-CCNL/Fengshenbang-LM/tree/main/fengshen/examples/unimc.
引用
收藏
页码:142829 / 142845
页数:17
相关论文
共 50 条
  • [21] Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images
    Lu, Ming Y.
    Chen, Bowen
    Zhang, Andrew
    Williamson, Drew F. K.
    Chen, Richard J.
    Ding, Tong
    Le, Long Phi
    Chuang, Yung-Sung
    Mahmood, Faisal
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19764 - 19775
  • [22] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
    Yang, Antoine
    Miech, Antoine
    Sivic, Josef
    Laptev, Ivan
    Schmid, Cordelia
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [23] Zero-Shot Temporal Action Detection via Vision-Language Prompting
    Nag, Sauradip
    Zhu, Xiatian
    Song, Yi-Zhe
    Xiang, Tao
    COMPUTER VISION - ECCV 2022, PT III, 2022, 13663 : 681 - 697
  • [24] AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages
    Ebrahimi, Abteen
    Mager, Manuel
    Oncevay, Arturo
    Chaudhary, Vishrav
    Chiruzzo, Luis
    Fan, Angela
    Ortega, John E.
    Ramos, Ricardo
    Rios, Annette
    Meza-Ruiz, Ivan
    Gimenez-Lugo, Gustavo A.
    Mager, Elisabeth
    Neubig, Graham
    Palmer, Alexis
    Coto-Solano, Rolando
    Ngoc Thang Vu
    Kann, Katharina
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6279 - 6299
  • [25] CrossAligner & Co: Zero-Shot Transfer Methods for Task-Oriented Cross-lingual Natural Language Understanding
    Gritta, Milan
    Hu, Ruoyu
    Iacobacci, Ignacio
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 4048 - 4061
  • [26] ADVERSARIAL BANDIT FOR ONLINE INTERACTIVE ACTIVE LEARNING OF ZERO-SHOT SPOKEN LANGUAGE UNDERSTANDING
    Ferreira, Emmanuel
    Masson, Alexandre Reiffers
    Jabaian, Bassam
    Lefevre, Fabrice
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6155 - 6159
  • [27] Zero-shot domain adaptation for natural language inference by projecting superficial words out
    Cui, Wanyun
    Zheng, Guangyu
    Wang, Wei
    KNOWLEDGE-BASED SYSTEMS, 2021, 227
  • [28] Distractor Analysis and Selection for Multiple-Choice Cloze Questions for Second-Language Learners
    Gao, Lingyu
    Gimpel, Kevin
    Jensson, Arnar
    INNOVATIVE USE OF NLP FOR BUILDING EDUCATIONAL APPLICATIONS, 2020, : 102 - 114
  • [29] ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
    Yang, Bang
    Liu, Fenglin
    Zou, Yuexian
    Wu, Xian
    Wang, Yaowei
    Clifton, David A.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5712 - 5724
  • [30] ENHANCING CLASS UNDERSTANDING VIA PROMPT-TUNING FOR ZERO-SHOT TEXT CLASSIFICATION
    Dan, Yuhao
    Zhou, Jie
    Chen, Qin
    Bai, Qingchun
    He, Liang
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4303 - 4307