Zero-Shot Learners for Natural Language Understanding via a Unified Multiple-Choice Perspective

被引:1
|
作者
Wang, Junjie [1 ]
Yang, Ping [2 ]
Gan, Ruyi [2 ]
Zhang, Yuxiang [1 ]
Zhang, Jiaxing [2 ]
Sakai, Tetsuya [1 ]
机构
[1] Waseda Univ, Shinjuku Ku, Tokyo 1698555, Japan
[2] Int Digital Econ Acad IDEA, Futian 518045, Shenzhen, Peoples R China
关键词
Multitasking; Multi-task learning; natural language understanding; zero-shot learning; KNOWLEDGE;
D O I
10.1109/ACCESS.2023.3343123
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Zero-shot learning is an approach where models generalize to unseen tasks without direct training on them. We introduce the Unified Multiple-Choice (UniMC) framework, which is format-independent, compatible with various formats, and applicable to tasks like text classification and sentiment analysis. Furthermore, we design a two-stage tuning method, initially training on multiple-choice formats to develop format-agnostic capabilities, and subsequently enabling direct predictions on unseen tasks for zero-shot learning. Our methodology avoids issues in large-scale models like FLAN, enhancing generalization and reducing parameters. In experiments, UniMC shows State-of-the-Art (SOTA) performance across out-of-domain and in-domain benchmarks, with only 235M parameters, far fewer than previous methods. Moreover, the UniMC-Chinese model excels beyond human performance on benchmarks like EPRSTMT and CHID-FC, underscoring its generalization capacity across languages. Additionally, ablation experiments demonstrate the effectiveness of our design. The code and model weights are available at https://github.com/IDEA-CCNL/Fengshenbang-LM/tree/main/fengshen/examples/unimc.
引用
收藏
页码:142829 / 142845
页数:17
相关论文
共 50 条
  • [1] Modeling Zero-Shot Relation Classification as a Multiple-Choice Problem
    Lan, Yuquan
    Li, Dongxu
    Zhang, Yunqi
    Zhao, Hui
    Zhao, Gang
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [2] Zero-Shot Reward Specification via Grounded Natural Language
    Mahmoudieh, Parsa
    Pathak, Deepak
    Darrell, Trevor
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [3] Zero-shot autonomous robot manipulation via natural language
    Han, Changheon
    Lee, Jiho
    Lee, Hojun
    Sim, Yuseop
    Jeon, Jurim
    Jun, Martin Byung-Guk
    MANUFACTURING LETTERS, 2024, 42 : 16 - 20
  • [4] Zero-shot Natural Language Video Localization
    Nam, Jinwoo
    Ahn, Daechul
    Kang, Dongyeop
    Ha, Seong Jong
    Choi, Jonghyun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1450 - 1459
  • [5] Zero-Shot Adaptive Transfer for Conversational Language Understanding
    Lee, Sungjin
    Jha, Rahul
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6642 - 6649
  • [6] Zero-shot semantic parser for spoken language understanding
    Ferreira, Emmanuel
    Jabaian, Bassam
    Lefevre, Fabrice
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1403 - 1407
  • [7] Unified Language-driven Zero-shot Domain Adaptation
    Yang, Senqiao
    Tian, Zhuotao
    Jiang, Li
    Jia, Jiaya
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 23407 - 23415
  • [8] UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
    Sun, Rui
    Wang, Zhecan
    You, Haoxuan
    Codella, Noel
    Chang, Kai-Wei
    Chang, Shih-Fu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 778 - 793
  • [9] Effective Guidance in Zero-Shot Multilingual Translation via Multiple Language Prototypes
    Zheng, Yafang
    Lin, Lei
    Yuan, Yuxuan
    Shi, Xiaodong
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT VI, 2024, 14452 : 226 - 238
  • [10] Zero-Shot Classification by Logical Reasoning on Natural Language Explanations
    Han, Chi
    Pei, Hengzhi
    Du, Xinya
    Ji, Heng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 8967 - 8981