Toward Zero-Shot and Zero-Resource Multilingual Question Answering

Cited by: 3
Authors
Kuo, Chia-Chih [1]
Chen, Kuan-Yu [1]
Affiliations
[1] Natl Taiwan Univ Sci & Technol, Comp Sci & Informat Engn Dept, Taipei 106, Taiwan
Keywords
Task analysis; Question answering (information retrieval); Training data; Data models; Transfer learning; Online services; Internet; Natural language processing; Multilingual question answering; zero-shot; zero-resource; mBERT;
DOI
10.1109/ACCESS.2022.3207569
CLC number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, multilingual question answering has emerged as a research topic and attracted much attention. Although question answering systems for English and other rich-resource languages, which rely on advanced deep learning techniques, are highly developed, comparable systems for low-resource languages remain impractical because of insufficient data. Accordingly, many studies have attempted to improve performance on low-resource languages in a zero-shot or few-shot manner, building on multilingual bidirectional encoder representations from transformers (mBERT) and transferring knowledge learned from rich-resource languages to low-resource languages. Most of these methods require either a large amount of unlabeled data or a small set of labeled data in the low-resource languages. In Wikipedia, however, 169 languages have fewer than 10,000 articles, and 48 languages have fewer than 1,000 articles. This scarcity motivates us to study zero-shot multilingual question answering under a zero-resource scenario. This study therefore proposes a framework that fine-tunes the original mBERT using data from rich-resource languages; the resulting model can then be applied to low-resource languages in a zero-shot and zero-resource manner. Compared with several baseline systems, which require millions of unlabeled samples in the low-resource languages, the proposed framework achieves highly competitive performance and performs better on the languages used in training.
Pages: 99754-99761
Number of pages: 8