Toward Zero-Shot and Zero-Resource Multilingual Question Answering

被引:3
|
作者
Kuo, Chia-Chih [1 ]
Chen, Kuan-Yu [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Comp Sci & Informat Engn Dept, Taipei 106, Taiwan
关键词
Task analysis; Question answering (information retrieval); Training data; Data models; Transfer learning; Online services; Internet; Natural language processing; Multilingual question answering; zero-shot; zero-resource; mBERT;
D O I
10.1109/ACCESS.2022.3207569
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, multilingual question answering has been an emergent research topic and has attracted much attention. Although systems for English and other rich-resource languages that rely on various advanced deep learning-based techniques have been highly developed, most of them in low-resource languages are impractical due to data insufficiency. Accordingly, many studies have attempted to improve the performance of low-resource languages in a zero-shot or few-shot manner based on multilingual bidirectional encoder representations from transformers (mBERT) by transferring knowledge learned from rich-resource languages to low-resource languages. Most methods require either a large amount of unlabeled data or a small set of labeled data for low-resource languages. In Wikipedia, 169 languages have less than 10,000 articles, and 48 languages have less than 1,000 articles. This reason motivates us to conduct a zero-shot multilingual question answering task under a zero-resource scenario. Thus, this study proposes a framework to fine-tune the original mBERT using data from rich-resource languages, and the resulting model can be used for low-resource languages in a zero-shot and zero-resource manner. Compared to several baseline systems, which require millions of unlabeled data for low-resource languages, the performance of our proposed framework is not only highly comparative but is also better for languages used in training.
引用
收藏
页码:99754 / 99761
页数:8
相关论文
共 50 条
  • [21] DiscoLQA: zero-shot discourse-based legal question answering on European Legislation
    Sovrano, Francesco
    Palmirani, Monica
    Sapienza, Salvatore
    Pistone, Vittoria
    ARTIFICIAL INTELLIGENCE AND LAW, 2024,
  • [22] Multilingual bottleneck features for subword modeling in zero-resource languages
    Hermann, Enno
    Goldwater, Sharon
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2668 - 2672
  • [23] Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts
    Lan, Yunshi
    Li, Xiang
    Liu, Xin
    Li, Yang
    Qin, Wei
    Qian, Weining
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4389 - 4400
  • [24] From Zero to Hero: On the Limitations of Zero-Shot Language Transfer with Multilingual Transformers
    Lauscher, Anne
    Ravishankar, Vinit
    Vulic, Ivan
    Glavas, Goran
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4483 - 4499
  • [25] Exploring Question Decomposition for Zero-Shot VQA
    Khan, Zaid
    Kumar, Vijay B. G.
    Schulter, Samuel
    Chandraker, Manmohan
    Fu, Yun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [26] An Image Grid Can Be Worth a Video: Zero-Shot Video Question Answering Using a VLM
    Kim, Wonkyun
    Choi, Changin
    Lee, Wonseok
    Rhee, Wonjong
    IEEE ACCESS, 2024, 12 : 193057 - 193075
  • [27] Dynamic Neuro-Symbolic Knowledge Graph Construction for Zero-Shot Commonsense Question Answering
    Bosselut, Antoine
    Le Bras, Ronan
    Choi, Yejin
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 4923 - 4931
  • [28] Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
    Pan, Junting
    Lin, Ziyi
    Ge, Yuying
    Zhu, Xiatian
    Zhang, Renrui
    Wang, Yi
    Qiao, Yu
    Li, Hongsheng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 272 - 283
  • [29] MULTILINGUAL ACOUSTIC WORD EMBEDDING MODELS FOR PROCESSING ZERO-RESOURCE LANGUAGES
    Kamper, Herman
    Matusevych, Yevgen
    Goldwater, Sharon
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6414 - 6418
  • [30] HOW PHONOTACTICS AFFECT MULTILINGUAL AND ZERO-SHOT ASR PERFORMANCE
    Feng, Siyuan
    Zelasko, Piotr
    Moro-Velazquez, Laureano
    Abavisani, Ali
    Hasegawa-Johnson, Mark
    Scharenborg, Odette
    Dehak, Najim
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7238 - 7242