Toward Zero-Shot and Zero-Resource Multilingual Question Answering

被引:3
|
作者
Kuo, Chia-Chih [1 ]
Chen, Kuan-Yu [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Comp Sci & Informat Engn Dept, Taipei 106, Taiwan
关键词
Task analysis; Question answering (information retrieval); Training data; Data models; Transfer learning; Online services; Internet; Natural language processing; Multilingual question answering; zero-shot; zero-resource; mBERT;
D O I
10.1109/ACCESS.2022.3207569
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, multilingual question answering has been an emergent research topic and has attracted much attention. Although systems for English and other rich-resource languages that rely on various advanced deep learning-based techniques have been highly developed, most of them in low-resource languages are impractical due to data insufficiency. Accordingly, many studies have attempted to improve the performance of low-resource languages in a zero-shot or few-shot manner based on multilingual bidirectional encoder representations from transformers (mBERT) by transferring knowledge learned from rich-resource languages to low-resource languages. Most methods require either a large amount of unlabeled data or a small set of labeled data for low-resource languages. In Wikipedia, 169 languages have less than 10,000 articles, and 48 languages have less than 1,000 articles. This reason motivates us to conduct a zero-shot multilingual question answering task under a zero-resource scenario. Thus, this study proposes a framework to fine-tune the original mBERT using data from rich-resource languages, and the resulting model can be used for low-resource languages in a zero-shot and zero-resource manner. Compared to several baseline systems, which require millions of unlabeled data for low-resource languages, the performance of our proposed framework is not only highly comparative but is also better for languages used in training.
引用
收藏
页码:99754 / 99761
页数:8
相关论文
共 50 条
  • [1] Zero-shot Event Causality Identification with Question Answering
    Liakhovets, Daria
    Schlarb, Sven
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE COMPUTATIONAL LINGUISTICS IN BULGARIA, CLIB 2022, 2022, : 113 - 119
  • [2] Zero-shot Visual Question Answering with Language Model Feedback
    Du, Yifan
    Li, Junyi
    Tang, Tianyi
    Zhao, Wayne Xin
    Wen, Ji-Rong
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9268 - 9281
  • [3] Zero-Shot Visual Question Answering Using Knowledge Graph
    Chen, Zhuo
    Chen, Jiaoyan
    Geng, Yuxia
    Pan, Jeff Z.
    Yuan, Zonggang
    Chen, Huajun
    SEMANTIC WEB - ISWC 2021, 2021, 12922 : 146 - 162
  • [4] Improving Zero-Shot Cross-lingual Transfer for Multilingual Question Answering over Knowledge Graph
    Zhou, Yucheng
    Geng, Xiubo
    Shen, Tao
    Zhang, Wenqiang
    Jiang, Daxin
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5822 - 5834
  • [5] Zero-Shot Commonsense Question Answering with Cloze Translation and Consistency Optimization
    Dou, Zi-Yi
    Peng, Nanyun
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10572 - 10580
  • [6] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
    Yang, Antoine
    Miech, Antoine
    Sivic, Josef
    Laptev, Ivan
    Schmid, Cordelia
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [7] CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering
    Wang, Weiqi
    Fang, Tianqing
    Ding, Wenxuan
    Xu, Baixuan
    Li, Xin
    Song, Yangqiu
    Bosselut, Antoine
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 13520 - 13545
  • [8] MuHeQA: Zero-shot question answering over multiple and heterogeneous knowledge bases
    Badenes-Olmedo, Carlos
    Corcho, Oscar
    SEMANTIC WEB, 2024, 15 (05) : 1547 - 1561
  • [9] Synthetic Data Augmentation for Zero-Shot Cross-Lingual Question Answering
    Riabi, Arij
    Scialom, Thomas
    Keraron, Rachel
    Sagot, Benoit
    Seddah, Djame
    Staiano, Jacopo
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7016 - 7030
  • [10] Chart question answering with multimodal graph representation learning and zero-shot classification
    Farahani, Ali Mazraeh
    Adibi, Peyman
    Ehsani, Mohammad Saeed
    Hutter, Hans-Peter
    Darvishy, Alireza
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 270