Enabling deep learning for large scale question answering in Italian

被引:2
|
作者
Croce, Danilo [1 ]
Zelenanska, Alexandra [1 ]
Basili, Roberto [1 ]
机构
[1] Univ Roma Tor Vergata, Dept Enterprise Engn, Rome, Italy
关键词
Question answering in Italian; deep learning; recurrent neural network with attention;
D O I
10.3233/IA-190018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent breakthroughs in the field of deep learning led to state-of-the-art results in several NLP tasks, such as Question Answering (QA). Unfortunately, the requirements of such neural QA systems are very strict due to the size of the involved training datasets. In cross-linguistic settings these requirements are not satisfied as training datasets for QA over non-English texts are often not available. This represents the major barrier for a wide-spread adoption of neural QA methods in NLP applications. In this paper, the acquisition of a large scale dataset for an open-domain factoid question answering system in Italian is discussed. It is obtained by automatic translation and linguistic elicitation of an existing English dataset, i.e. the SQUAD question-answer pair corpus. Even though the quality of the resulting corpus for Italian might not be completely satisfying, our work allowed to generate more than 60 thousand question-answer pairs. In the paper the impact of this resource on the QA process over the Italian Wikipedia is studied, according to different training conditions and architectural constraints. A comparative evaluation against the English version, in line with standards in the SQUAD literature, is carried out. The outcomes show that the results achievable for Italian are below the state-of-the-art for English, but the ability of learning not to respond (i.e. the adoption of techniques for detecting question whose answers are simply not available, i.e. EMPTY set of answers) allows the system to pursue reasonable levels of precision. This make it already usable within realistic application scenarios. Finally, an error analysis is presented that suggests possible future research directions on still critical but highly beneficial enhancements, in view of concrete QA applications in Italian.
引用
收藏
页码:49 / 61
页数:13
相关论文
共 50 条
  • [21] Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge Base
    Shen, Tao
    Geng, Xiubo
    Qin, Tao
    Guo, Daya
    Tang, Duyu
    Duan, Nan
    Long, Guodong
    Jiang, Daxin
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2442 - 2451
  • [22] Intelligent Question Answering in Restricted Domains Using Deep Learning and Question Pair Matching
    Cai, Lin-Qin
    Wei, Min
    Zhou, Si-Tong
    Yan, Xun
    IEEE ACCESS, 2020, 8 : 32922 - 32934
  • [23] Deep Semantic Understanding and Sequence Relevance Learning for Question Routing in Community Question Answering
    Li, Hong
    Li, Jianjun
    Li, Guohui
    Wang, Chunzhi
    Cao, Wenjun
    Chen, Zixuan
    INFORMATION TECHNOLOGY AND CONTROL, 2023, 52 (03): : 789 - 805
  • [24] Medical Exam Question Answering with Large-Scale Reading Comprehension
    Zhang, Xiao
    Wu, Ji
    He, Zhiyang
    Liu, Xien
    Su, Ying
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5706 - 5713
  • [25] Large-Scale Goodness Polarity Lexicons for Community Question Answering
    Mihaylov, Todor
    Balchev, Daniel
    Kiprov, Yasen
    Koychev, Ivan
    Nakov, Preslav
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1185 - 1188
  • [26] Enabling Versatile Analysis of Large Scale Traffic Video Data with Deep Learning and HiveQL
    Huang, Lei
    Xu, Weijia
    Liu, Si
    Pandey, Venktesh
    Juri, Natalia Ruiz
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 1153 - 1162
  • [27] Seeing and Reasoning: A Simple Deep Learning Approach to Visual Question Answering
    Zakari, Rufai Yusuf
    Owusu, Jim Wilson
    Qin, Ke
    He, Tao
    Luo, Guangchun
    BIG DATA MINING AND ANALYTICS, 2025, 8 (02): : 458 - 478
  • [28] Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
    Malinowski, Mateusz
    Rohrbach, Marcus
    Fritz, Mario
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 125 (1-3) : 110 - 135
  • [29] Deep Knowledge Graph Representation Learning for Completion, Alignment, and Question Answering
    Chakrabarti, Soumen
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 3451 - 3454
  • [30] Question Answering Systems With Deep Learning-Based Symbolic Processing
    Honda, Hiroshi
    Hagiwara, Masafumi
    IEEE ACCESS, 2019, 7 : 152368 - 152378