Enabling deep learning for large scale question answering in Italian

被引：2

作者：

Croce, Danilo ^{[1
]}

Zelenanska, Alexandra ^{[1
]}

Basili, Roberto ^{[1
]}

机构：

[1] Univ Roma Tor Vergata, Dept Enterprise Engn, Rome, Italy

来源：

INTELLIGENZA ARTIFICIALE | 2019年 / 13卷 / 01期

关键词：

Question answering in Italian; deep learning; recurrent neural network with attention;

D O I：

10.3233/IA-190018

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The recent breakthroughs in the field of deep learning led to state-of-the-art results in several NLP tasks, such as Question Answering (QA). Unfortunately, the requirements of such neural QA systems are very strict due to the size of the involved training datasets. In cross-linguistic settings these requirements are not satisfied as training datasets for QA over non-English texts are often not available. This represents the major barrier for a wide-spread adoption of neural QA methods in NLP applications. In this paper, the acquisition of a large scale dataset for an open-domain factoid question answering system in Italian is discussed. It is obtained by automatic translation and linguistic elicitation of an existing English dataset, i.e. the SQUAD question-answer pair corpus. Even though the quality of the resulting corpus for Italian might not be completely satisfying, our work allowed to generate more than 60 thousand question-answer pairs. In the paper the impact of this resource on the QA process over the Italian Wikipedia is studied, according to different training conditions and architectural constraints. A comparative evaluation against the English version, in line with standards in the SQUAD literature, is carried out. The outcomes show that the results achievable for Italian are below the state-of-the-art for English, but the ability of learning not to respond (i.e. the adoption of techniques for detecting question whose answers are simply not available, i.e. EMPTY set of answers) allows the system to pursue reasonable levels of precision. This make it already usable within realistic application scenarios. Finally, an error analysis is presented that suggests possible future research directions on still critical but highly beneficial enhancements, in view of concrete QA applications in Italian.

引用

页码：49 / 61

页数：13

共 50 条

[11] A building regulation question answering system: A deep learning methodology
Zhong, Botao
He, Wanlei
Huang, Ziwei
Love, Peter E. D.
Tang, Junqing
Luo, Hanbin
ADVANCED ENGINEERING INFORMATICS, 2020, 46 (46)
[12] Malayalam Question Answering System Using Deep Learning Approaches
Rahmath, Reji K.
Raj, P. C. Reghu
Rafeeque, P. C.
IETE JOURNAL OF RESEARCH, 2023, 69 (12) : 8889 - 8901
[13] Recent progress in leveraging deep learning methods for question answering
Hao, Tianyong
Li, Xinxin
He, Yulan
Wang, Fu Lee
Qu, Yingying
Neural Computing and Applications, 2022, 34 (04) : 2765 - 2783
[14] Knowledge Base Question Answering Based on Deep Learning Models
Xie, Zhiwen
Zeng, Zhao
Zhou, Guangyou
He, Tingting
NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 300 - 311
[15] Recent progress in leveraging deep learning methods for question answering
Tianyong Hao
Xinxin Li
Yulan He
Fu Lee Wang
Yingying Qu
Neural Computing and Applications, 2022, 34 : 2765 - 2783
[16] Recent progress in leveraging deep learning methods for question answering
Hao, Tianyong
Li, Xinxin
He, Yulan
Wang, Fu Lee
Qu, Yingying
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04): : 2765 - 2783
[17] Research on Question-Answering System Based on Deep Learning
Song, Bo
Zhuo, Yue
Li, Xiaomei
ADVANCES IN SWARM INTELLIGENCE, ICSI 2018, PT II, 2018, 10942 : 522 - 529
[18] A Deep Learning Approach for Question Answering Over Knowledge Base
Wang, Linjie
Zhang, Yu
Liu, Ting
NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 885 - 892
[19] A review of deep learning in question answering over knowledge bases
Zhang, Chen
Lai, Yuxuan
Feng, Yansong
Zhao, Dongyan
AI OPEN, 2021, 2 : 205 - 215
[20] Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge Base
Shen, Tao
Geng, Xiubo
Qin, Tao
Guo, Daya
Tang, Duyu
Duan, Nan
Long, Guodong
Jiang, Daxin
2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2442 - 2451

← 1 2 3 4 5 →