A Natural Language Processing Model for the Development of an Italian-Language Chatbot for Public Administration

Cited by: 0

Authors:

Piizzi, Antonio [1]

Vavallo, Donatello [1]

Lazzo, Gaetano [1]

Dimola, Saverio [1]

Zazzera, Elvira [2]

Affiliations:

[1] Tempo SRL, Bari, Italy

[2] Kad3 SRL, Fasano, Italy

Source:

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS | 2024 / Vol. 15 / No. 09

Keywords:

Natural Language Processing; chatbot; BERT; transformer; Italian language

DOI:

10.14569/IJACSA.2024.0150906

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

Natural Language Processing (NLP) models are used in chatbots to understand user input, interpret its meaning, and generate conversational responses that provide immediate and consistent assistance. This reduces problem-solving time and staff workload and increases user satisfaction. Chatbots are either rule-based, using decision trees and programmed to answer specific questions, or self-learning, handling more complex conversations through continuous learning from data and user interactions. However, only a few chatbots have been developed specifically for the Italian language. The development of chatbots for Public Administration (PA) in the Italian language presents unique challenges, particularly in creating models that can accurately understand and respond to user queries based on complex, context-specific documents. This paper proposes a novel NLP model tailored to the Italian language, designed to support the development of an advanced Question Answering (QA) chatbot for PA. The core of the proposed model is based on the BERT (Bidirectional Encoder Representations from Transformers) architecture, enhanced with an encoder/decoder module and a highway network module to improve the filtering and processing of input text. The principal aim of this research is to address the gap in Italian-language NLP models by providing a robust solution capable of handling the intricacies of the Italian language within the context of PA. The model is trained and evaluated using the Italian version of the Stanford Question Answering Dataset (SQuAD-IT). Experimental results demonstrate that the proposed model outperforms existing models such as BIDAF in terms of F1-score and Exact Match (EM), indicating its superior ability to provide precise and accurate answers. The comparative analysis highlights a significant performance improvement, with the proposed model achieving an F1-score of 59.41% and an EM of 46.24%, compared to 49.35% and 38.43%, respectively, for BIDAF. The findings suggest that the proposed model offers substantial benefits in terms of accuracy and efficiency for PA applications.
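The abstract reports results as Exact Match (EM) and F1-score, the two metrics commonly used for SQuAD-style question answering. The following is a minimal sketch of how these metrics are typically computed for a single prediction; it is not the authors' evaluation code. The function names and the normalization steps (lowercasing and punctuation stripping) are assumptions for illustration, and the sketch omits the article removal performed by the English SQuAD evaluation script, since the appropriate handling for Italian is not stated in the abstract.

```python
import string
from collections import Counter


def normalize(text: str) -> list[str]:
    """Lowercase, drop ASCII punctuation, and split on whitespace (assumed normalization)."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    return text.split()


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized token sequences are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1: harmonic mean of precision and recall over shared tokens."""
    pred_tokens = normalize(prediction)
    ref_tokens = normalize(reference)
    if not pred_tokens or not ref_tokens:
        # If either side is empty, score 1.0 only when both are empty.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often as it appears in both.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Hypothetical example pair, not taken from SQuAD-IT.
    print(exact_match("Il Comune di Bari", "il comune di Bari."))
    print(f1_score("Comune di Bari", "il Comune di Bari"))
```

Dataset-level scores such as those quoted in the abstract are usually obtained by averaging these per-question values (taking the maximum over the reference answers when a question has several) and expressing the result as a percentage.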


Pages: 54-58

Number of pages: 5
