A Natural Language Processing Model for the Development of an Italian-Language Chatbot for Public Administration

被引:0
|
作者
Piizzi, Antonio [1 ]
Vavallo, Donatello [1 ]
Lazzo, Gaetano [1 ]
Dimola, Saverio [1 ]
Zazzera, Elvira [2 ]
机构
[1] Tempo SRL, Bari, Italy
[2] Kad3 SRL, Fasano, Italy
关键词
-Natural Language Processing; chatbot; BERT; transformer; Italian language;
D O I
10.14569/IJACSA.2024.0150906
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Natural Language Processing models (NLP) are used in chatbots to understand user input, interpret its meaning, and generate conversational responses to provide immediate and consistent assistance. This reduces problem-solving time and staff workload and increases user satisfaction. There are both rule- based chatbots, which use decision trees and are programmed to answer specific questions, and self-learning chatbots, which can handle more complex conversations through continuous learning about data and user interactions. However, only a few chatbots have been developed specifically for the Italian language. T he development of chatbots for Public Administration (PA) in the Italian language presents unique challenges, particularly in creating models that can accurately understand and respond to user queries based on complex, context-specific documents. This paper proposes a novel natural language processing (NLP) model tailored to the Italian language, designed to support the development of an advanced Question Answering (QA) chatbot for PA. The core of the proposed model is based on the BERT (Bidirectional Encoder Representations from Transformers) architecture, enhanced with an encoder/decoder module and a highway network module to improve the filtering and processing of input text. The principal aim of this research is to address the gap in Italian-language NLP models by providing a robust solution capable of handling the intricacies of the Italian language within the context of PA. The model is trained and evaluated using the Italian version of the Stanford Question Answering Dataset (SQuAD-IT). Experimental results demonstrate that the proposed model outperforms existing models such as BIDAF in terms of F1-score and Exact Match (EM), indicating its superior ability to provide precise and accurate answers. The comparative analysis highlights a significant performance improvement, with the proposed model achieving an F1-score of 59.41% and an EM of 46.24%, compared to 49.35% and 38.43%, respectively, for BIDAF. The findings suggest that the proposed model offers substantial benefits in terms of accuracy and efficiency for PA applications.
引用
收藏
页码:54 / 58
页数:5
相关论文
共 50 条
  • [41] Natural Language Processing Pretraining Language Model for Computer Intelligent Recognition Technology
    Dong, Jun
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (08)
  • [42] Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing
    Gu Y.
    Tinn R.
    Cheng H.
    Lucas M.
    Usuyama N.
    Liu X.
    Naumann T.
    Gao J.
    Poon H.
    ACM Transactions on Computing for Healthcare, 2022, 3 (01):
  • [43] Benchmarking Large Language Model Performance on Natural Language Processing Tasks for Pharmacoepidemiology
    Feng, Hui
    Ronzano, Francesco
    LaFleur, JuDe
    Garber, Matthew L.
    de Oliveira, Rodrigo
    Roth, Katharine
    Rough, Kathryn
    Nanavati, Jay
    El Abidine, Khaldoun Zine
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2024, 33 : 70 - 70
  • [44] Language Invariant Properties in Natural Language Processing
    Bianchi, Federico
    Nozza, Debora
    Hovy, Dirk
    PROCEEDINGS OF THE FIRST WORKSHOP ON EFFICIENT BENCHMARKING IN NLP (NLP POWER 2022), 2022, : 84 - 92
  • [45] Natural language processing as human language engineering
    Di Felippo, Ariani
    Dias-da-Silva, Bento Carlos
    CALIDOSCOPIO, 2009, 7 (03): : 183 - 191
  • [46] Online Natural Language Processing of the Slovak Language
    Hladek, Daniel
    Ondas, Stanislav
    Stas, Jan
    2014 5TH IEEE CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2014, : 315 - 316
  • [47] Natural Language Processing for Dialects of a Language: A Survey
    Joshi, Aditya
    Dabre, Raj
    Kanojia, Diptesh
    Li, Zhuang
    Zhan, Haolan
    Haffari, Gholamreza
    Dippold, Doris
    ACM COMPUTING SURVEYS, 2025, 57 (06)
  • [48] Humanizing the Chatbot with Semantics based Natural Language Generation
    Virkar, Mayuresh
    Honmane, Vikas
    Rao, S. Upendra
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 891 - 894
  • [49] The Growing Impact of Natural Language Processing in Healthcare and Public Health
    Jerfy, Aadit
    Selden, Owen
    Balkrishnan, Rajesh
    INQUIRY-THE JOURNAL OF HEALTH CARE ORGANIZATION PROVISION AND FINANCING, 2024, 61
  • [50] Parsing Platforms: Natural Language Processing and Public Mental Health
    Wiederhold, Brenda K.
    CYBERPSYCHOLOGY BEHAVIOR AND SOCIAL NETWORKING, 2024, 27 (08) : 521 - 523