Multilingual Indian COVID-19 Chatbot

Cited by: 0
Authors
Thara, S. [1]
Jyothiratnam [2]
Sonpole, Satya Harthik [1]
Inturi, Bhargav [1]
Krishna, Ajay [1]
Vuppala, Sahit [1]
Nedungadi, Prema [1]
Affiliations
[1] Amrita Vishwa Vidyapeetham, Dept Comp Sci & Engn, Amrita Sch Comp, Amritapuri, India
[2] Amrita Vishwa Vidyapeetham, Amrita CREATE, Amrita Sch Comp, Amritapuri, India
Keywords
BERT; BM25; Chatbot; COVID-19; GloVe embeddings; Healthcare; Multilingual; Natural language processing; SQuAD; SIF; TF-IDF
DOI
10.1007/978-981-97-1323-3_5
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Although the COVID-19 era has ended, many people are still troubled by lingering misconceptions about pre- and post-COVID effects. A chatbot is a computer program that simulates human conversation, communicating with users via text or voice in natural language. This paper presents the development of the MILIC-19 chatbot, which aims to provide a user-friendly interface for users in India, a country with many official languages. We have integrated the Google Translate API, allowing our chatbot to converse in 19 Indian languages. The MILIC-19 chatbot application responds to user queries by matching them against keywords in its database to retrieve an appropriate response. Question answering is handled in two layers, each serving a distinct purpose in providing the best results. The first layer is an information retrieval model that uses Term Frequency-Inverse Document Frequency (TF-IDF) and Smooth Inverse Frequency (SIF) to compute inverted indices, and Best Match (BM25) to retrieve the top 15 articles related to the query. The second layer is a Bio-BERT model trained on the Stanford Question Answering Dataset (SQuAD). This combination of techniques enables the model to provide relevant information effectively by prioritizing the most important documents based on the frequency of the query terms.
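The retrieval step of the first layer can be illustrated with a minimal sketch of BM25 ranking. This is not the authors' implementation: the abstract does not give the tokenizer, the parameter values, or how TF-IDF and SIF are combined with BM25, so the whitespace tokenizer, the function name `bm25_rank`, and the defaults `k1=1.5` and `b=0.75` are assumptions chosen for illustration. Only the top-15 cutoff comes from the abstract.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; the paper's tokenizer is not stated.
    return text.lower().split()


def bm25_rank(query, documents, top_k=15, k1=1.5, b=0.75):
    """Return up to top_k (doc_index, score) pairs, best match first.

    k1 and b are common BM25 defaults, assumed here; the abstract
    only states that the top 15 articles are retrieved.
    """
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    if n_docs == 0:
        return []
    avg_len = sum(len(tokens) for tokens in tokenized) / n_docs

    # Document frequency: number of documents containing each term.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    query_terms = tokenize(query)
    scored = []
    for idx, tokens in enumerate(tokenized):
        term_freq = Counter(tokens)
        score = 0.0
        for term in query_terms:
            if term not in term_freq:
                continue
            df = doc_freq[term]
            # Smoothed IDF that stays non-negative for very common terms.
            idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
            tf = term_freq[term]
            # Length normalization: longer documents are penalized via b.
            denom = tf + k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf * (k1 + 1) / denom
        scored.append((idx, score))

    # Sort by descending score; ties keep the lower document index first.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:top_k]


if __name__ == "__main__":
    articles = [
        "covid vaccine side effects fever",
        "symptoms of covid include fever and cough",
        "cricket match results today",
    ]
    for doc_index, score in bm25_rank("covid fever", articles, top_k=2):
        print(doc_index, round(score, 3), articles[doc_index])
```

In a pipeline like the one described, the passages returned by such a ranker would then be passed to the second-layer reader model, which extracts an answer span from them.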
Pages: 47-64
Page count: 18