A code-mixed task-oriented dialog dataset for medical domain

被引:4
|
作者
Dowlagar, Suman [1 ]
Mamidi, Radhika [1 ]
机构
[1] Int Inst Informat Technol, Language Technol Res Ctr, Hyderabad 506002, Telangana, India
来源
关键词
Code-mixed; Dialog dataset; Medical domain; Task oriented; LANGUAGE; COMMUNICATION; NETWORKS; SYSTEMS;
D O I
10.1016/j.csl.2022.101449
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the healthcare domain, medical and patient interactions form a crucial part of the diagnosis. Initially, the AI models developed for healthcare centered only on monolingual data. However, such models do not cater to the multilingual regions, where most conversations are Code-Mixed. We present the Code-Mixed Medical Task-Oriented Dialog Dataset to facilitate the research and development of Code-Mixed medical dialog systems. We analyzed the dataset using medical, conversational, and linguistic theories. The dataset contains 3005 Telugu-English Code-Mixed dialogs between patients and doctors with 29 k utterances covering ten specializations with an average code-mixing index (CMI) of 33.3%. We manually annotated the conversational dataset with intents and slot labels. We also present baselines to establish benchmarks on the dataset using existing state-of-the-art Natural Language Understanding (NLU) models. We improved the existing baselines using contextual ground truth intent labels and processing the slots as chunks. The data is made publically available.1
引用
收藏
页数:34
相关论文
共 50 条
  • [41] Few-shot Natural Language Generation for Task-Oriented Dialog
    Peng, Baolin
    Zhu, Chenguang
    Li, Chunyuan
    Li, Xiujun
    Li, Jinchao
    Zeng, Michael
    Gao, Jianfeng
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 172 - 182
  • [42] Continual Learning for Natural Language Generation in Task-oriented Dialog Systems
    Mi, Fei
    Chen, Liangwei
    Zhao, Mengjie
    Huang, Minlie
    Faltings, Boi
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
  • [43] Non-Autoregressive Semantic Parsing for Compositional Task-Oriented Dialog
    Babu, Arun
    Shrivastava, Akshat
    Aghajanyan, Armen
    Aly, Ahmed
    Fan, Angela
    Ghazvininejad, Marjan
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 2969 - 2978
  • [44] Scheduled Dialog Policy Learning: An Automatic Curriculum Learning Framework for Task-oriented Dialog System
    Liu, Sihong
    Zhang, Jinchao
    He, Keqing
    Xu, Weiran
    Zhou, Jie
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1091 - 1102
  • [45] Conversation Learner - A Machine Teaching Tool for Building Dialog Managers for Task-Oriented Dialog Systems
    Shukla, Swadheen
    Liden, Lars
    Shayandeh, Shahin
    Kamal, Eslam
    Li, Jinchao
    Mazzola, Matt
    Park, Thomas
    Peng, Baolin
    Gao, Jianfeng
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): SYSTEM DEMONSTRATIONS, 2020, : 343 - 349
  • [46] Multi3 WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems
    Hu, Songbo
    Zhou, Han
    Hergul, Mete
    Gritta, Milan
    Zhang, Guchun
    Iacobacci, Ignacio
    Vulic, Ivan
    Korhonen, Anna
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2023, 11 : 1396 - 1415
  • [47] MatDC: A Multi-turn Multi-domain Annotated Task-oriented Dialogue Dataset in Chinese
    Tseng, Yu-Hsiang
    Hsieh, Shu-Kai
    Lian, Richard
    Chiang, Chiung-Yu
    Chang, Yu-Lin
    Chang, Li-Ping
    Hsieh, Ji-Lung
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2020), 2020, : 165 - 170
  • [48] Disentangling Task-Oriented Representations for Unsupervised Domain Adaptation
    Dai, Pingyang
    Chen, Peixian
    Wu, Qiong
    Hong, Xiaopeng
    Ye, Qixiang
    Tian, Qi
    Chia-Wen Lin
    Ji, Rongrong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1012 - 1026
  • [49] ToAlign: Task-oriented Alignment for Unsupervised Domain Adaptation
    Wei, Guoqiang
    Lan, Cuiling
    Zeng, Wenjun
    Zhang, Zhizheng
    Chen, Zhibo
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [50] Study on a method for task-oriented domain knowledge push
    Wang, Jun
    You, Weijia
    Sun, Weiliang
    [J]. 2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, : 5329 - 5332