A code-mixed task-oriented dialog dataset for medical domain

被引:4
|
作者
Dowlagar, Suman [1 ]
Mamidi, Radhika [1 ]
机构
[1] Int Inst Informat Technol, Language Technol Res Ctr, Hyderabad 506002, Telangana, India
来源
关键词
Code-mixed; Dialog dataset; Medical domain; Task oriented; LANGUAGE; COMMUNICATION; NETWORKS; SYSTEMS;
D O I
10.1016/j.csl.2022.101449
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the healthcare domain, medical and patient interactions form a crucial part of the diagnosis. Initially, the AI models developed for healthcare centered only on monolingual data. However, such models do not cater to the multilingual regions, where most conversations are Code-Mixed. We present the Code-Mixed Medical Task-Oriented Dialog Dataset to facilitate the research and development of Code-Mixed medical dialog systems. We analyzed the dataset using medical, conversational, and linguistic theories. The dataset contains 3005 Telugu-English Code-Mixed dialogs between patients and doctors with 29 k utterances covering ten specializations with an average code-mixing index (CMI) of 33.3%. We manually annotated the conversational dataset with intents and slot labels. We also present baselines to establish benchmarks on the dataset using existing state-of-the-art Natural Language Understanding (NLU) models. We improved the existing baselines using contextual ground truth intent labels and processing the slots as chunks. The data is made publically available.1
引用
收藏
页数:34
相关论文
共 50 条
  • [11] Robustness Testing of Language Understanding in Task-Oriented Dialog
    Liu, Jiexi
    Takanobui, Ryuichi
    Wen, Jiaxin
    Wan, Dazhen
    Li, Hongguang
    Nie, Weiran
    Li, Cheng
    Peng, Wei
    Huang, Minlie
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 2467 - 2480
  • [12] Novel Feature Discovery for Task-Oriented Dialog Systems
    Ho, Vinh Thinh
    Soliman, Mohamed
    Abujabal, Abdalghani
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 782 - 792
  • [13] Recent advances and challenges in task-oriented dialog systems
    Zheng Zhang
    Ryuichi Takanobu
    Qi Zhu
    MinLie Huang
    XiaoYan Zhu
    [J]. Science China Technological Sciences, 2020, 63 : 2011 - 2027
  • [14] Accelerating Natural Language Understanding in Task-Oriented Dialog
    Ahuja, Ojas
    Desai, Shrey
    [J]. NLP FOR CONVERSATIONAL AI, 2020, : 46 - 53
  • [15] Recent advances and challenges in task-oriented dialog systems
    ZHANG Zheng
    TAKANOBU Ryuichi
    ZHU Qi
    HUANG MinLie
    ZHU XiaoYan
    [J]. Science China(Technological Sciences), 2020, (10) - 2027
  • [16] Recent advances and challenges in task-oriented dialog systems
    ZHANG Zheng
    TAKANOBU Ryuichi
    ZHU Qi
    HUANG MinLie
    ZHU XiaoYan
    [J]. Science China Technological Sciences, 2020, 63 (10) : 2011 - 2027
  • [17] Adversarial Learning of Task-Oriented Neural Dialog Models
    Liu, Bing
    Lane, Ian
    [J]. 19TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2018), 2018, : 350 - 359
  • [18] Polite Task-oriented Dialog Agents: To Generate or to Rewrite?
    Silva, Diogo
    Semedo, David
    Magalhaes, Joao
    [J]. PROCEEDINGS OF THE 12TH WORKSHOP ON COMPUTATIONAL APPROACHES TO SUBJECTIVITY, SENTIMENT & SOCIAL MEDIA ANALYSIS, 2022, : 304 - 314
  • [19] Recent advances and challenges in task-oriented dialog systems
    Zhang, Zheng
    Takanobu, Ryuichi
    Zhu, Qi
    Huang, MinLie
    Zhu, XiaoYan
    [J]. SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2020, 63 (10) : 2011 - 2027
  • [20] Task-Oriented Dialog Generation with Enhanced Entity Representation
    He, Zhenhao
    Wang, Jiachun
    Chen, Jian
    [J]. INTERSPEECH 2020, 2020, : 3905 - 3909