Lessons from Natural Language Inference in the Clinical Domain

被引:0
|
作者
Romanov, Alexey [1 ,3 ]
Shivade, Chaitanya [2 ]
机构
[1] Univ Massachusetts Lowell, Dept Comp Sci, Lowell, MA 01854 USA
[2] IBM Almaden Res Ctr, 650 Harry Rd, San Jose, CA 95120 USA
[3] IBM Res, San Jose, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State of the art models using deep neural networks have become very good in learning an accurate mapping from inputs to outputs. However, they still lack generalization capabilities in conditions that differ from the ones encountered during training. This is even more challenging in specialized, and knowledge intensive domains, where training data is limited. To address this gap, we introduce MedNLI(1) - a dataset annotated by doctors, performing a natural language inference task (NLI), grounded in the medical history of patients. We present strategies to: 1) leverage transfer learning using datasets from the open domain, (e.g. SNLI) and 2) incorporate domain knowledge from external data and lexical sources (e.g. medical terminologies). Our results demonstrate performance gains using both strategies.
引用
收藏
页码:1586 / 1596
页数:11
相关论文
共 50 条
  • [31] IndoNLI: A Natural Language Inference Dataset for Indonesian
    Mahendra, Rahmad
    Aji, Alham Fikri
    Louvan, Samuel
    Rahman, Fahrurrozi
    Vania, Clara
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 10511 - 10527
  • [32] OCNLI: Original Chinese Natural Language Inference
    Hu, Hai
    Richardson, Kyle
    Xu, Liang
    Li, Lu
    Kubler, Sandra
    Moss, Lawrence S.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
  • [33] Mixture of Prompt Experts for Natural Language Inference
    Zheng, Ziou
    Zhu, Xiaodan
    2024 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE 2024, 2024, : 43 - 48
  • [34] An Exploration of Dropout with RNNs for Natural Language Inference
    Gajbhiye, Amit
    Jaf, Sardar
    Al Moubayed, Noura
    McGough, A. Stephen
    Bradley, Steven
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 157 - 167
  • [35] Investigating Reasons for Disagreement in Natural Language Inference
    Jiang, Nan-Jiang
    de Marneffe, Marie-Catherine
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 1357 - 1374
  • [36] Natural language as the basis for meaning representation and inference
    Dagan, Ido
    Bar-Haim, Roy
    Szpektor, Idan
    Greental, Iddo
    Shnarchl, Eyal
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2008, 4919 : 151 - +
  • [37] Data and Representation for Turkish Natural Language Inference
    Budur, Emrah
    Ozcelik, Riza
    Gungor, Tunga
    Potts, Christopher
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8253 - 8267
  • [38] Disentangling Reasoning Factors for Natural Language Inference
    Zhou, Xixi
    Zeng, Limin
    Zhao, Ziping
    Bu, Jiajun
    Liang, Wenjie
    Wang, Haishuai
    BIG DATA MINING AND ANALYTICS, 2025, 8 (03): : 694 - 711
  • [39] Syntactic Knowledge for Natural Language Inference in Portuguese
    Fonseca, Erick
    Aluisio, Sandra M.
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 242 - 252
  • [40] Domain Specific Query Generation from Natural Language Text
    Iftikhar, Anum
    Iftikhar, Erum
    Mehmood, Muhammad Khalid
    2016 SIXTH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH), 2016, : 502 - 506