Lessons from Natural Language Inference in the Clinical Domain

被引:0
|
作者
Romanov, Alexey [1 ,3 ]
Shivade, Chaitanya [2 ]
机构
[1] Univ Massachusetts Lowell, Dept Comp Sci, Lowell, MA 01854 USA
[2] IBM Almaden Res Ctr, 650 Harry Rd, San Jose, CA 95120 USA
[3] IBM Res, San Jose, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State of the art models using deep neural networks have become very good in learning an accurate mapping from inputs to outputs. However, they still lack generalization capabilities in conditions that differ from the ones encountered during training. This is even more challenging in specialized, and knowledge intensive domains, where training data is limited. To address this gap, we introduce MedNLI(1) - a dataset annotated by doctors, performing a natural language inference task (NLI), grounded in the medical history of patients. We present strategies to: 1) leverage transfer learning using datasets from the open domain, (e.g. SNLI) and 2) incorporate domain knowledge from external data and lexical sources (e.g. medical terminologies). Our results demonstrate performance gains using both strategies.
引用
收藏
页码:1586 / 1596
页数:11
相关论文
共 50 条
  • [41] Convolutional Interaction Network for Natural Language Inference
    Gong, Jingjing
    Qiu, Xipeng
    Chen, Xinchi
    Liang, Dong
    Huang, Xuanjing
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1576 - 1585
  • [42] FarsTail: a Persian natural language inference dataset
    Amirkhani, Hossein
    AzariJafari, Mohammad
    Faridan-Jahromi, Soroush
    Kouhkan, Zeinab
    Pourjafari, Zohreh
    Amirak, Azadeh
    SOFT COMPUTING, 2023,
  • [43] Natural Language Inference Based on Adversarial Regularization
    Liu G.-C.
    Cao Y.
    Xu J.-M.
    Xu B.
    Zidonghua Xuebao/Acta Automatica Sinica, 2019, 45 (08): : 1455 - 1463
  • [44] Adversarial Analysis of Natural Language Inference Systems
    Chien, Tiffany
    Kalita, Jugal
    2020 IEEE 14TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2020), 2020, : 1 - 8
  • [45] An inference model for semantic entailment in natural language
    Braz, Rodrigo de Salvo
    Girju, Roxana
    Punyakanok, Vasin
    Roth, Dan
    Sammons, Mark
    MACHINE LEARNING CHALLENGES: EVALUATING PREDICTIVE UNCERTAINTY VISUAL OBJECT CLASSIFICATION AND RECOGNIZING TEXTUAL ENTAILMENT, 2006, 3944 : 261 - 286
  • [46] Enhancing Generalization in Natural Language Inference by Syntax
    He, Qi
    Wang, Han
    Zhang, Yue
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4973 - 4978
  • [47] Marked Attribute Bias in Natural Language Inference
    Dawkins, Hillary
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4214 - 4226
  • [48] BASIC DEPENDENCY PARSING IN NATURAL LANGUAGE INFERENCE
    Yusuf, Aleshinloye Abass
    Nwojo, Nnanna Agwu
    Boukar, Moussa Mahamat
    2017 13TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTER AND COMPUTATION (ICECCO), 2017,
  • [49] Semantic Diversity in Dialogue with Natural Language Inference
    Stasaski, Katherine
    Hearst, Marti A.
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 85 - 98
  • [50] Identifying inherent disagreement in natural language inference
    Zhang, Xinliang Frederick
    de Marneffe, Marie-Catherine
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 4908 - 4915