DICE: Data-Efficient Clinical Event Extraction with Generative Models

Cited by: 0
Authors
Ma, Mingyu Derek [1]
Taylor, Alexander K. [1]
Wang, Wei [1]
Peng, Nanyun [1]
Affiliations
[1] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Event extraction for the clinical domain is an under-explored research area. The lack of training data along with the high volume of domain-specific terminologies with vague entity boundaries makes the task especially challenging. In this paper, we introduce DICE, a robust and data-efficient generative model for clinical event extraction. DICE frames event extraction as a conditional generation problem and introduces a contrastive learning objective to accurately decide the boundaries of biomedical mentions. DICE also trains an auxiliary mention identification task jointly with event extraction tasks to better identify entity mention boundaries, and further introduces special markers to incorporate identified entity mentions as trigger and argument candidates for their respective tasks. To benchmark clinical event extraction, we compose MACCROBAT-EE, the first clinical event extraction dataset with argument annotation, based on an existing clinical information extraction dataset, MACCROBAT (Caufield et al., 2019). Our experiments demonstrate state-of-the-art performances of DICE for clinical and news domain event extraction, especially under low data settings.
Pages: 15898-15917
Page count: 20
Related papers
50 in total
  • [41] Data-Efficient and Interpretable Tabular Anomaly Detection
    Chang, Chun-Hao
    Yoon, Jinsung
    Arik, Sercan O.
    Udell, Madeleine
    Pfister, Tomas
    [J]. PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 190 - 201
  • [42] Elliptic PDE learning is provably data-efficient
    Boulle, Nicolas
    Halikias, Diana
    Townsend, Alex
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (39)
  • [43] Data-Efficient Control Barrier Function Refinement
    Dai, Bolun
    Huang, Heming
    Krishnamurthy, Prashanth
    Khorrami, Farshad
    [J]. 2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 3675 - 3680
  • [44] Data-Efficient Augmentation for Training Neural Networks
    Liu, Tian Yu
    Mirzasoleiman, Baharan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [45] Data-Efficient Student Profiling in Online Courses
    Fenu, Gianni
    Galici, Roberta
    Marras, Mirko
    [J]. ARTIFICIAL INTELLIGENCE WITH AND FOR LEARNING SCIENCES, WAILS 2024, 2024, 14545 : 11 - 20
  • [46] Data-Efficient Sensitivity Analysis with Surrogate Modeling
    Van Steenkiste, Tom
    van der Herten, Joachim
    Couckuyt, Ivo
    Dhaene, Tom
    [J]. UNCERTAINTY MODELING FOR ENGINEERING APPLICATIONS, 2019, : 55 - 69
  • [47] Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation
    Sperber, Matthias
    Neubig, Graham
    Niehues, Jan
    Waibel, Alex
    [J]. Transactions of the Association for Computational Linguistics, 2019, 7 : 313 - 325
  • [48] CONVERTING DISCRETE EVENT SIMULATION NETWORKS (DESNETS) INTO DICE MODELS
    Yago, C. M.
    Diez, F. J.
    [J]. VALUE IN HEALTH, 2022, 25 (07) : S337 - S337
  • [49] Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data
    Nie, Allen
    Flet-Berliac, Yannis
    Jordan, Deon R.
    Steenbergen, William
    Brunskill, Emma
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [50] Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation
    Sperber, Matthias
    Neubig, Graham
    Niehues, Jan
    Waibel, Alex
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2019, 7 : 313 - 325