DICE: Data-Efficient Clinical Event Extraction with Generative Models

Authors
Ma, Mingyu Derek [1]
Taylor, Alexander K. [1]
Wang, Wei [1]
Peng, Nanyun [1]
Affiliations
[1] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Event extraction for the clinical domain is an under-explored research area. The lack of training data, together with the large volume of domain-specific terminology with vague entity boundaries, makes the task especially challenging. In this paper, we introduce DICE, a robust and data-efficient generative model for clinical event extraction. DICE frames event extraction as a conditional generation problem and introduces a contrastive learning objective to accurately decide the boundaries of biomedical mentions. DICE also trains an auxiliary mention identification task jointly with the event extraction tasks to better identify entity mention boundaries, and further introduces special markers to incorporate identified entity mentions as trigger and argument candidates for their respective tasks. To benchmark clinical event extraction, we compose MACCROBAT-EE, the first clinical event extraction dataset with argument annotation, based on an existing clinical information extraction dataset, MACCROBAT (Caufield et al., 2019). Our experiments demonstrate state-of-the-art performance of DICE for clinical and news domain event extraction, especially under low-data settings.
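The marker idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the marker strings `<ent>` and `</ent>`, the prompt layout in `build_input`, the example sentence, and the event type label are all hypothetical, chosen only to show how identified mentions might be wrapped in special tokens before the text is passed to a conditional generation model.

```python
from typing import List, Tuple

# Hypothetical marker strings; the abstract does not specify the actual tokens.
OPEN, CLOSE = "<ent>", "</ent>"


def mark_mentions(tokens: List[str], spans: List[Tuple[int, int]]) -> str:
    """Wrap each (start, end) token span (end exclusive) in marker tokens."""
    starts = {s for s, _ in spans}
    ends = {e for _, e in spans}
    out: List[str] = []
    for i, tok in enumerate(tokens):
        if i in starts:
            out.append(OPEN)
        out.append(tok)
        if i + 1 in ends:
            out.append(CLOSE)
    return " ".join(out)


def build_input(passage: str, event_type: str) -> str:
    """Prepend an assumed task prompt so a seq2seq model can condition on it."""
    return f"event type: {event_type} | passage: {passage}"


# Illustrative sentence and mention spans (not taken from MACCROBAT-EE).
tokens = "The patient developed severe chest pain".split()
marked = mark_mentions(tokens, [(1, 2), (3, 6)])
model_input = build_input(marked, "Sign_symptom")
print(model_input)
```

In this sketch the marked mentions act as candidates that a generative model could copy from when producing triggers or arguments; how DICE actually encodes and consumes the markers is described in the paper itself.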
Pages: 15898-15917
Page count: 20