The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data

Cited by: 0
Authors
Ding, Daisy Yi [1]
Simpson, Chloe [1]
Pfohl, Stephen [1]
Kale, Dave C. [2]
Jung, Kenneth [1]
Shah, Nigam H. [1]
Affiliations
[1] Stanford Univ, Stanford Ctr Biomed Informat Res, Stanford, CA 94305 USA
[2] Univ Southern Calif, Inst Informat Sci, Marina Del Rey, CA 90292 USA
Keywords
Electronic Health Records; Electronic phenotyping algorithms; Deep learning; Multi-task learning; ALGORITHMS;
DOI
Not available
Chinese Library Classification
Q [Biological Sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract
Electronic phenotyping is the task of ascertaining whether an individual has a medical condition of interest by analyzing their medical record and is foundational in clinical informatics. Increasingly, electronic phenotyping is performed via supervised learning. We investigate the effectiveness of multitask learning for phenotyping using electronic health records (EHR) data. Multitask learning aims to improve model performance on a target task by jointly learning additional auxiliary tasks and has been used in disparate areas of machine learning. However, its utility when applied to EHR data has not been established, and prior work suggests that its benefits are inconsistent. We present experiments that elucidate when multitask learning with neural nets improves performance for phenotyping using EHR data relative to neural nets trained for a single phenotype and to well-tuned baselines. We find that multitask neural nets consistently outperform single-task neural nets for rare phenotypes but underperform for relatively more common phenotypes. The effect size increases as more auxiliary tasks are added. Moreover, multitask learning reduces the sensitivity of neural nets to hyperparameter settings for rare phenotypes. Last, we quantify phenotype complexity and find that neural nets trained with or without multitask learning do not improve on simple baselines unless the phenotypes are sufficiently complex.
Pages: 18-29
Number of pages: 12
Related papers
50 in total
  • [1] Applying active learning to high-throughput phenotyping algorithms for electronic health records data
    Chen, Yukun
    Carroll, Robert J.
    Hinz, Eugenia R. McPeek
    Shah, Anushi
    Eyler, Anne E.
    Denny, Joshua C.
    Xu, Hua
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (E2) : E253 - E259
  • [2] Interpretable Phenotyping for Electronic Health Records
    Allen, Christine
    Hu, Juhua
    Kumar, Vikas
    Ahmad, Muhammad Aurangzeb
    Teredesai, Ankur
    [J]. 2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021), 2021 : 161 - 170
  • [3] Machine learning approaches for electronic health records phenotyping: a methodical review
    Yang, Siyue
    Varghese, Paul
    Stephenson, Ellen
    Tu, Karen
    Gronsbell, Jessica
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (02) : 367 - 381
  • [4] Combining deep learning with token selection for patient phenotyping from electronic health records
    Yang, Zhen
    Dehmer, Matthias
    Yli-Harja, Olli
    Emmert-Streib, Frank
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [5] Combining deep learning with token selection for patient phenotyping from electronic health records
    Zhen Yang
    Matthias Dehmer
    Olli Yli-Harja
    Frank Emmert-Streib
    [J]. Scientific Reports, 10
  • [6] The use of electronic health records for psychiatric phenotyping and genomics
    Smoller, Jordan W.
    [J]. AMERICAN JOURNAL OF MEDICAL GENETICS PART B-NEUROPSYCHIATRIC GENETICS, 2018, 177 (07) : 601 - 612
  • [7] Next-generation phenotyping of electronic health records
    Hripcsak, George
    Albers, David J.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (01) : 117 - 121
  • [8] Importance of Health Information Technology, Electronic Health Records, and Continuously Aggregating Data to Comparative Effectiveness Research and Learning Health Care
    Miriovsky, Benjamin J.
    Shulman, Lawrence N.
    Abernethy, Amy P.
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2012, 30 (34) : 4243 - 4248
  • [9] Learning from heterogeneous temporal data in electronic health records
    Zhao, Jing
    Papapetrou, Panagiotis
    Asker, Lars
    Bostrom, Henrik
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 65 : 105 - 119
  • [10] High Throughput Phenotyping for Dimensional Psychopathology in Electronic Health Records
    McCoy, Thomas H., Jr.
    Yu, Sheng
    Hart, Kamber L.
    Castro, Victor M.
    Brown, Hannah E.
    Rosenquist, James N.
    Doyle, Alysa E.
    Vuijk, Pieter J.
    Cai, Tianxi
    Perlis, Roy H.
    [J]. BIOLOGICAL PSYCHIATRY, 2018, 83 (12) : 997 - 1004