DistCare: Distilling Knowledge from Publicly Available Online EMR Data to Emerging Epidemic for Prognosis

被引:12
|
作者
Ma, Liantao [1 ,3 ]
Ma, Xinyu [1 ,3 ]
Gao, Junyi [1 ]
Jiao, Xianfeng [1 ]
Yu, Zhihao [1 ]
Zhang, Chaohe [1 ,3 ]
Ruan, Wenjie [4 ]
Wang, Yasha [1 ,2 ]
Tang, Wen [5 ]
Wang, Jiangtao [6 ]
机构
[1] Minist Educ, Key Lab High Confidence Software Technol, Beijing, Peoples R China
[2] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing, Peoples R China
[3] Peking Univ, Sch Elect Engn & Comp Sci, Beijing, Peoples R China
[4] Univ Exeter, Dept Comp Sci, CEMPS, Exeter, Devon, England
[5] Peking Univ Third Hosp, Div Nephrol, Beijing, Peoples R China
[6] Coventry Univ, Ctr Intelligent Healthcare, Coventry, W Midlands, England
来源
PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021) | 2021年
基金
中国国家自然科学基金;
关键词
Electronic Medical Record; Healthcare Informatics; Prognosis; Transfer Learning; MORTALITY;
D O I
10.1145/3442381.3449855
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the characteristics of COVID-19, the epidemic develops rapidly and overwhelms health service systems worldwide. Many patients suffer from life-threatening systemic problems and need to be carefully monitored in ICUs. An intelligent prognosis can help physicians take an early intervention, prevent adverse outcomes, and optimize the medical resource allocation, which is urgently needed, especially in this ongoing global pandemic crisis. However, in the early stage of the epidemic outbreak, the data available for analysis is limited due to the lack of effective diagnostic mechanisms, the rarity of the cases, and privacy concerns. In this paper, we propose a distilled transfer learning framework, DistCare, which leverages the existing publicly available online Electronic Medical Records to enhance the prognosis for inpatients with emerging infectious diseases. It learns to embed the COVID-19-related medical features based on massive existing EMR data. The transferred parameters are further trained to imitate the teacher model's representation based on distillation, which embeds the health status more comprehensively on the source dataset. We conduct Length-of-Stay prediction experiments for patients in ICUs on real-world COVID-19 datasets. The experiment results indicate that our proposed model consistently outperforms competitive baseline methods. In order to further verify the scalability of DistCare to deal with different clinical tasks on different EMR datasets, we conduct an additional mortality prediction experiment on End-Stage Renal Disease datasets. The extensive experiments demonstrate that DistCare can benefit the prognosis for emerging pandemics and other diseases with limited EMR.
引用
收藏
页码:3558 / 3568
页数:11
相关论文
共 50 条
  • [1] New, publicly available flavonoid data products: Valuable resources for emerging science
    Sebastian, Rhonda S.
    Enns, Cecilia Wilkinson
    Goldman, Joseph D.
    Steinfeldt, Lois C.
    Martin, Carrie L.
    Clemens, John C.
    Murayi, Theophile
    Moshfegh, Alanna J.
    JOURNAL OF FOOD COMPOSITION AND ANALYSIS, 2017, 64 : 68 - 72
  • [2] INFERRING APP DEMAND FROM PUBLICLY AVAILABLE DATA
    Garg, Rajiv
    Telang, Rahul
    MIS QUARTERLY, 2013, 37 (04) : 1253 - 1264
  • [3] Knowledge Workers and Their Use of Publicly Available Online Services for Day-to-day Work
    Ferro, Toni
    Divine, Doug
    Zachry, Mark
    SIGDOC '12: PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON DESIGN OF COMMUNICATION, 2012, : 47 - 53
  • [4] PREDICTING EMERGING PRODUCT DESIGN TREND BY MINING PUBLICLY AVAILABLE CUSTOMER REVIEW DATA
    Tucker, Conrad
    Kim, Harrison M.
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON ENGINEERING DESIGN (ICED 11): IMPACTING SOCIETY THROUGH ENGINEERING DESIGN, VOL 6: DESIGN INFORMATION AND KNOWLEDGE, 2011, 6 : 43 - 52
  • [5] Inferring Urban Social Networks from Publicly Available Data
    Guarino, Stefano
    Mastrostefano, Enrico
    Bernaschi, Massimo
    Celestini, Alessandro
    Cianfriglia, Marco
    Torre, Davide
    Zastrow, Lena Rebecca
    FUTURE INTERNET, 2021, 13 (05):
  • [6] Approaching Generation Variable Costs From Publicly Available Data
    Marinho, Nuno
    Phulpin, Yannick
    Folliot, Damien
    Hennebel, Martin
    2016 13TH INTERNATIONAL CONFERENCE ON THE EUROPEAN ENERGY MARKET (EEM), 2016,
  • [7] Acquisition and Processing of Information from Slovak Publicly Available Data
    Hricova, Romana
    Adamcik, Stanislav
    INDUSTRY 4.0: TRENDS IN MANAGEMENT OF INTELLIGENT MANUFACTURING SYSTEMS, 2019, : 23 - 35
  • [8] Identification of tumour markers from publicly available gene expression data
    Dennis, JL
    Vass, JK
    Wit, EC
    Keith, WN
    Oien, KA
    JOURNAL OF PATHOLOGY, 2003, 201 : 7A - 7A
  • [9] Population-scale family trees from publicly available data
    Grant Otto
    Nature Reviews Genetics, 2018, 19 : 251 - 251
  • [10] Population-scale family trees from publicly available data
    Otto, Grant
    NATURE REVIEWS GENETICS, 2018, 19 (05) : 251 - 251