Data-driven analysis to understand long COVID using electronic health records from the RECOVER initiative

被引:16
|
作者
Zang, Chengxi [1 ]
Zhang, Yongkang [1 ]
Xu, Jie [2 ]
Bian, Jiang [2 ]
Morozyuk, Dmitry [1 ]
Schenck, Edward J. [3 ]
Khullar, Dhruv [1 ]
Nordvig, Anna S. [4 ]
Shenkman, Elizabeth A. [2 ]
Rothman, Russell L. [5 ]
Block, Jason P. [6 ]
Lyman, Kristin [7 ]
Weiner, Mark G. [1 ]
Carton, Thomas W. [7 ]
Wang, Fei [1 ]
Kaushal, Rainu [1 ]
机构
[1] Weill Cornell Med, Dept Populat Hlth Sci, New York, NY 10065 USA
[2] Univ Florida, Dept Hlth Outcomes Biomed Informat, Gainesville, FL USA
[3] Weill Cornell Med, Dept Med, Div Pulm & Crit Care Med, New York, NY USA
[4] Weill Cornell Med, Dept Neurol, New York, NY USA
[5] Vanderbilt Univ, Ctr Hlth Serv Res, Med Ctr, Nashville, TN USA
[6] Harvard Med Sch, Harvard Pilgrim Hlth Care Inst, Dept Populat Med, Boston, MA USA
[7] Louisiana Publ Hlth Inst, New Orleans, LA USA
基金
美国国家卫生研究院;
关键词
D O I
10.1038/s41467-023-37653-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this study, the authors characterise post-acute sequelae of SARS-CoV-2 (PASC) in two large cohorts based on electronic health records from the USA. They identify a broad range of PASC-related conditions which were only partially replicated across the two cohorts, indicating possible heterogeneity between populations. Recent studies have investigated post-acute sequelae of SARS-CoV-2 infection (PASC, or long COVID) using real-world patient data such as electronic health records (EHR). Prior studies have typically been conducted on patient cohorts with specific patient populations which makes their generalizability unclear. This study aims to characterize PASC using the EHR data warehouses from two large Patient-Centered Clinical Research Networks (PCORnet), INSIGHT and OneFlorida+, which include 11 million patients in New York City (NYC) area and 16.8 million patients in Florida respectively. With a high-throughput screening pipeline based on propensity score and inverse probability of treatment weighting, we identified a broad list of diagnoses and medications which exhibited significantly higher incidence risk for patients 30-180 days after the laboratory-confirmed SARS-CoV-2 infection compared to non-infected patients. We identified more PASC diagnoses in NYC than in Florida regarding our screening criteria, and conditions including dementia, hair loss, pressure ulcers, pulmonary fibrosis, dyspnea, pulmonary embolism, chest pain, abnormal heartbeat, malaise, and fatigue, were replicated across both cohorts. Our analyses highlight potentially heterogeneous risks of PASC in different populations.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Finding Long-COVID: temporal topic modeling of electronic health records from the N3C and RECOVER programs
    O'Neil, Shawn T.
    Madlock-Brown, Charisse
    Wilkins, Kenneth J.
    McGrath, Brenda M.
    Davis, Hannah E.
    Assaf, Gina S.
    Wei, Hannah
    Zareie, Parya
    French, Evan T.
    Loomba, Johanna
    McMurry, Julie A.
    Zhou, Andrea
    Chute, Christopher G.
    Moffitt, Richard A.
    Pfaff, Emily R.
    Yoo, Yun Jae
    Leese, Peter
    Chew, Robert F.
    Lieberman, Michael
    Haendel, Melissa A.
    NPJ DIGITAL MEDICINE, 2024, 7 (01):
  • [22] Machine learning identifies long COVID patterns from electronic health records
    Wang, Fei
    NATURE MEDICINE, 2023, 29 (01) : 47 - 48
  • [24] A Data-Driven Approach to Support the Understanding and Improvement of Patients' Journeys: A Case Study Using Electronic Health Records of an Emergency Department
    Rismanchian, Farhood
    Kassani, Sara Hosseinzadeh
    Shavarani, Seyed Mahdi
    Lee, Young Hoon
    VALUE IN HEALTH, 2023, 26 (01) : 18 - 27
  • [25] Developing data-driven clinical pathways using electronic health records: The cases of total laparoscopic hysterectomy and rotator cuff tears
    Cho, Minsu
    Kim, Kidong
    Lim, Jungeun
    Baek, Hyunyoung
    Kim, Seok
    Hwang, Hee
    Song, Minseok
    Yoo, Sooyoung
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2020, 133
  • [26] The Medical Informatics Initiative as a catalyst for data-driven health research in Germany
    Sedlmayr, Martin
    Semler, Sebastian Claudius
    BUNDESGESUNDHEITSBLATT-GESUNDHEITSFORSCHUNG-GESUNDHEITSSCHUTZ, 2024, 67 (06) : 613 - 615
  • [27] Data-driven identification of indication for treatment in electronic medical records using cluster analysis in combination with a self-controlled cohort analysis
    Bergvall, Tomas
    Grundmark, Birgitta
    Bourke, Alison
    Noren, Niklas
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2019, 28 : 202 - 202
  • [28] DensityTransfer: A Data Driven Approach for Imputing Electronic Health Records
    Wang, Fei
    Zhou, Jiayu
    Hu, Jianying
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2763 - 2768
  • [29] CARDIOVASCULAR DISEASE RESEARCH BY USING DATA FROM ELECTRONIC HEALTH RECORDS
    Majnaric, L.
    Sabanovic, S.
    ATHEROSCLEROSIS, 2016, 252 : E41 - E41
  • [30] A data-driven analysis of the aviation recovery from the COVID-19 pandemic
    Sun, Xiaoqian
    Wandelt, Sebastian
    Zhang, Anming
    JOURNAL OF AIR TRANSPORT MANAGEMENT, 2023, 109