Cluster analysis and visualisation of electronic health records data to identify undiagnosed patients with rare genetic diseases

被引:3
|
作者
Moynihan, Daniel [1 ]
Monaco, Sean [2 ]
Ting, Teck Wah [3 ,4 ]
Narasimhalu, Kaavya [4 ,5 ]
Hsieh, Jenny [4 ,6 ]
Kam, Sylvia [3 ,4 ]
Lim, Jiin Ying [3 ,4 ]
Lim, Weng Khong [4 ,7 ,8 ,9 ]
Davila, Sonia [4 ,7 ]
Bylstra, Yasmin [4 ,7 ]
Balakrishnan, Iswaree Devi [4 ,10 ]
Heng, Mark [11 ]
Chia, Elian [11 ]
Yeo, Khung Keong [10 ]
Goh, Bee Keow [12 ]
Gupta, Ritu [1 ]
Tan, Tele [1 ]
Baynam, Gareth [13 ,14 ]
Jamuar, Saumya Shekhar [3 ,4 ,7 ]
机构
[1] Curtin Univ, Perth, Australia
[2] Hlth Catalyst, South Jordan, UT USA
[3] KK Womens & Childrens Hosp, Dept Paediat, Genet Serv, 100 Bukit Timah Rd, Singapore 229899, Singapore
[4] SingHealth Duke NUS Genom Med Ctr, Singapore, Singapore
[5] Singapore Gen Hosp, Natl Neurosci Inst, Dept Neurol, Singapore, Singapore
[6] Singapore Gen Hosp, Dept Internal Med, Singapore, Singapore
[7] SingHealth Duke NUS Inst Precis Med, Singapore, Singapore
[8] Duke NUS Med Sch, Canc & Stem Cell Biol Program, Singapore, Singapore
[9] Genome Inst Singapore, Lab Genome Variat Analyt, Singapore, Singapore
[10] Natl Heart Ctr Singapore, Singapore, Singapore
[11] SingHealth Off Insights & Analyt, Singapore, Singapore
[12] KK Womens & Childrens Hosp, Data Analyt Off, Singapore, Singapore
[13] Perth Childrens Hosp, Rare Care Ctr, Perth, WA, Australia
[14] Western Australian Register Dev Anomalies, Perth, WA, Australia
关键词
FABRY-DISEASE;
D O I
10.1038/s41598-024-55424-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Rare genetic diseases affect 5-8% of the population but are often undiagnosed or misdiagnosed. Electronic health records (EHR) contain large amounts of data, which provide opportunities for analysing and mining. Data mining, in the form of cluster analysis and visualisation, was performed on a database containing deidentified health records of 1.28 million patients across 3 major hospitals in Singapore, in a bid to improve the diagnostic process for patients who are living with an undiagnosed rare disease, specifically focusing on Fabry Disease and Familial Hypercholesterolaemia (FH). On a baseline of 4 patients, we identified 2 additional patients with potential diagnosis of Fabry disease, suggesting a potential 50% increase in diagnosis. Similarly, we identified > 12,000 individuals who fulfil the clinical and laboratory criteria for FH but had not been diagnosed previously. This proof-of-concept study showed that it is possible to perform mining on EHR data albeit with some challenges and limitations.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Analysis and visualisation of electronic health records data to identify undiagnosed patients with rare genetic diseases
    Daniel Moynihan
    Sean Monaco
    Teck Wah Ting
    Kaavya Narasimhalu
    Jenny Hsieh
    Sylvia Kam
    Jiin Ying Lim
    Weng Khong Lim
    Sonia Davila
    Yasmin Bylstra
    Iswaree Devi Balakrishnan
    Mark Heng
    Elian Chia
    Khung Keong Yeo
    Bee Keow Goh
    Ritu Gupta
    Tele Tan
    Gareth Baynam
    Saumya Shekhar Jamuar
    Scientific Reports, 14
  • [2] Analysis and visualisation of electronic health records data to identify undiagnosed patients with rare genetic diseases (vol, 5056, 2024)
    Moynihan, Daniel
    Monaco, Sean
    Ting, Teck Wah
    Narasimhalu, Kaavya
    Hsieh, Jenny
    Kam, Sylvia
    Lim, Jiin Ying
    Lim, Weng Khong
    Davila, Sonia
    Bylstra, Yasmin
    Balakrishnan, Iswaree Devi
    Heng, Mark
    Chia, Elian
    Yeo, Khung Keong
    Goh, Bee Keow
    Gupta, Ritu
    Tan, Tele
    Baynam, Gareth
    Jamuar, Saumya Shekhar
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [3] Electronic health records for the diagnosis of rare diseases
    Garcelon, Nicolas
    Burgun, Anita
    Salomon, Remi
    Neuraz, Antoine
    KIDNEY INTERNATIONAL, 2020, 97 (04) : 676 - 686
  • [4] Exploring electronic health records to study rare diseases
    不详
    LANCET DIGITAL HEALTH, 2025, 7 (02): : e103 - e103
  • [5] Robust Ensemble Learning to Identify Rare Disease Patients from Electronic Health Records
    Colbaugh, Rich
    Glass, Kristin
    Rudolf, Christopher
    Tremblay, Mike
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 4085 - 4088
  • [6] The accuracy of using integrated electronic health care data to identify patients with undiagnosed diabetes mellitus
    Ho, Michael L.
    Lawrence, Nadine
    van Walraven, Carl
    Manuel, Doug
    Keely, Erin
    Malcolm, Janine
    Reid, Robert D.
    Forster, Alan J.
    JOURNAL OF EVALUATION IN CLINICAL PRACTICE, 2012, 18 (03) : 606 - 611
  • [7] Using electronic health records data to identify patients with chronic pain in a primary care setting
    Tian, Terrence Y.
    Zlateva, Ianita
    Anderson, Daren R.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (E2) : E275 - E280
  • [8] Development of an Algorithm to Identify Patients With Hypereosinophilic Syndrome (HES) in Claims Data and Electronic Health Records
    Pongdee, Thanai
    Walker, Valery
    Engel-Nitz, Nicole
    Johnson, Jonathan
    Rothenberg, Marc
    Edgecomb, Amy
    Ahmed, Waseem
    Hwee, Jeremiah
    JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2025, 155 (02) : AB431 - AB431
  • [9] Data analysis and visualisation of Cluster-Oriented Genetic Algorithm output
    Packham, ISJ
    Parmee, IC
    2000 IEEE INTERNATIONAL CONFERENCE ON INFORMATION VISUALISATION, PROCEEDINGS, 2000, : 173 - 178
  • [10] Cluster Analysis on Utilization Patterns of Patients with Chronic Diseases Based on Flattened Electronic Medical Records
    Wang, Debby D.
    Quan, Tan Xin
    Khoo, Astrid
    Sridharan, Srinath
    Ramachandran, Sravan
    Ng, Sheryl Hui-Xian
    Rahman, Siti Nabilah Abdul
    2017 IEEE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2017, : 293 - 298