Clinico-genomic Data Analytics for Precision Diagnosis and Disease Management

被引:3
|
作者
Kench, Anuja [1 ]
Janeja, Vandana P. [1 ]
Yesha, Yelena [1 ]
Rishe, Naphtali [2 ]
Grasso, Michael A. [3 ]
Niskar, Amanda [4 ]
机构
[1] Univ Maryland Baltimore Cty, Baltimore, MD 21228 USA
[2] Florida Int Univ, Miami, FL 33199 USA
[3] Univ Maryland, Sch Med, Baltimore, MD 21201 USA
[4] Arthrit Fdn, Atlanta, GA USA
关键词
Personalized Medicine; Single Nucleotide Polymorphism; correlations; clustering; CATEGORICAL-DATA; ALGORITHM;
D O I
10.1109/ICHI.2015.38
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Patient data can be present in clinical notes, lab results, genomic data sources, environmental and geospatial data sources and tissue banks to name a few. A holistic view of the patient's health can be achieved when relevant data from multiple heterogeneous sources are extracted and analyzed in a personalized manner. Moreover, comparative analysis of patients can be performed when multiple patient records are viewed across these heterogeneous data sources. To address this need, we propose clinico-genomic data analytics to enhance personalized medicine treatment decisions using heterogeneous, high dimensional, sparse and massive datasets. We utilize this framework to discover similar patients and overlaps among patients in a set of features towards the goals of: (1) better cohort discovery for clinical trials, (2) better disease management by studying peer group of patients with similar diagnosis but better prognosis, (3) early disease diagnosis by identifying similar features in patients with the existing diagnosis. We propose novel approach in two areas: (1) integrating clinical and genomic data of patients and (2) combined data analytics in such heterogeneous datasets. Our approach is modeled as a unified clustering algorithm for finding correlations among clinical and genomic factors of patients. We integrate data containing risk causing Single Nucleotide Polymorphism's (SNP's) known from literature with clinical records of patients. In such heterogeneous data, we propose a combined similarity measure for numeric and nominal data attributes, which we use in our clustering algorithm. Our results show compelling overlaps among patients in the same cluster. These patients had high pairwise similarity and emulated the real world similarities between patients with co-morbid diseases.
引用
收藏
页码:263 / 271
页数:9
相关论文
共 50 条
  • [1] Extending Workflow Management for Knowledge Discovery in Clinico-Genomic Data
    Rueping, Stefan
    Sfakianakis, Stelios
    Tsiknakis, Manolis
    [J]. FROM GENES TO PERSONALIZED HEALTHCARE: GRID SOLUTIONS FOR THE LIFE SCIENCES, 2007, 126 : 184 - +
  • [2] The importance of an ontology based clinical data management system (OCDMS) for clinico-genomic trials in ACGT (advancing clinico-genomic trials on cancer)
    Graf, Norbert
    Weiler, Gabriele
    Brochhausen, Mathias
    Schera, Fatima
    Hoppe, Alexander
    Tsiknakis, Manolis
    Kiefer, Stephan
    [J]. PEDIATRIC BLOOD & CANCER, 2007, 49 (04) : 581 - 581
  • [3] Grid-based knowledge discovery in clinico-genomic data
    May, Michael
    Potamias, George
    Rueping, Stefan
    [J]. BIOLOGICAL AND MEDICAL DATA ANALYSIS, PROCEEDINGS, 2006, 4345 : 219 - +
  • [4] Real-World Evidence In Support Of Precision Medicine: Clinico-Genomic Cancer Data As A Case Study
    Agarwala, Vineeta
    Khozin, Sean
    Singal, Gaurav
    O'Connell, Claire
    Kuk, Deborah
    Li, Gerald
    Gossai, Anala
    Miller, Vincent
    Abernethy, Amy P.
    [J]. HEALTH AFFAIRS, 2018, 37 (05) : 765 - 772
  • [5] Selection matters: Addressing biases in linked clinico-genomic data sets
    Panageas, Katherine
    Kehl, Kenneth L.
    Incerti, Devin
    Sturmer, Til
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2021, 30 : 11 - 12
  • [6] Mining interesting Clinico-Genomic associations: The HealthObs approach
    Potamias, George
    Koumakis, Lefteris
    Kanterakis, Alexandros
    Moustakis, Vassilis
    Kafetzopouios, Dimitrsi
    Tsiknakis, Manolis
    [J]. ARTIFICIAL INTELLIGENCE AND INNOVATIONS 2007: FROM THEORY TO APPLICATIONS, 2007, : 137 - +
  • [7] HIGH-LEVEL MODEL DEFINITION FOR MICROARRAY DATA IN A FUTURE CLINICO-GENOMIC EHR
    Bucur, Anca
    van Leeuwen, Jasper
    Vdovjak, Richard
    Vrijnsen, Jeroen
    [J]. HEALTHINF 2010: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON HEALTH INFORMATICS, 2010, : 149 - 154
  • [8] Retrospective Analysis of Real-World Clinico-Genomic Data for Clinical Impact of Genomic Profiling of ctDNA in NSCLC
    Madison, R.
    Schrock, A.
    Gregg, J.
    Carson, K.
    Castellanos, E.
    Singal, G.
    Miller, V.
    Ali, S.
    Alexander, B.
    Chung, J.
    [J]. JOURNAL OF THORACIC ONCOLOGY, 2019, 14 (10) : S364 - S365
  • [9] Supporting Clinico-Genomic knowledge discovery: A multi-strategy data mining process
    Kanterakis, Alexandros
    Potamias, George
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 3955 : 520 - 524
  • [10] Survival prediction from clinico-genomic models - a comparative study
    Hege M Bøvelstad
    Ståle Nygård
    Ørnulf Borgan
    [J]. BMC Bioinformatics, 10