A kernel-based integration of genome-wide data for clinical decision support

Cited by: 50
Authors
Daemen, Anneleen [1]
Gevaert, Olivier [1]
Ojeda, Fabian [1]
Debucquoy, Annelies [2]
Suykens, Johan A. K. [1]
Sempoux, Christine [3]
Machiels, Jean-Pascal [4]
Haustermans, Karin [2]
De Moor, Bart [1]
Affiliations
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT SCD, B-3001 Louvain, Belgium
[2] Katholieke Univ Leuven, Dept Expt Radiotherapy, B-3000 Louvain, Belgium
[3] Catholic Univ Louvain, St Luc Univ Hosp, Dept Pathol, B-1200 Brussels, Belgium
[4] Catholic Univ Louvain, St Luc Univ Hosp, Dept Med Oncol, B-1200 Brussels, Belgium
Source
GENOME MEDICINE | 2009 / Volume 1
Keywords
RECTAL-CANCER; COPY NUMBER; PROGNOSTIC-SIGNIFICANCE; PROTEIN EXPRESSION; LOCAL RECURRENCE; MICROARRAY DATA; NF-KAPPAB; TUMOR; GROWTH; CELLS;
DOI
10.1186/gm39
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Background: Although microarray technology allows the investigation of the transcriptomic make-up of a tumor in one experiment, the transcriptome does not completely reflect the underlying biology due to alternative splicing, post-translational modifications, as well as the influence of pathological conditions (for example, cancer) on transcription and translation. This increases the importance of fusing more than one source of genome-wide data, such as the genome, transcriptome, proteome, and epigenome. The current increase in the amount of available omics data emphasizes the need for a methodological integration framework.
Methods: We propose a kernel-based approach for clinical decision support in which many genome-wide data sources are combined. Integration occurs within the patient domain at the level of kernel matrices before building the classifier. As the supervised classification algorithm, a weighted least squares support vector machine is used. We apply this framework to two cancer cases, namely, a rectal cancer data set containing microarray and proteomics data and a prostate cancer data set containing microarray and genomics data. For both cases, multiple outcomes are predicted.
Results: For the rectal cancer outcomes, the highest leave-one-out (LOO) areas under the receiver operating characteristic curve (AUC) were obtained when combining microarray and proteomics data gathered during therapy and ranged from 0.927 to 0.987. For prostate cancer, all four outcomes had a better LOO AUC when combining microarray and genomics data, ranging from 0.786 for recurrence to 0.987 for metastasis.
Conclusions: For both cancer sites the prediction of all outcomes improved when more than one genome-wide data set was considered. This suggests that integrating multiple genome-wide data sources increases the predictive performance of clinical decision support models. This emphasizes the need for comprehensive multi-modal data. We acknowledge that, in a first phase, this will substantially increase costs; however, this is a necessary investment to ultimately obtain cost-efficient models usable in patient tailored therapy.
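The integration step described under Methods (one kernel matrix per data source, combined at the patient level before a least squares support vector machine is trained and scored by leave-one-out AUC) can be sketched roughly as follows. This is only an illustration and not the authors' implementation: the normalized linear kernel, the equal kernel weights, the regularization value `gamma`, the optional `sample_weights` argument, and the synthetic data are all assumptions made for the example, and every function name is hypothetical.

```python
import numpy as np

def linear_kernel(X):
    """Patients-by-patients linear kernel, scaled to unit diagonal (assumed kernel choice)."""
    K = X @ X.T
    d = np.sqrt(np.diag(K))
    return K / np.outer(d, d)

def combine_kernels(kernels, weights=None):
    """Weighted sum of kernel matrices; equal weights are an assumption."""
    if weights is None:
        weights = np.full(len(kernels), 1.0 / len(kernels))
    return sum(w * K for w, K in zip(weights, kernels))

def lssvm_fit(K, y, gamma=1.0, sample_weights=None):
    """Solve the LS-SVM linear system for labels in {-1, +1}.

    sample_weights scales the regularization per patient; how the paper
    defines its weighting is not stated in the abstract.
    """
    n = len(y)
    v = np.ones(n) if sample_weights is None else np.asarray(sample_weights, dtype=float)
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = 1.0
    A[1:, 0] = 1.0
    A[1:, 1:] = K + np.diag(1.0 / (gamma * v))
    sol = np.linalg.solve(A, np.concatenate(([0.0], y)))
    return sol[0], sol[1:]  # bias b, dual coefficients alpha

def loo_scores(K, y, gamma=1.0):
    """Leave-one-out decision values computed on a precomputed kernel."""
    n = len(y)
    scores = np.empty(n)
    for i in range(n):
        train = np.delete(np.arange(n), i)
        b, alpha = lssvm_fit(K[np.ix_(train, train)], y[train], gamma)
        scores[i] = K[i, train] @ alpha + b
    return scores

def auc(scores, y):
    """Area under the ROC curve via pairwise comparison of positives and negatives."""
    pos, neg = scores[y == 1], scores[y == -1]
    diff = pos[:, None] - neg[None, :]
    return (np.sum(diff > 0) + 0.5 * np.sum(diff == 0)) / (len(pos) * len(neg))

# Synthetic stand-ins for two data sources measured on the same 20 patients.
rng = np.random.default_rng(0)
y = np.array([1] * 10 + [-1] * 10)
X_a = rng.normal(size=(20, 50)) + 0.5 * y[:, None]  # e.g. a microarray-like source
X_b = rng.normal(size=(20, 30)) + 0.5 * y[:, None]  # e.g. a proteomics-like source

K = combine_kernels([linear_kernel(X_a), linear_kernel(X_b)])
print("LOO AUC on synthetic data:", auc(loo_scores(K, y.astype(float)), y))
```

Because both sources are reduced to patient-by-patient matrices of the same size, they can be summed even though the underlying feature spaces differ in dimension and scale; that is the sense in which integration happens "within the patient domain".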
Pages: 17