Integrative classification and analysis of multiple arrayCGH datasets with probe alignment

被引:6
|
作者
Ze Tian [1 ]
Rui Kuang [1 ]
机构
[1] Univ Minnesota Twin Cities, Dept Comp Sci & Engn, Minneapolis, MN USA
关键词
COPY NUMBER VARIATION; GENE-EXPRESSION; BLADDER-CANCER; HUMAN GENOME; CGH DATA; ALGORITHMS; MATRIX;
D O I
10.1093/bioinformatics/btq428
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Array comparative genomic hybridization (arrayCGH) is widely used to measure DNA copy numbers in cancer research. ArrayCGH data report log-ratio intensities of thousands of probes sampled along the chromosomes. Typically, the choices of the locations and the lengths of the probes vary in different experiments. This discrepancy in choosing probes poses a challenge in integrated classification or analysis across multiple arrayCGH datasets. We propose an alignment-based framework to integrate arrayCGH samples generated from different probe sets. The alignment framework seeks an optimal alignment between the probe series of one arrayCGH sample and the probe series of another sample, intended to find the maximum possible overlap of DNA copy number variations between the two measured chromosomes. An alignment kernel is introduced for integrative patient sample classification and a multiple alignment algorithm is also introduced for identifying common regions with copy number aberrations. Results: The probe alignment kernel and the MPA algorithm were experimented to integrate three bladder cancer datasets as well as artificial datasets. In the experiments, by integrating arrayCGH samples from multiple datasets, the probe alignment kernel used with support vector machines significantly improved patient sample classification accuracy over other baseline kernels. The experiments also demonstrated that the multiple probe alignment (MPA) algorithm can find common DNA aberrations that cannot be identified with the standard interpolation method. Furthermore, the MPA algorithm also identified many known bladder cancer DNA aberrations containing four known bladder cancer genes, three of which cannot be detected by interpolation.
引用
收藏
页码:2313 / 2320
页数:8
相关论文
共 50 条
  • [31] Enhancing Electrocardiogram Classification with Multiple Datasets and Distant Transfer Learning
    Chui, Kwok Tai
    Gupta, Brij B.
    Zhao, Mingbo
    Malibari, Areej
    Arya, Varsha
    Alhalabi, Wadee
    Ruiz, Miguel Torres
    BIOENGINEERING-BASEL, 2022, 9 (11):
  • [32] Analysis of temporal alignment for Video Classification
    Blanc, Katy
    Lingrand, Diane
    Paladini, Antonio
    Coviello, Luca
    Mitrev, Dane
    Sohler, Emily
    Guzman, Leonardo
    Precioso, Frederic
    2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019, : 495 - 499
  • [33] Multiple sequence alignment using biological features classification
    Besharati, Arezoo
    Mehrdadjalali
    2014 INTERNATIONAL CONGRESS ON TECHNOLOGY, COMMUNICATION AND KNOWLEDGE (ICTCK), 2014,
  • [34] Screening and Identification of Human Endogenous Retrovirus-K mRNAs for Breast Cancer Through Integrative Analysis of Multiple Datasets
    Wei, Yongzhong
    Wei, Huilin
    Wei, Yinfeng
    Tan, Aihua
    Chen, Xiuyong
    Liao, Xiuquan
    Xie, Bo
    Wei, Xihua
    Li, Lanxiang
    Liu, Zengjing
    Dai, Shengkang
    Khan, Adil
    Pang, Xianwu
    Hassan, Nada M. A.
    Xiong, Kai
    Zhang, Kai
    Leng, Jing
    Lv, Jiannan
    Hu, Yanling
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [35] iASeq: integrative analysis of allele-specificity of protein-DNA interactions in multiple ChIP-seq datasets
    Yingying Wei
    Xia Li
    Qian-fei Wang
    Hongkai Ji
    BMC Genomics, 13
  • [36] iASeq: integrative analysis of allele-specificity of protein-DNA interactions in multiple ChIP-seq datasets
    Wei, Yingying
    Li, Xia
    Wang, Qian-fei
    Ji, Hongkai
    BMC GENOMICS, 2012, 13
  • [37] A penalized integrative deep neural network for variable selection among multiple omics datasets
    Yang Li
    Xiaonan Ren
    Haochen Yu
    Tao Sun
    Shuangge Ma
    Quantitative Biology, 2024, 12 (03) : 313 - 323
  • [38] The possibility of integrative causal analysis: learning from different datasets and studies
    Tsamardinos, Ioannis
    Triantafillou, Sofia
    ENGINEERING INTELLIGENT SYSTEMS FOR ELECTRICAL ENGINEERING AND COMMUNICATIONS, 2009, 17 (2-3): : 163 - 175
  • [39] A penalized integrative deep neural network for variable selection among multiple omics datasets
    Li, Yang
    Ren, Xiaonan
    Yu, Haochen
    Sun, Tao
    Ma, Shuangge
    QUANTITATIVE BIOLOGY, 2024, 12 (03) : 313 - 323
  • [40] A local search algorithm for local multiple alignment: Special case analysis and application to cancer classification
    Akutsu, T
    PDPTA'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, 2001, : 1284 - 1290