Integrative classification and analysis of multiple arrayCGH datasets with probe alignment

被引:6
|
作者
Ze Tian [1 ]
Rui Kuang [1 ]
机构
[1] Univ Minnesota Twin Cities, Dept Comp Sci & Engn, Minneapolis, MN USA
关键词
COPY NUMBER VARIATION; GENE-EXPRESSION; BLADDER-CANCER; HUMAN GENOME; CGH DATA; ALGORITHMS; MATRIX;
D O I
10.1093/bioinformatics/btq428
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Array comparative genomic hybridization (arrayCGH) is widely used to measure DNA copy numbers in cancer research. ArrayCGH data report log-ratio intensities of thousands of probes sampled along the chromosomes. Typically, the choices of the locations and the lengths of the probes vary in different experiments. This discrepancy in choosing probes poses a challenge in integrated classification or analysis across multiple arrayCGH datasets. We propose an alignment-based framework to integrate arrayCGH samples generated from different probe sets. The alignment framework seeks an optimal alignment between the probe series of one arrayCGH sample and the probe series of another sample, intended to find the maximum possible overlap of DNA copy number variations between the two measured chromosomes. An alignment kernel is introduced for integrative patient sample classification and a multiple alignment algorithm is also introduced for identifying common regions with copy number aberrations. Results: The probe alignment kernel and the MPA algorithm were experimented to integrate three bladder cancer datasets as well as artificial datasets. In the experiments, by integrating arrayCGH samples from multiple datasets, the probe alignment kernel used with support vector machines significantly improved patient sample classification accuracy over other baseline kernels. The experiments also demonstrated that the multiple probe alignment (MPA) algorithm can find common DNA aberrations that cannot be identified with the standard interpolation method. Furthermore, the MPA algorithm also identified many known bladder cancer DNA aberrations containing four known bladder cancer genes, three of which cannot be detected by interpolation.
引用
下载
收藏
页码:2313 / 2320
页数:8
相关论文
共 50 条
  • [41] Alignment of multiple non-overlapping axially symmetric 3D datasets
    Willis, A
    Cooper, DB
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 96 - 99
  • [42] Performance of SVM with Multiple Kernel Learning for Classification Tasks of Imbalanced Datasets
    Saeed, Sana
    Ong, Hong Choon
    PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY, 2019, 27 (01): : 527 - 545
  • [43] Action class relation detection and classification across multiple video datasets
    Yoshikawa, Yuya
    Shigeto, Yutaro
    Shimbo, Masashi
    Takeuchi, Akikazu
    PATTERN RECOGNITION LETTERS, 2023, 173 : 93 - 100
  • [44] ALIGNMENT AND CALIBRATION OF MULTIPLE MAGNETIC PROBE IN DEFLAGRATION PLASMA GUN BARREL
    NOMURA, JL
    LELANA, DS
    JOHNSON, DC
    TRIPATHI, PP
    CHANG, CN
    CHENG, DY
    BULLETIN OF THE AMERICAN PHYSICAL SOCIETY, 1976, 21 (09): : 1175 - 1175
  • [45] A tool for alignment of multiple laser beams in pump-probe experiments
    Karimullin, Kamil
    Knyazev, Mikhail
    Eremchev, Ivan
    Vainer, Yuri
    Naumov, Andrei
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2013, 24 (02)
  • [46] Comparative Analysis of HAR Datasets Using Classification Algorithms
    Nayak, Suvra
    Panigrahi, Chhabi
    Pati, Bibudhendu
    Nanda, Sarmistha
    Hsieh, Meng-Yen
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2022, 19 (01) : 47 - 63
  • [47] Performance analysis of Classification Algorithms under Different Datasets
    Rani, A. Swarupa
    Jyothi, S.
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 1584 - 1589
  • [48] Sex classification from functional brain connectivity: Generalization to multiple datasets
    Wiersch, Lisa
    Friedrich, Patrick
    Hamdan, Sami
    Komeyer, Vera
    Hoffstaedter, Felix
    Patil, Kaustubh R.
    Eickhoff, Simon B.
    Weis, Susanne
    HUMAN BRAIN MAPPING, 2024, 45 (06)
  • [49] Classification methods, reduced datasets and quality analysis applications
    Alippi, C
    Braione, P
    2004 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MEASUREMENT SYSTEMS AND APPLICATIONS, 2004, : 121 - 126
  • [50] A WORKBENCH FOR MULTIPLE ALIGNMENT CONSTRUCTION AND ANALYSIS
    SCHULER, GD
    ALTSCHUL, SF
    LIPMAN, DJ
    PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1991, 9 (03): : 180 - 190