Network-based regularization for analysis of high-dimensional genomic data with group structure

Authors
Kim, Kipoong [1]
Choi, Jiyun [1]
Sun, Hokeun [1]
Affiliations
[1] Pusan Natl Univ, Dept Stat, Busandaehak Ro 63beon Gil, Busan 46241, South Korea
Keywords
high-dimensional genomic data; network-based regularization; genetic network; principal component analysis (PCA); partial least squares (PLS)
DOI
10.5351/KJAS.2016.29.6.1117
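Chinese Library Classification (CLC) codes
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714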
Abstract
In genetic association studies with high-dimensional genomic data, regularization procedures based on penalized likelihood are often applied to identify genes or genetic regions associated with diseases or traits. A network-based regularization procedure can utilize biological network information (such as genetic pathways and signaling pathways in genetic association studies) and offers better selection performance than other regularization procedures such as the lasso and elastic net. However, network-based regularization has a limitation: it cannot be applied to high-dimensional genomic data with a group structure. In this article, we propose combining dimension reduction techniques such as principal component analysis and partial least squares with network-based regularization for the analysis of high-dimensional genomic data with a group structure. The selection performance of the proposed method was evaluated in extensive simulation studies. The proposed method was also applied to real DNA methylation data generated from the Illumina Infinium HumanMethylation27K BeadChip, where methylation beta values of around 20,000 CpG sites over 12,770 genes were compared between 123 ovarian cancer patients and 152 healthy controls. The analysis also identified a few cancer-related genes.
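The idea in the abstract, reducing each group of features to a single summary and then applying a network penalty at the group level, can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation. The abstract does not give the penalty or the solver, so the code assumes one common form of network penalty (an L1 term plus a quadratic term on the normalized graph Laplacian), uses only the PCA variant, aligns the sign of each first principal component with the group mean, and fits by proximal gradient descent. All function names, the simulated data, and the values of `lam1` and `lam2` are hypothetical.

```python
import numpy as np

def first_pc_scores(X):
    """Scores of the rows of X on the first principal component."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    scores = Xc @ Vt[0]
    # Assumption: fix the arbitrary PC sign so it agrees with the group mean.
    if scores @ Xc.mean(axis=1) < 0:
        scores = -scores
    return scores

def reduce_groups(X, groups):
    """Collapse each group of columns (e.g. CpG sites of one gene) to one score."""
    groups = np.asarray(groups)
    labels = np.unique(groups)
    Z = np.column_stack([first_pc_scores(X[:, groups == g]) for g in labels])
    return Z, labels

def normalized_laplacian(A):
    """Normalized Laplacian of a symmetric adjacency matrix; isolated nodes get zero rows."""
    A = np.asarray(A, dtype=float)
    d = A.sum(axis=1)
    inv_sqrt = np.zeros_like(d)
    inv_sqrt[d > 0] = 1.0 / np.sqrt(d[d > 0])
    return np.diag((d > 0).astype(float)) - inv_sqrt[:, None] * A * inv_sqrt[None, :]

def soft_threshold(v, t):
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def fit_network_logistic(Z, y, L, lam1=0.05, lam2=0.1, n_iter=500):
    """Minimize mean logistic loss + lam1*||b||_1 + (lam2/2)*b'Lb by proximal gradient."""
    n, p = Z.shape
    Zs = (Z - Z.mean(axis=0)) / Z.std(axis=0)
    D = np.column_stack([np.ones(n), Zs])
    # Step size from an upper bound on the Lipschitz constant of the smooth part.
    lip = 0.25 * np.linalg.norm(D, 2) ** 2 / n + lam2 * np.linalg.norm(L, 2)
    step = 1.0 / lip
    b0, b = 0.0, np.zeros(p)
    for _ in range(n_iter):
        prob = 1.0 / (1.0 + np.exp(-(b0 + Zs @ b)))
        resid = prob - y
        grad_b = Zs.T @ resid / n + lam2 * (L @ b)
        b0 -= step * resid.mean()
        b = soft_threshold(b - step * grad_b, step * lam1)
    return b0, b

# Simulated example: 10 genes with 3 sites each; genes 0 and 1 are linked
# in the network and both drive the binary outcome.
rng = np.random.default_rng(0)
n, n_genes, sites = 200, 10, 3
groups = np.repeat(np.arange(n_genes), sites)
latent = rng.normal(size=(n, n_genes))
X = latent[:, groups] + 0.5 * rng.normal(size=(n, n_genes * sites))
y = (rng.random(n) < 1.0 / (1.0 + np.exp(-2.0 * (latent[:, 0] + latent[:, 1])))).astype(float)
A = np.zeros((n_genes, n_genes))
A[0, 1] = A[1, 0] = 1.0

Z, labels = reduce_groups(X, groups)
b0, b = fit_network_logistic(Z, y, normalized_laplacian(A))
print("genes with nonzero coefficients:", labels[np.flatnonzero(b)])
```

Summarizing each gene by a single score is what lets a gene-level network be used when the raw features are CpG sites. A partial least squares summary would replace `first_pc_scores` with a component chosen using the outcome `y`.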
Pages: 1117-1128
Number of pages: 12