Unified variable selection in semi-parametric models

被引:1
|
作者
Terry, William [1 ]
Zhang, Hongmei [2 ]
Maity, Arnab [3 ]
Arshad, Hasan [4 ,5 ]
Karmaus, Wilfried [2 ]
机构
[1] Univ Memphis, Dept Math Sci, Memphis, TN 38152 USA
[2] Univ Memphis, Div Epidemiol Biostat & Environm Hlth, Sch Publ Hlth, Memphis, TN 38152 USA
[3] North Carolina State Univ, Dept Stat, Raleigh, NC 27695 USA
[4] Univ Southampton, Allergy & Clin Immunol, Southampton, Hants, England
[5] David Hide Asthma & Allergy Res Ctr, Isle Of Wight, England
关键词
Bayesian methods; Gaussian kernel; non-linear effects; transformation; reproducing kernel; variable selection; single nucleotide polymorphisms; DNA methylation; DNA METHYLATION; REGRESSION-FUNCTIONS; ASTHMA; VARIANTS; RISK;
D O I
10.1177/0962280215610928
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
We propose a Bayesian variable selection method in semi-parametric models with applications to genetic and epigenetic data (e.g., single nucleotide polymorphisms and DNA methylation, respectively). The data are individually standardized to reduce heterogeneity and facilitate simultaneous selection of categorical (single nucleotide polymorphisms) and continuous (DNA methylation) variables. The Gaussian reproducing kernel is applied to the transformed data to evaluate joint effect of the variables, which may include complex interactions between, e.g., single nucleotide polymorphisms and DNA methylation. Indicator variables are introduced to the model for the purpose of variable selection. The method is demonstrated and evaluated using simulations under different scenarios. We apply the method to identify informative DNA methylation sites and single nucleotide polymorphisms in a set of genes based on their joint effect on allergic sensitization. The selected single nucleotide polymorphisms and methylation sites have the potential to serve as early markers for allergy prediction, and consequently benefit medical and clinical research to prevent allergy before its manifestation.
引用
收藏
页码:2821 / 2831
页数:11
相关论文
共 50 条
  • [1] Variable selection in semi-parametric models
    Zhang, Hongmei
    Maity, Arnab
    Arshad, Hasan
    Holloway, John
    Karmaus, Wilfried
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (04) : 1736 - 1752
  • [2] Variable selection in finite mixture of semi-parametric regression models
    Ormoz, Ehsan
    Eskandari, Farzad
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2016, 45 (03) : 695 - 711
  • [3] Variable selection and estimation for semi-parametric multiple-index models
    Wang, Tao
    Xu, Peirong
    Zhu, Lixing
    [J]. BERNOULLI, 2015, 21 (01) : 242 - 275
  • [4] Modern variable selection for longitudinal semi-parametric models with missing data
    Kowalski, J.
    Hao, S.
    Chen, T.
    Liang, Y.
    Liu, J.
    Ge, L.
    Feng, C.
    Tu, X. M.
    [J]. JOURNAL OF APPLIED STATISTICS, 2018, 45 (14) : 2548 - 2562
  • [5] A semi-parametric estimator for censored selection models with endogeneity
    Lee, MJ
    Vella, F
    [J]. JOURNAL OF ECONOMETRICS, 2006, 130 (02) : 235 - 252
  • [6] Instrumental variable estimation in semi-parametric additive hazards models
    Brueckner, Matthias
    Titman, Andrew
    Jaki, Thomas
    [J]. BIOMETRICS, 2019, 75 (01) : 110 - 120
  • [7] Semi-parametric copula sample selection models for count responses
    Marra, Giampiero
    Wyszynski, Karol
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 104 : 110 - 129
  • [8] Selection of covariance patterns for longitudinal data in semi-parametric models
    Li, Jialiang
    Wong, Weng Kee
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (02) : 183 - 196
  • [9] Robust signed-rank estimation and variable selection for semi-parametric additive partial linear models
    Nguelifack, Brice M.
    Kemajou-Brown, Isabelle
    [J]. JOURNAL OF APPLIED STATISTICS, 2020, 47 (10) : 1794 - 1819
  • [10] Semi-parametric regression models and economies of scale in the presence of an endogenous variable
    Cohen, Jeffrey P.
    Osleeb, Jeffrey P.
    Yang, Ke
    [J]. REGIONAL SCIENCE AND URBAN ECONOMICS, 2014, 49 : 252 - 261