Feature Selection for Detecting Gene-Gene Interactions in Genome-Wide Association Studies

被引:5
|
作者
Dorani, Faramarz [1 ]
Hu, Ting [1 ]
机构
[1] Mem Univ, Dept Comp Sci, St John, NF A1B 3X5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Feature selection; Relief algorithms; Information gain; Genome-wide association studies; Gene-gene interactions; CLASSIFICATION; EPISTASIS; RELIEFF;
D O I
10.1007/978-3-319-77538-8_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Disease association studies aim at finding the genetic variations underlying complex human diseases in order to better understand the etiology of the disease and to provide better diagnoses, treatment, and even prevention. The non-linear interactions among multiple genetic factors play an important role in finding those genetic variations, but have not always been taken fully into account. This is due to the fact that searching combinations of interacting genetic factors becomes inhibitive as its complexity grows exponentially with the size of data. It is especially challenging for genome-wide association studies (GWAS) where typically more than a million single-nucleotide polymorphisms (SNPs) are under consideration. Dimensionality reduction is thus needed to allow us to investigate only a subset of genetic attributes that most likely have interaction effects. In this article, we conduct a comprehensive study by examining six widely used feature selection methods in machine learning for filtering interacting SNPs rather than the ones with strong individual main effects. Those six feature selection methods include chi-square, logistic regression, odds ratio, and three Relief-based algorithms. By applying all six feature selection methods to both a simulated and a real GWAS datasets, we report that Relief-based methods perform the best in filtering SNPs associated with a disease in terms of strong interaction effects.
引用
收藏
页码:33 / 46
页数:14
相关论文
共 50 条
  • [1] A FAST ALGORITHM FOR DETECTING GENE-GENE INTERACTIONS IN GENOME-WIDE ASSOCIATION STUDIES
    Li, Jiahan
    Zhong, Wei
    Li, Runze
    Wu, Rongling
    [J]. ANNALS OF APPLIED STATISTICS, 2014, 8 (04): : 2292 - 2318
  • [2] The choice of null distributions for detecting gene-gene interactions in genome-wide association studies
    Can Yang
    Xiang Wan
    Zengyou He
    Qiang Yang
    Hong Xue
    Weichuan Yu
    [J]. BMC Bioinformatics, 12
  • [3] The choice of null distributions for detecting gene-gene interactions in genome-wide association studies
    Yang, Can
    Wan, Xiang
    He, Zengyou
    Yang, Qiang
    Xue, Hong
    Yu, Weichuan
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [4] Software for detecting gene-gene interactions in genome wide association studies
    Koo, Ching Lee
    Liew, Mei Jing
    Mohamad, Mohd Saberi
    Salleh, Abdul Hakim Mohamed
    Deris, Safaai
    Ibrahim, Zuwairie
    Susilo, Bambang
    Hendrawan, Yusuf
    Wardani, Agustin Krisna
    [J]. BIOTECHNOLOGY AND BIOPROCESS ENGINEERING, 2015, 20 (04) : 662 - 676
  • [5] Software for detecting gene-gene interactions in genome wide association studies
    Ching Lee Koo
    Mei Jing Liew
    Mohd Saberi Mohamad
    Abdul Hakim Mohamed Salleh
    Safaai Deris
    Zuwairie Ibrahim
    Bambang Susilo
    Yusuf Hendrawan
    Agustin Krisna Wardani
    [J]. Biotechnology and Bioprocess Engineering, 2015, 20 : 662 - 676
  • [6] High performance Grid computing for detecting gene-gene interactions in genome-wide association studies
    Hmida, M. Ben Haj
    Slimani, Y.
    [J]. INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2012, 5 (01): : 33 - 44
  • [7] Rapid Testing of Gene-gene Interactions in Genome-wide Association Studies
    Bhattacharya, Kanishka
    Magi, Reedik
    Morris, Andrew P.
    [J]. GENETIC EPIDEMIOLOGY, 2010, 34 (08) : 930 - 931
  • [8] RAPID detection of gene-gene interactions in genome-wide association studies
    Brinza, Dumitru
    Schultz, Matthew
    Tesler, Glenn
    Bafna, Vineet
    [J]. BIOINFORMATICS, 2010, 26 (22) : 2856 - 2862
  • [9] Testing Gene-Gene Interactions in Genome Wide Association Studies
    Hu, Jie Kate
    Wang, Xianlong
    Wang, Pei
    [J]. GENETIC EPIDEMIOLOGY, 2014, 38 (02) : 123 - 134
  • [10] Testing Gene-Gene Interactions Based on a Neighborhood Perspective in Genome-wide Association Studies
    Guo, Yingjie
    Cheng, Honghong
    Yuan, Zhian
    Liang, Zhen
    Wang, Yang
    Du, Debing
    [J]. FRONTIERS IN GENETICS, 2021, 12