FEATURES SELECTION USING PARAMETRIC AND NON-PARAMETRIC METHODS: TAG SNPs SELECTION USING GA-SVM AND GA-KNN

被引:1
|
作者
Elatraby, Amr I. A. [1 ]
Wahba, Rashad R. T. [1 ]
机构
[1] Ain Shams Univ, Fac Commerce, Stat Math & Insurance Dept, Cairo, Egypt
关键词
Single Nucleotide Polymorphisms (SNPs); tag SNPs; Support Vector Machine (SVM); K-Nearest Neighbor (KNN); Genetic Algorithm (GA);
D O I
10.17654/ADASMay2015_105_123
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The study of genetic variations of the human genome, especially Single Nucleotide Polymorphisms (SNPs), can lead to the discovery of new methods to prevent, diagnose and treat diseases. Full examination of all the SNPs of the human genome has become too expensive, thus a small subset of informative SNPs called tag SNPs must be selected. In this study, two methods for the selection of tag SNPs are presented. The first method is called GA-SVM, which integrates the Support Vector Machine (SVM) as a parametric technique with the Genetic Algorithm (GA). The second method is called GA-KNN, which integrates the K-Nearest Neighbor (KNN) as a non-parametric technique with GA. The two methods are tested on a group of genes, which known to be related to the natural clearance of Hepatitis C Virus (HCV). The genes' SNPs data had extracted from the HapMap site (http://hapmap.org). Moreover, the prediction accuracy of each method has been evaluated by using the 10-Fold Cross Validation (10-FCV) method. Our results have showed that, although the prediction accuracy of GA-SVM outperforms the prediction accuracy of GA-KNN when selecting a very small number of tag SNPs, the prediction accuracy of GA-KNN outperforms GA-SVM in all other cases. In addition, our results have indicated that the GA-KNN method requires more computing time as compared with GA-SVM.
引用
收藏
页码:105 / 123
页数:19
相关论文
共 50 条
  • [21] Uncertainty and sensitivity analysis of a PWR LOCA sequence using parametric and non-parametric methods
    Zugazagoitia, Eneko
    Queral, Cesar
    Fernandez-Cosials, Kevin
    Gomez, Javier
    Felipe Duran, Luis
    Sanchez-Torrijos, Jorge
    Maria Posada, Jose
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2020, 193
  • [22] Evaluation of Disparity Map computed using Local Stereo Parametric and Non-Parametric methods
    Tabssum, Tahera
    Charles, Priya
    Patil, A. V.
    2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), 2016, : 104 - 109
  • [23] EVALUATION OF PHENOTYPIC STABILITY IN BREAD WHEAT ACCESSIONS USING PARAMETRIC AND NON-PARAMETRIC METHODS
    Yaghotipoor, A.
    Farshadfar, E.
    Saeidi, M.
    JOURNAL OF ANIMAL AND PLANT SCIENCES-JAPS, 2017, 27 (04): : 1269 - 1275
  • [24] Assessment of Water Quality Trends in the Minnesota River using Non-Parametric and Parametric Methods
    Johnson, Heather O.
    Gupta, Satish C.
    Vecchia, Aldo V.
    Zvomuya, Francis
    JOURNAL OF ENVIRONMENTAL QUALITY, 2009, 38 (03) : 1018 - 1030
  • [25] Topology optimisation of adhesive joints using non-parametric methods
    Ejaz, H.
    Mubashar, A.
    Ashcroft, I. A.
    Uddin, Emad
    Khan, M.
    INTERNATIONAL JOURNAL OF ADHESION AND ADHESIVES, 2018, 81 : 1 - 10
  • [26] Localization of growth estimates using non-parametric imputation methods
    Sironen, S.
    Kangas, A.
    Maltamo, M.
    Kalliovirta, J.
    FOREST ECOLOGY AND MANAGEMENT, 2008, 256 (04) : 674 - 684
  • [27] Rainfall Forecasting Using Sub sampling Non-parametric Methods
    Pucheta, J.
    Rodriguez Rivero, C.
    Herrera, M.
    Salas, C.
    Sauchelli, V.
    IEEE LATIN AMERICA TRANSACTIONS, 2013, 11 (01) : 646 - 650
  • [29] Unsupervised Clustering of Utterances using Non-parametric Bayesian Methods
    Higashinaka, Ryuichiro
    Kawamae, Noriaki
    Sadamitsu, Kugatsu
    Minami, Yasuhiro
    Meguro, Toyomi
    Dohsaka, Kohji
    Inagaki, Hirohito
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2092 - 2095
  • [30] Speaker Linking and Applications using Non-parametric Hashing Methods
    Sturim, Douglas
    Campbell, William M.
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2170 - 2174