Support vector machine model of developmental brain gene expression data for prioritization of Autism risk gene candidates

被引:49
|
作者
Cogill, S. [1 ]
Wang, L. [1 ]
机构
[1] Clemson Univ, Dept Biochem & Genet, Clemson, SC 29634 USA
关键词
LONG NONCODING RNAS; SPECTRUM DISORDERS; PREDICTION; KNOWLEDGEBASE; IMPLICATE; EVOLUTION; CHILDREN; INSIGHTS; GENCODE; DNA;
D O I
10.1093/bioinformatics/btw498
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Autism spectrum disorders (ASD) are a group of neurodevelopmental disorders with clinical heterogeneity and a substantial polygenic component. High-throughput methods for ASD risk gene identification produce numerous candidate genes that are time-consuming and expensive to validate. Prioritization methods can identify high-confidence candidates. Previous ASD gene prioritization methods have focused on a priori knowledge, which excludes genes with little functional annotation or no protein product such as long non-coding RNAs (lncRNAs). Results: We have developed a support vector machine (SVM) model, trained using brain developmental gene expression data, for the classification and prioritization of ASD risk genes. The selected feature model had a mean accuracy of 76.7%, mean specificity of 77.2% and mean sensitivity of 74.4%. Gene lists comprised of an ASD risk gene and adjacent genes were ranked using the model's decision function output. The known ASD risk genes were ranked on average in the 77.4th, 78.4th and 80.7th percentile for sets of 101, 201 and 401 genes respectively. Of 10,840 lncRNA genes, 63 were classified as ASD-associated candidates with a confidence greater than 0.95. Genes previously associated with brain development and neurodevelopmental disorders were prioritized highly within the lncRNA gene list.
引用
收藏
页码:3611 / 3618
页数:8
相关论文
共 50 条
  • [31] A novel machine learning model to predict autism spectrum disorders risk gene
    Gok, Murat
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (10): : 6711 - 6717
  • [32] DEVELOPMENTAL EXPRESSION OF PRION PROTEIN GENE IN BRAIN
    MCKINLEY, MP
    HAY, B
    LINGAPPA, VR
    LIEBERBURG, I
    PRUSINER, SB
    DEVELOPMENTAL BIOLOGY, 1987, 121 (01) : 105 - 110
  • [33] Developmental mouse brain gene expression maps
    Brumwell, Craig L.
    Curran, Tom
    JOURNAL OF PHYSIOLOGY-LONDON, 2006, 575 (02): : 343 - 346
  • [34] Class discovery in gene expression data: Characterizing splits by support vector machines
    Markowetz, F
    von Heydebreck, A
    BETWEEN DATA SCIENCE AND APPLIED DATA ANALYSIS, 2003, : 662 - 669
  • [35] Incorporating gene similarity into support vector machine for microarray classification and gene selection
    Tan, Jun-Yan
    Yang, Zhi-Xia
    OPTIMIZATION AND SYSTEMS BIOLOGY, PROCEEDINGS, 2008, 9 : 350 - +
  • [36] Comparison of support vector machines to other classifiers using gene expression data
    Shieh, GS
    Jiang, YC
    Shih, YS
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2006, 35 (01) : 241 - 256
  • [37] A Machine Learning Approach to Predict the Changes of Brain Functional Connectivity in Autism Spectrum Disorder From the Gene Expression Data
    Choudhery, Sanjeevani
    Huang, Chuan
    Wang, Daifeng
    BIOLOGICAL PSYCHIATRY, 2018, 83 (09) : S227 - S228
  • [38] Using Rule-Based Machine Learning for Candidate Disease Gene Prioritization and Sample Classification of Cancer Gene Expression Data
    Glaab, Enrico
    Bacardit, Jaume
    Garibaldi, Jonathan M.
    Krasnogor, Natalio
    PLOS ONE, 2012, 7 (07):
  • [39] Gene selection algorithms for microarray data based on least squares support vector machine
    Tang, EK
    Suganthan, PN
    Yao, X
    BMC BIOINFORMATICS, 2006, 7
  • [40] Gene selection algorithms for microarray data based on least squares support vector machine
    E Ke Tang
    PN Suganthan
    Xin Yao
    BMC Bioinformatics, 7 (1)