Predicting alternatively spliced exons using semi-supervised learning

被引:2
|
作者
Stanescu, Ana [1 ]
Tangirala, Karthik [1 ]
Caragea, Doina [1 ]
机构
[1] Kansas State Univ, Dept Comp & Informat Sci, Manhattan, KS 66506 USA
关键词
semi-supervised learning; expectation maximisation; self-training; co-training; alternatively spliced exons; constitutively spliced exons; ROC; parameter tuning; cross-validation; Caenorhabditis elegans; EM ALGORITHM; IDENTIFICATION; RNA; PROTEINS;
D O I
10.1504/IJDMB.2016.073337
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cost-efficient next generation sequencers can now produce unprecedented volumes of raw DNA data, posing challenges for annotation. Supervised machine learning approaches have been traditionally used to analyse and annotate complex genomic information. However, such approaches require labelled data for training, which in practice is scarce or expensive, while the unlabelled data is abundant. For some problems, semi-supervised learning can help improve supervised classifiers by making use of large amounts of unlabelled data and the latent information within them. We evaluate the applicability of semi-supervised learning algorithms to the problem of DNA sequence annotation, specifically to the prediction of alternatively spliced exons. We employ Expectation Maximisation, Self-training, and Co-training algorithms in an effort to assess the strengths and limitations of these techniques in the context of alternative splicing.
引用
收藏
页码:1 / 21
页数:21
相关论文
共 50 条
  • [1] Semi-Supervised Learning of Alternatively Spliced Exons Using Co-Training
    Tangirala, Karthik
    Caragea, Doina
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 243 - 246
  • [2] SEMI-SUPERVISED LEARNING OF ALTERNATIVELY SPLICED EXONS USING EXPECTATION MAXIMIZATION TYPE APPROACHES
    Stanescu, Ana
    Caragea, Doina
    BIOINFORMATICS: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOINFORMATICS MODELS, METHODS AND ALGORITHMS, 2012, : 240 - 245
  • [3] Predicting buffer overflow using semi-supervised learning
    Meng Qingkun
    Wen Shameng
    Feng Chao
    Tang Chaojing
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 1959 - 1963
  • [4] Using Semi-Supervised Learning for Predicting Metamorphic Relations
    Hardin, Bonnie
    Kanewala, Upulee
    2018 IEEE/ACM 3RD INTERNATIONAL WORKSHOP ON METAMORPHIC TESTING (MET 2018), 2018, : 14 - 17
  • [5] Learning And Predicting Diabetes Data Sets Using Semi-Supervised Learning
    Tayal, Radhika
    Shankar, Achyut
    PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, 2020, : 385 - 389
  • [6] Alternatively spliced internal exons prediction using SVM
    Qiao, Yuanhua
    Jia, Erze
    Zeng, Yanjun
    PROGRESS ON POST-GENOME TECHNOLOGIES, 2007, : 115 - 118
  • [7] A phylogenetic generalized hidden Markov model for predicting alternatively spliced exons
    Allen, Jonathan E.
    Salzberg, Steven L.
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2006, 1 (1)
  • [8] A phylogenetic generalized hidden Markov model for predicting alternatively spliced exons
    Jonathan E Allen
    Steven L Salzberg
    Algorithms for Molecular Biology, 1
  • [9] Predicting Personality Using Facebook Status Based on Semi-supervised Learning
    Zheng, Heci
    Wu, Chunhua
    ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 53 - 58
  • [10] Are all of the human exons alternatively spliced?
    Chen, Feng-Chi
    BRIEFINGS IN BIOINFORMATICS, 2014, 15 (04) : 542 - 551