Predicting microRNA precursors with a generalized Gaussian components based density estimation algorithm

被引:15
|
作者
Hsieh, Chih-Hung [2 ]
Chang, Darby Tien-Hao [1 ]
Hsueh, Cheng-Hao [1 ]
Wu, Chi-Yeh [1 ]
Oyang, Yen-Jen [2 ,3 ,4 ]
机构
[1] Natl Cheng Kung Univ, Dept Elect Engn, Tainan 70101, Taiwan
[2] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan
[3] Natl Taiwan Univ, Inst Networking & Multimedia, Taipei 106, Taiwan
[4] Natl Taiwan Univ, Ctr Syst Biol & Bioinformat, Taipei 106, Taiwan
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
IDENTIFICATION; MIRNAS; MODEL; CLASSIFICATION; ACCURATE; SEQUENCE; RNAS;
D O I
10.1186/1471-2105-11-S1-S52
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: MicroRNAs (miRNAs) are short non-coding RNA molecules, which play an important role in post-transcriptional regulation of gene expression. There have been many efforts to discover miRNA precursors (pre-miRNAs) over the years. Recently, ab initio approaches have attracted more attention because they do not depend on homology information and provide broader applications than comparative approaches. Kernel based classifiers such as support vector machine (SVM) are extensively adopted in these ab initio approaches due to the prediction performance they achieved. On the other hand, logic based classifiers such as decision tree, of which the constructed model is interpretable, have attracted less attention. Results: This article reports the design of a predictor of pre-miRNAs with a novel kernel based classifier named the generalized Gaussian density estimator (G(2)DE) based classifier. The G(2)DE is a kernel based algorithm designed to provide interpretability by utilizing a few but representative kernels for constructing the classification model. The performance of the proposed predictor has been evaluated with 692 human pre-miRNAs and has been compared with two kernel based and two logic based classifiers. The experimental results show that the proposed predictor is capable of achieving prediction performance comparable to those delivered by the prevailing kernel based classification algorithms, while providing the user with an overall picture of the distribution of the data set. Conclusion: Software predictors that identify pre-miRNAs in genomic sequences have been exploited by biologists to facilitate molecular biology research in recent years. The G(2)DE employed in this study can deliver prediction accuracy comparable with the state-of-the-art kernel based machine learning algorithms. Furthermore, biologists can obtain valuable insights about the different characteristics of the sequences of pre-miRNAs with the models generated by the G(2)DE based predictor.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Distance and density based clustering algorithm using Gaussian kernel
    Gungor, Emre
    Ozmen, Ahmet
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 69 : 10 - 20
  • [32] An Improved Distance Estimation Algorithm Based on Generalized CRT
    Deng, Ping
    Cui, Yunhe
    [J]. 2012 IEEE VEHICULAR TECHNOLOGY CONFERENCE (VTC FALL), 2012,
  • [33] An Adaptive Detection Algorithm for Blind Watermark Based on Generalized Gaussian Distribution
    Li, Zhiming
    Wang, Taiyue
    [J]. INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 2058 - +
  • [34] Target detection algorithm based on generalized inverse Gaussian texture structure
    Chen, Duo
    Fan, Yifei
    Su, Jia
    Guo, Zixun
    Tao, Mingliang
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2024, 46 (12): : 4018 - 4025
  • [35] Predicting human microRNA precursors based on an optimized feature subset generated by GA-SVM
    Wang, Yanqiu
    Chen, Xiaowen
    Jiang, Wei
    Li, Li
    Li, Wei
    Yang, Lei
    Liao, Mingzhi
    Lian, Baofeng
    Lv, Yingli
    Wang, Shiyuan
    Wang, Shuyuan
    Li, Xia
    [J]. GENOMICS, 2011, 98 (02) : 73 - 78
  • [36] Least square and Gaussian process for image based microalgal density estimation
    Nguyen, Linh
    Nguyen, Dung K.
    Nghiem, Truong X.
    Nguyen, Thang
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 193
  • [37] A new algorithm for clustering based on kernel density estimation
    Matioli, L. C.
    Santos, S. R.
    Kleina, M.
    Leite, E. A.
    [J]. JOURNAL OF APPLIED STATISTICS, 2018, 45 (02) : 347 - 366
  • [38] A contact algorithm for density-based load estimation
    Bona, MA
    Martin, LD
    Fischer, KJ
    [J]. JOURNAL OF BIOMECHANICS, 2006, 39 (04) : 636 - 644
  • [39] Texture image retrieval based on DT-CWT and generalized Gaussian density
    [J]. Zhang, J.-W. (zhangjw@lzu.edu.cn), 1600, Editorial Board of Jilin University (43):
  • [40] A Parabolic Detection Algorithm Based on Kernel Density Estimation
    Liu, Xiaomin
    Song, Qi
    Li, Peihua
    [J]. EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009, 5754 : 405 - 412