Predicting miRNA’s target from primary structure by the nearest neighbor algorithm

被引:0
|
作者
Kao Lin
Ziliang Qian
Lin Lu
Lingyi Lu
Lihui Lai
Jieyi Gu
Zhenbing Zeng
Haipeng Li
Yudong Cai
机构
[1] Chinese Academy of Sciences,CAS
[2] Graduate School of the Chinese Academy of Sciences,MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences
[3] Chinese Academy of Sciences,Bioinformatics Center, Key Lab of Molecular Systems Biology, Shanghai Institutes for Biological Sciences
[4] Shanghai JiaoTong University,Department of Biomedical Engineering
[5] East China Normal University,School of Life Science
[6] East China Normal University,Software Engineering Institute
[7] Shanghai University,Institute of System Biology
来源
Molecular Diversity | 2010年 / 14卷
关键词
miRNA; Target; Predict; Nearest neighbor algorithm; Minimum redundancy maximum relevance; Properties forward selection;
D O I
暂无
中图分类号
学科分类号
摘要
We used a machine learning method, the nearest neighbor algorithm (NNA), to learn the relationship between miRNAs and their target proteins, generating a predictor which can then judge whether a new miRNA-target pair is true or not. We acquired 198 positive (true) miRNA-target pairs from Tarbase and the literature, and generated 4,888 negative (false) pairs through random combination. A 0/1 system and the frequencies of single nucleotides and di-nucleotides were used to encode miRNAs into vectors while various physicochemical parameters were used to encode the targets. The NNA was then applied, learning from these data to produce a predictor. We implemented minimum redundancy maximum relevance (mRMR) and properties forward selection (PFS) to reduce the redundancy of our encoding system, obtaining 91 most efficient properties. Finally, via the Jackknife cross-validation test, we got a positive accuracy of 69.2% and an overall accuracy of 96.0% with all the 253 properties. Besides, we got a positive accuracy of 83.8% and an overall accuracy of 97.2% with the 91 most efficient properties. A web-server for predictions is also made available at http://app3.biosino.org:8080/miRTP/index.jsp.
引用
收藏
页码:719 / 729
页数:10
相关论文
共 50 条
  • [21] An Adjusted Nearest Neighbor Algorithm Maximizing the F-Measure from Imbalanced Data
    Viola, Remi
    Emonet, Remi
    Habrard, Amaury
    Metzler, Guillaume
    Riou, Sebastien
    Sebban, Marc
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 243 - 250
  • [22] Nearest Neighbor-Genetic Algorithm for Downscaling of Climate Change Data from GCMs
    Kim, Soojun
    Kwak, Jaewon
    Kim, Hung Soo
    Jung, Younghun
    Kim, Gilho
    JOURNAL OF APPLIED METEOROLOGY AND CLIMATOLOGY, 2016, 55 (03) : 773 - 789
  • [23] Scalable, explainable, adaptive information extraction from structure-aware nearest neighbor
    Lu, Shudong
    Li, Si
    Guo, Jun
    NEUROCOMPUTING, 2024, 605
  • [24] Predicting persistence in the sediment compartment with a new automatic software based on the k-Nearest Neighbor (k-NN) algorithm
    Manganaro, Alberto
    Pizzo, Fabiola
    Lombardo, Anna
    Pogliaghi, Alberto
    Benfenati, Emilio
    CHEMOSPHERE, 2016, 144 : 1624 - 1630
  • [25] A regression-based K nearest neighbor algorithm for gene function prediction from heterogenous data
    Department of Computer Science and Engineering, AC101 Paul G. Center, University of Washington, Seattle, WA 98195, United States
    不详
    BMC Bioinform., 2006, SUPPL.1
  • [26] A Regression-based K nearest neighbor algorithm for gene function prediction from heterogeneous data
    Zizhen Yao
    Walter L Ruzzo
    BMC Bioinformatics, 7
  • [27] A Regression-based K nearest neighbor algorithm for gene function prediction from heterogeneous data
    Yao, ZZ
    Ruzzo, WL
    BMC BIOINFORMATICS, 2006, 7 (Suppl 1)
  • [28] Using Adaptive K-nearest Neighbor Algorithm and Cellular Automata Images to Predicting G-Protein-Coupled Receptor Classes
    Xiao, Xuan
    Qiu, Wang-Ren
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2010, 2 (02) : 180 - 184
  • [29] Using adaptive K-nearest neighbor algorithm and cellular automata images to predicting G-protein-coupled receptor classes
    Xuan Xiao
    Wang-Ren Qiu
    Interdisciplinary Sciences: Computational Life Sciences, 2010, 2 : 180 - 184
  • [30] NEAREST NEIGHBOR SPACINGS OF CLEAN VANADIUM SURFACES FROM EXTENDED APPEARANCE POTENTIAL FINE-STRUCTURE
    ELAM, WT
    COHEN, PI
    ROELOFS, L
    PARK, RL
    APPLIED SURFACE SCIENCE, 1979, 2 (04) : 636 - 639