Predicting miRNA’s target from primary structure by the nearest neighbor algorithm

被引:0
|
作者
Kao Lin
Ziliang Qian
Lin Lu
Lingyi Lu
Lihui Lai
Jieyi Gu
Zhenbing Zeng
Haipeng Li
Yudong Cai
机构
[1] Chinese Academy of Sciences,CAS
[2] Graduate School of the Chinese Academy of Sciences,MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences
[3] Chinese Academy of Sciences,Bioinformatics Center, Key Lab of Molecular Systems Biology, Shanghai Institutes for Biological Sciences
[4] Shanghai JiaoTong University,Department of Biomedical Engineering
[5] East China Normal University,School of Life Science
[6] East China Normal University,Software Engineering Institute
[7] Shanghai University,Institute of System Biology
来源
Molecular Diversity | 2010年 / 14卷
关键词
miRNA; Target; Predict; Nearest neighbor algorithm; Minimum redundancy maximum relevance; Properties forward selection;
D O I
暂无
中图分类号
学科分类号
摘要
We used a machine learning method, the nearest neighbor algorithm (NNA), to learn the relationship between miRNAs and their target proteins, generating a predictor which can then judge whether a new miRNA-target pair is true or not. We acquired 198 positive (true) miRNA-target pairs from Tarbase and the literature, and generated 4,888 negative (false) pairs through random combination. A 0/1 system and the frequencies of single nucleotides and di-nucleotides were used to encode miRNAs into vectors while various physicochemical parameters were used to encode the targets. The NNA was then applied, learning from these data to produce a predictor. We implemented minimum redundancy maximum relevance (mRMR) and properties forward selection (PFS) to reduce the redundancy of our encoding system, obtaining 91 most efficient properties. Finally, via the Jackknife cross-validation test, we got a positive accuracy of 69.2% and an overall accuracy of 96.0% with all the 253 properties. Besides, we got a positive accuracy of 83.8% and an overall accuracy of 97.2% with the 91 most efficient properties. A web-server for predictions is also made available at http://app3.biosino.org:8080/miRTP/index.jsp.
引用
收藏
页码:719 / 729
页数:10
相关论文
共 50 条
  • [31] Predicting Viral Protein Subcellular Localization with Chou's Pseudo Amino Acid Composition and Imbalance-Weighted Multi-Label K-Nearest Neighbor Algorithm
    Cao, Jun-Zhe
    Liu, Wen-Qi
    Gu, Hong
    PROTEIN AND PEPTIDE LETTERS, 2012, 19 (11): : 1163 - 1169
  • [32] Nearest Hyperplane Distance Neighbor Clustering algorithm Applied to Gene Co-Expression Analysis in Alzheimer's Disease
    Pasluosta, Cristian F.
    Dua, Prerna
    Lukiw, Walter J.
    2011 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2011, : 5559 - 5562
  • [33] Online Adjustment of the AI's Strength in a Fighting Game Using the k-Nearest Neighbor Algorithm and a Game Simulator
    Nakagawa, Yuto
    Yamamoto, Kaito
    Thawonmas, Ruck
    2014 IEEE 3RD GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2014, : 494 - 495
  • [34] Fast k-nearest neighbor search algorithm based on pyramid structure of wavelet transform and its application to texture classification
    Qiao, Yu-Long
    Lu, Zhe-Ming
    Pan, Jeng-Shyang
    Sun, Sheng-He
    DIGITAL SIGNAL PROCESSING, 2010, 20 (03) : 837 - 845
  • [35] Predicting Drug-Target Interactions Between New Drugs and New Targets via Pairwise K-nearest Neighbor and Automatic Similarity Selection
    Shi, Jian-Yu
    Li, Jia-Xin
    Lu, Hui-Meng
    Zhang, Yong
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING TECHNIQUES, ISCIDE 2015, PT II, 2015, 9243 : 477 - 486
  • [36] Electronic structure of Ga1-xInxN by the tight-binding method with nearest-neighbor s, p and d and second-neighbor s and p interactions
    Nakajima, S
    Yang, T
    Sakai, S
    SILICON CARBIDE AND RELATED MATERIALS 1995, 1996, 142 : 947 - 950
  • [37] RNA secondary structure prediction from sequence alignments using a network of k-nearest neighbor classifiers
    Bindewald, E
    Shapiro, BA
    RNA, 2006, 12 (03) : 342 - 352
  • [38] An efficient O(1) time 3D all nearest neighbor algorithm from image processing perspective
    Wang, Yuh-Rau
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2007, 67 (10) : 1082 - 1091
  • [39] Predicting ENR's Construction Cost Index Using the Modified K Nearest Neighbors (KNN) Algorithm
    Wang, Jun
    Ashuri, Baabak
    CONSTRUCTION RESEARCH CONGRESS 2016: OLD AND NEW CONSTRUCTION TECHNOLOGIES CONVERGE IN HISTORIC SAN JUAN, 2016, : 2502 - 2509
  • [40] Enhanced Harris hawks optimization-based fuzzy k-nearest neighbor algorithm for diagnosis of Alzheimer's disease
    Zhang, Qian
    Sheng, Jinhua
    Zhang, Qiao
    Wang, Luyun
    Yang, Ze
    Xin, Yu
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 165