BLKnn: A K-Nearest Neighbors Method For Predicting Bioluminescent Proteins

被引:0
|
作者
Hu, Jing [1 ]
机构
[1] Franklin & Marshall Coll, Dept Math & Comp Sci, Lancaster, PA 17604 USA
关键词
K-nearest neighbors method; bit-score weighted Euclidean distance; pseudo-amino acid composition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bioluminescence is a chemical process in which light is produced and emitted by a living organism. Recent biotechnological applications of bioluminescence include using of bioluminescent proteins in gene expression analysis, bioluminescent imaging, study of protein-protein interaction and disease progression, drug discovery, toxicity determination, etc. Therefore, it is of great medical and commercial significances to identify bioluminescent proteins accurately and efficiently. In this study, we present BLKnn, a K-nearest neighbors method that can predict bioluminescent proteins. This method is based on the bit-score weighted Euclidean distance, which is calculated from compositions of selected amino acids and pseudo-amino acids. On a balanced training dataset, BLKnn achieved 74.9% sensitivity, 95.5% specificity, 85.2% accuracy, and 0.919 AUC (area under the ROC curve) by 10-fold cross-validation. When tested on a much bigger independent test dataset, the method also achieved a consistent performance of 88.0% overall accuracy and 0.989 AUC. Comparisons showed that BLKnn outperformed previously published methods. The method is available at https://edisk.fandm.edu/jing.hu/blknn/blknn.html.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] METHOD FOR DETERMINING K-NEAREST NEIGHBORS
    KITTLER, J
    [J]. KYBERNETES, 1978, 7 (04) : 313 - 315
  • [2] K-Nearest Neighbors Hashing
    He, Xiangyu
    Wang, Peisong
    Cheng, Jian
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2834 - 2843
  • [3] Modernizing k-nearest neighbors
    Elizabeth Yancey, Robin
    Xin, Bochao
    Matloff, Norm
    [J]. STAT, 2021, 10 (01):
  • [4] Toward Predicting Medical Conditions Using k-Nearest Neighbors
    Tayeb, Shahab
    Pirouz, Matin
    Sun, Johann
    Hall, Kaylee
    Chang, Andrew
    Li, Jessica
    Song, Connor
    Chauhan, Apoorva
    Ferra, Michael
    Sager, Theresa
    Zhan, Justin
    Latifi, Shahram
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 3897 - 3903
  • [5] Improving the speed and stability of the k-nearest neighbors method
    Beliakov, Gleb
    Li, Gang
    [J]. PATTERN RECOGNITION LETTERS, 2012, 33 (10) : 1296 - 1301
  • [6] A K-nearest neighbors survival probability prediction method
    Lowsky, D. J.
    Ding, Y.
    Lee, D. K. K.
    McCulloch, C. E.
    Ross, L. F.
    Thistlethwaite, J. R.
    Zenios, S. A.
    [J]. STATISTICS IN MEDICINE, 2013, 32 (12) : 2062 - 2069
  • [7] K-nearest neighbors in uncertain graph
    Zhang, Yinglong
    Li, Cuiping
    Chen, Hong
    Du, Lingxia
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2011, 48 (10): : 1850 - 1858
  • [8] Classification with learning k-nearest neighbors
    Laaksonen, J
    Oja, E
    [J]. ICNN - 1996 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS. 1-4, 1996, : 1480 - 1483
  • [9] Hausdorff Distance with k-Nearest Neighbors
    Wang, Jun
    Tan, Ying
    [J]. ADVANCES IN SWARM INTELLIGENCE, ICSI 2012, PT II, 2012, 7332 : 272 - 281
  • [10] K-nearest neighbors clustering algorithm
    Gauza, Dariusz
    Zukowska, Anna
    Nowak, Robert
    [J]. PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2014, 2014, 9290