An Efficient Prediction of HPV Genotypes from Partial Coding Sequences by Chaos Game Representation and Fuzzy k-Nearest Neighbor Technique

被引:8
|
作者
Tanchotsrinon, Watcharaporn [1 ]
Lursinsap, Chidchanok [1 ]
Poovorawan, Yong [2 ]
机构
[1] Chulalongkorn Univ, Dept Math & Comp Sci, Fac Sci, Bangkok, Thailand
[2] Chulalongkorn Univ, Ctr Excellence Clin Virol, Fac Med, Bangkok, Thailand
关键词
Prediction; Human Papillomavirus; HPV; genotype; cervical cancer; INVASIVE CERVICAL-CANCER; INFECTION;
D O I
10.2174/1574893611666161110112006
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Human Papillomavirus is considered as a necessary cause of cervical cancer, which is the second most common cancer in women around the world. At present, an individual genotyping of Human Papillomavirus can provide essential information for an improvement of diagnosis and medical treatment to infected patients. Objective: For this purpose, our paper focuses on predicting the significant Human Papillomavirus genotypes mainly associated with cervical cancers. Method: In this experiment, partial coding sequences of genotypes were transformed into coordinates in chaos game representations, and they were subsequently partitioned into 8x8 equal sub-regions. Probabilities of distribution in sub-regions were extracted in forms of tri-nucleotide frequencies. Then, two-fold cross validation technique was employed for separating training and testing sets. For each fold, a feature selection by RReliefF algorithm was conducted for selecting significant features, followed by predicting the corresponding genotypes by fuzzy k-nearest neighbor technique. Results: The experimental results showed that our proposed method can achieve higher performance than two related methods, while RReliefF algorithm can successfully reduce all of 64 extracted features into 29 significant features. Additionally, it also found that our experimental results are significantly different from those of the method of Nair et al., in almost all genotypes. Conclusion: Therefore, the algorithm based on chaos game representation and fuzzy k-nearest neighbor technique can efficiently predict Human Papillomavirus genotypes.
引用
下载
收藏
页码:431 / 440
页数:10
相关论文
共 31 条
  • [1] Efficient Heart Disease Prediction System using K-Nearest Neighbor Classification Technique
    Khateeb, Nida
    Usman, Muhammad
    INTERNATIONAL CONFERENCE ON BIG DATA AND INTERNET OF THINGS (BDIOT 2017), 2017, : 21 - 26
  • [2] Prediction of protein solvent accessibility using fuzzy k-nearest neighbor method
    Sim, J
    Kim, SY
    Lee, J
    BIOINFORMATICS, 2005, 21 (12) : 2844 - 2849
  • [3] A novel bankruptcy prediction model based on an adaptive fuzzy k-nearest neighbor method
    Chen, Hui-Ling
    Yang, Bo
    Wang, Gang
    Liu, Jie
    Xu, Xin
    Wang, Su-Jing
    Liu, Da-You
    KNOWLEDGE-BASED SYSTEMS, 2011, 24 (08) : 1348 - 1359
  • [4] Fault Analysis and Prediction of Transmission Line Based on Fuzzy K-Nearest Neighbor Algorithm
    Zhang, Yue
    Chen, Jianxia
    Fang, Qin
    Ye, Zhiwei
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 894 - 899
  • [5] A high performance prediction of HPV genotypes by Chaos game representation and singular value decomposition
    Watcharaporn Tanchotsrinon
    Chidchanok Lursinsap
    Yong Poovorawan
    BMC Bioinformatics, 16
  • [6] A high performance prediction of HPV genotypes by Chaos game representation and singular value decomposition
    Tanchotsrinon, Watcharaporn
    Lursinsap, Chidchanok
    Poovorawan, Yong
    BMC BIOINFORMATICS, 2015, 16 : 1 - 13
  • [7] An Efficient Approach for Prediction of Nuclear Receptor and Their Subfamilies Based on Fuzzy k-Nearest Neighbor with Maximum Relevance Minimum Redundancy
    Tiwari, Arvind Kumar
    Srivastava, Rajeev
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES INDIA SECTION A-PHYSICAL SCIENCES, 2018, 88 (01) : 129 - 136
  • [8] An Efficient Approach for Prediction of Nuclear Receptor and Their Subfamilies Based on Fuzzy k-Nearest Neighbor with Maximum Relevance Minimum Redundancy
    Arvind Kumar Tiwari
    Rajeev Srivastava
    Proceedings of the National Academy of Sciences, India Section A: Physical Sciences, 2018, 88 : 129 - 136
  • [9] FINkNN:: A fuzzy interval number k-nearest neighbor classifier for prediction of sugar production from populations of samples
    Petridis, V
    Kaburlasos, VG
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (01) : 17 - 37
  • [10] Prediction of moving objects' k-nearest neighbor based on fuzzy-rough sets theory
    Hong, Xiaoguang
    Yuan, Yan
    Hu, Xinglei
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2007, : 407 - 411