Comparative analysis and prediction of nucleosome positioning using integrative feature representation and machine learning algorithms

被引:6
|
作者
Han, Guo-Sheng [1 ,2 ,3 ]
Li, Qi [1 ,2 ,3 ]
Li, Ying [1 ,2 ,3 ]
机构
[1] Xiangtan Univ, Dept Math & Computat Sci, Xiangtan 411105, Hunan, Peoples R China
[2] Xiangtan Univ, Key Lab Intelligent Comp & Informat Proc, Minist Educ, Xiangtan 411105, Hunan, Peoples R China
[3] Xiangtan Univ, Hunan Key Lab Computat & Simulat Sci & Engn, Xiangtan 411105, Hunan, Peoples R China
关键词
Nucleosome classification; Frequency chaos game representation; Support vector machine; Extreme learning machine; Extreme gradient boosting; Convolutional neural networks; CHAOS GAME REPRESENTATION; K-TUPLE; HIGH-RESOLUTION; IDENTIFICATION; OCCUPANCY; SEQUENCES; PSEKNC;
D O I
10.1186/s12859-021-04006-w
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Nucleosome plays an important role in the process of genome expression, DNA replication, DNA repair and transcription. Therefore, the research of nucleosome positioning has invariably received extensive attention. Considering the diversity of DNA sequence representation methods, we tried to integrate multiple features to analyze its effect in the process of nucleosome positioning analysis. This process can also deepen our understanding of the theoretical analysis of nucleosome positioning. Results Here, we not only used frequency chaos game representation (FCGR) to construct DNA sequence features, but also integrated it with other features and adopted the principal component analysis (PCA) algorithm. Simultaneously, support vector machine (SVM), extreme learning machine (ELM), extreme gradient boosting (XGBoost), multilayer perceptron (MLP) and convolutional neural networks (CNN) are used as predictors for nucleosome positioning prediction analysis, respectively. The integrated feature vector prediction quality is significantly superior to a single feature. After using principal component analysis (PCA) to reduce the feature dimension, the prediction quality of H. sapiens dataset has been significantly improved. Conclusions Comparative analysis and prediction on H. sapiens, C. elegans, D. melanogaster and S. cerevisiae datasets, demonstrate that the application of FCGR to nucleosome positioning is feasible, and we also found that integrative feature representation would be better.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Comparative analysis and prediction of nucleosome positioning using integrative feature representation and machine learning algorithms
    Guo-Sheng Han
    Qi Li
    Ying Li
    BMC Bioinformatics, 22
  • [2] Comparative analysis and prediction of quorum-sensing peptides using feature representation learning and machine learning algorithms
    Wei, Leyi
    Hu, Jie
    Li, Fuyi
    Song, Jiangning
    Su, Ran
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (01) : 106 - 119
  • [3] Comparative Analysis of Stress Prediction Using Unsupervised Machine Learning Algorithms
    Maurya, Istuti
    Sarvaiya, Anjali
    Upla, Kishor
    Ramachandra, Raghavendra
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT III, 2024, 2011 : 261 - 271
  • [4] Prediction of nucleosome positioning using a support vector machine
    Bishop, Eric
    Tullius, Thomas D.
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2007, 24 (06): : 624 - 624
  • [5] Comparative Analysis of Machine Learning Algorithms for Rainfall Prediction
    Patil, Rudragoud
    Bedekar, Gayatri
    INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, ICIDCA 2021, 2022, 96 : 833 - 842
  • [6] Comparative Analysis of Building Insurance Prediction Using Some Machine Learning Algorithms
    Ejiyi, Chukwuebuka Joseph
    Qin, Zhen
    Salako, Abdulhaq Adetunji
    Happy, Monday Nkanta
    Nneji, Grace Ugochi
    Ukwuoma, Chiagoziem Chima
    Chikwendu, Ijeoma Amuche
    Gen, Ji
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2022, 7 (03): : 75 - 85
  • [7] CHURN PREDICTION - A COMPARATIVE ANALYSIS WITH SUPERVISED MACHINE LEARNING ALGORITHMS
    Gangadharan, Chika K.
    Alex, Roshni
    Sabu, M. K.
    ADVANCES AND APPLICATIONS IN MATHEMATICAL SCIENCES, 2021, 20 (12): : 3049 - 3060
  • [8] Comparative Analysis of Machine Learning Algorithms to Urban Traffic Prediction
    Lee, Yong-Ju
    Min, Okgee
    2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2017, : 1034 - 1036
  • [9] Machine Learning Algorithms for Transportation Mode Prediction: A Comparative Analysis
    Murrar S.
    Alhaj F.
    Qutqut M.H.
    Informatica (Slovenia), 2024, 48 (06): : 117 - 130
  • [10] Comparative Analysis of Machine Learning Algorithms for CKD Risk Prediction
    Yang, Weilin
    Ahmed, Nasim
    Barczak, Andre L. C.
    IEEE ACCESS, 2024, 12 : 171205 - 171220