Drug-target interaction data cluster analysis based on improving the density peaks clustering algorithm

被引:7
|
作者
Guo, Maozu [1 ,2 ,3 ]
Yu, Donghua [1 ]
Liu, Guojun [1 ]
Liu, Xiaoyan [1 ]
Cheng, Shuang [4 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Heilongjiang, Peoples R China
[2] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing 100044, Peoples R China
[3] Beijing Key Lab Intelligent Proc Bldg Big Data, Beijing 100044, Peoples R China
[4] China Acad Engn Phys, Inst Mat, Mianyang 621907, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Drug-target interaction data; cluster analysis; density-based clustering; cutoff distance sequence; INTERACTION PREDICTION; INFORMATION; KNN;
D O I
10.3233/IDA-184382
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since drug-target data have neither class labels nor the cluster number information, they are not suitable for clustering algorithms that require predefined parameters determined by comparing clustering results with real class labels. Density peaks clustering (DPC) is a density-based clustering algorithm that can determine the number of clusters without requiring class labels. However, the predefined cutoff distance of local density limits its wide application. Therefore, this paper proposes an improved local density method based on a cutoff distance sequence that overcomes the limitations of DPC and can be successful applied to drug-target data. We also introduce multiple-dimensional scaling based on drug and target similarity and perform intuitive graph analysis of the two most significant differentiation features. Drugs of the Enzyme, GPCR, Ion Channel, and Nuclear Receptor 4 standard datasets are identified as 6, 6, 3, and 5 clusters by an improved algorithm, respectively, and similarly, their targets are identified be 5, 5, 8, and 4 clusters. Drug-target data clustering results of the improved algorithm are more reasonable than the results of the fast K-medoids and hierarchical clustering algorithms.
引用
收藏
页码:1335 / 1353
页数:19
相关论文
共 50 条
  • [1] Drug-target protein interaction prediction based on AdaBoost algorithm
    基于AdaBoost算法的药物-靶向蛋白作用预测算法
    [J]. Xie, Xianfen (xiexianfen2009@jnu.edu.cn), 2018, West China Hospital, Sichuan Institute of Biomedical Engineering (35):
  • [2] Biases of Drug-Target Interaction Network Data
    van Laarhoven, Twan
    Marchiori, Elena
    [J]. PATTERN RECOGNITION IN BIOINFORMATICS, PRIB 2014, 2014, 8626 : 23 - 33
  • [3] A Clustering Algorithm for Binary Protocol Data Frames Based on Principal Component Analysis and Density Peaks Clustering
    Yan, Xiaoyong
    Li, Qing
    Tao, Siyu
    [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT 2017), 2017, : 1260 - 1266
  • [4] Ensemble Learning Algorithm for Drug-Target Interaction Prediction
    Pathak, Sudipta
    Cai, Xingyu
    [J]. 2017 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES (ICCABS), 2017,
  • [5] A novel density peaks clustering algorithm for mixed data
    Du, Mingjing
    Ding, Shifei
    Xue, Yu
    [J]. PATTERN RECOGNITION LETTERS, 2017, 97 : 46 - 53
  • [6] MINDG: A Drug-Target Interaction Prediction Method Based on an Integrated Learning Algorithm
    Yang, Hailong
    Chen, Yue
    Zuo, Yun
    Deng, Zhaohong
    Pan, Xiaoyong
    Shen, Hong-Bin
    Choi, Kup-Sze
    Yu, Dong-Jun
    [J]. BIOINFORMATICS, 2024, 40 (04)
  • [7] Drug-Target Interaction Prediction Based on Transformer
    Liu, Junkai
    Jiang, Tengsheng
    Lu, Yaoyao
    Wu, Hongjie
    [J]. INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 : 302 - 309
  • [8] A New Weight Based Density Peaks Clustering Algorithm for Numerical and Categorical Data
    Tong, Wuning
    Wang, Yuping
    Zhong, Junkun
    Yan, Wei
    [J]. 2017 13TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2017, : 169 - 172
  • [9] An Ensemble Learning Algorithm Based on Density Peaks Clustering and Fitness for Imbalanced Data
    Xu, Hui
    Liu, Qicheng
    [J]. IEEE ACCESS, 2022, 10 : 116120 - 116128
  • [10] Drug-target affinity prediction using applicability domain based on data density
    Sugita, Shunya
    Ohue, Masahito
    [J]. 2021 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2021, : 224 - 229