Combining Multiple K-Means Clusterings of Chemical Structures Using Cluster-Based Similarity Partitioning Algorithm

被引:0
|
作者
Saeedi, Faisal [1 ,2 ]
Salim, Naomie [1 ]
Abdo, Ammar [3 ]
Hentabli, Hamza [1 ]
机构
[1] Univ Teknol Malaysia, Fac Comp Sci & Informat Syst, Johor Baharu, Johor, Malaysia
[2] Dept Informat Technol, Sanhan Community Coll, Sanaa, Yemen
[3] Hodeidah Univ, Dept Comp Sci, Hodeidah, Yemen
来源
ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS | 2012年 / 322卷
关键词
2D Fingerprint; Compound Selection; Consensus Clustering; K-Means; Molecular Datasets; Ward's Method; DATA FUSION; COMBINATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Consensus clustering methods have been used in many areas to improve the quality of individual clusterings. In this paper, graph-based consensus clustering, Cluster-based Similarity Partitioning Algorithm (CSPA), was used to improve the quality of chemical structures clustering by enhancing the ability to separate active from inactive molecules in each cluster and improve the robustness and stability of individual clusterings. The clustering was evaluated using Quality Partition Index (QPI) measure and the results were compared with the Ward's clustering method. The chemical dataset MDL Drug Data Report (MDDR) database was used for experiments. The results obtained by combining multiple K-means clusterings showed that graph-based consensus clustering, CSPA, can improve the quality of individual chemical structure clusterings.
引用
收藏
页码:304 / +
页数:3
相关论文
共 50 条
  • [1] Combining Multiple Individual Clusterings of Chemical Structures Using Cluster-Based Similarity Partitioning Algorithm
    Saeed, Faisal
    Salim, Naomie
    Abdo, Ammar
    Hentabli, Hamza
    ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS, 2012, 322 : 276 - +
  • [2] A cluster-based oversampling algorithm combining SMOTE and k-means for imbalanced medical data
    Xu, Zhaozhao
    Shen, Derong
    Nie, Tiezheng
    Kou, Yue
    Yin, Nan
    Han, Xi
    INFORMATION SCIENCES, 2021, 572 : 574 - 589
  • [3] Combining Multiple Clusterings of Chemical Structures Using Cumulative Voting-Based Aggregation Algorithm
    Saeed, Faisal
    Salim, Naomie
    Abdo, Ammar
    Hentabli, Hamza
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 178 - 185
  • [4] Adaptive Cumulative Voting-Based Aggregation Algorithm for Combining Multiple Clusterings of Chemical Structures
    Saeed, Faisal
    Salim, Naomie
    Abdo, Ammar
    Hentabli, Hamza
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 305 - 314
  • [5] Using Soft Consensus Clustering for Combining Multiple Clusterings of Chemical Structures
    Saeed, Faisal
    Salim, Naomie
    JURNAL TEKNOLOGI, 2013, 63 (01):
  • [6] An Improved K-Means Algorithm Based on Contour Similarity
    Zhao, Jing
    Bao, Yanke
    Li, Dongsheng
    Guan, Xinguo
    MATHEMATICS, 2024, 12 (14)
  • [7] Using graph-based consensus clustering for combining K-means clustering of heterogeneous chemical structures
    Faisal Saeed
    Naomie Salim
    Ammar Abdo
    Hentabli Hamza
    Journal of Cheminformatics, 5 (Suppl 1)
  • [8] Enhancing the K-means Algorithm Using Cluster Adjustment
    Yamout, Fadi
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 307 - 311
  • [9] Optimization and improvement based on K-Means Cluster algorithm
    Wu, Jieming
    Yu, Wenhu
    2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 3, 2009, : 335 - 339
  • [10] An efficient k-means clustering algorithm using simple partitioning
    Hung, MC
    Wu, JP
    Chang, JH
    Yang, DL
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2005, 21 (06) : 1157 - 1177