Voting-based consensus clustering for combining multiple clusterings of chemical structures

被引:23
|
作者
Saeed, Faisal [1 ,2 ]
Salim, Naomie [1 ]
Abdo, Ammar [3 ,4 ,5 ]
机构
[1] Univ Technol Malaysia, Fac Comp Sci & Informat Syst, Johor Baharu, Malaysia
[2] Sanhan Community Coll, Informat Technol Dept, Sanaa, Yemen
[3] Alhodaida Univ, Dept Comp Sci, Alhodaida, Yemen
[4] Univ Lille 1, LIFL UMR CNRS 8022, F-59655 Villeneuve Dascq, France
[5] INRIA Lille Nord Europe, F-59655 Villeneuve Dascq, France
来源
JOURNAL OF CHEMINFORMATICS | 2012年 / 4卷
关键词
DATA FUSION; RECEPTOR-BINDING; SIMILARITY; CLASSIFICATIONS; COEFFICIENTS; DESCRIPTORS; COMBINATION; SELECTION;
D O I
10.1186/1758-2946-4-37
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Background: Although many consensus clustering methods have been successfully used for combining multiple classifiers in many areas such as machine learning, applied statistics, pattern recognition and bioinformatics, few consensus clustering methods have been applied for combining multiple clusterings of chemical structures. It is known that any individual clustering method will not always give the best results for all types of applications. So, in this paper, three voting and graph-based consensus clusterings were used for combining multiple clusterings of chemical structures to enhance the ability of separating biologically active molecules from inactive ones in each cluster. Results: The cumulative voting-based aggregation algorithm (CVAA), cluster-based similarity partitioning algorithm (CSPA) and hyper-graph partitioning algorithm (HGPA) were examined. The F-measure and Quality Partition Index method (QPI) were used to evaluate the clusterings and the results were compared to the Ward's clustering method. The MDL Drug Data Report (MDDR) dataset was used for experiments and was represented by two 2D fingerprints, ALOGP and ECFP_4. The performance of voting-based consensus clustering method outperformed the Ward's method using F-measure and QPI method for both ALOGP and ECFP_4 fingerprints, while the graph-based consensus clustering methods outperformed the Ward's method only for ALOGP using QPI. The Jaccard and Euclidean distance measures were the methods of choice to generate the ensembles, which give the highest values for both criteria. Conclusions: The results of the experiments show that consensus clustering methods can improve the effectiveness of chemical structures clusterings. The cumulative voting-based aggregation algorithm (CVAA) was the method of choice among consensus clustering methods.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Positional and confidence voting-based consensus functions for fuzzy cluster ensembles
    Sevillano, Xavier
    Alias, Francesc
    Claudi Socoro, Joan
    FUZZY SETS AND SYSTEMS, 2012, 193 : 1 - 32
  • [22] An energy-efficient voting-based clustering algorithm for sensor networks
    Qin, M
    Zimmermann, R
    SIXTH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERNG, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING AND FIRST AICS INTERNATIONAL WORKSHOP ON SELF-ASSEMBLING WIRELESS NETWORKS, PROCEEDINGS, 2005, : 444 - 451
  • [23] Voting-Based Multiple Classification Approach for Turkish News Texts
    Buluz, Basak
    Komecoglu, Yavuz
    Kizrak, Merve Ayyuce
    2019 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS CONFERENCE (ASYU), 2019, : 401 - 406
  • [24] Expediting In-Network Federated Learning by Voting-Based Consensus Model Compression
    Su, Xiaoxin
    Zhou, Yipeng
    Cui, Laizhong
    Guo, Song
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2024, : 1271 - 1280
  • [25] Unsupervised collaborative boosting of clustering: an unifying framework for multi-view clustering, multiple consensus clusterings and alternative clustering
    Sublemontier, Jacques-Henri
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [26] Voting-Based Decentralized Consensus Design for Improving the Efficiency and Security of Consortium Blockchain
    Sun, Gang
    Dai, Miao
    Sun, Jian
    Yu, Hongfang
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (08) : 6257 - 6272
  • [27] VCA: An energy-efficient voting-based clustering algorithm for sensor networks
    Qin, Min
    Zimmermann, Roger
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2007, 13 (01) : 87 - 109
  • [28] Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties
    Azad, A. K. M.
    Lee, Hyunju
    PLOS ONE, 2013, 8 (08):
  • [29] LVCA: An efficient voting-based consensus algorithm in private Blockchain for enhancing data security
    Verma, Sudhani
    Chandra, Girish
    Yadav, Divakar
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2025, 18 (02)
  • [30] Combining multiple clusterings using information theory based genetic algorithm
    Luo, Huilan
    Jing, Furong
    Xie, Xiaobing
    2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 84 - 89