Weighted consensus clustering and its application to Big data

被引:17
|
作者
Alguliyev, Rasim M. [1 ]
Aliguliyev, Ramiz M. [1 ]
Sukhostat, Lyudmila, V [1 ]
机构
[1] Azerbaijan Natl Acad Sci, Inst Informat Technol, 9A B Vahabzade St, AZ-1141 Baku, Azerbaijan
关键词
Weighted consensus clustering; Big data; Utility function; Purity-based utility function; Co-association matrix; ENSEMBLE; ALGORITHM; INDEXES;
D O I
10.1016/j.eswa.2020.113294
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The aim of this study is the development of a weighted consensus clustering that assigns weights to single clustering methods using the purity utility function. In the case of Big data that does not contain labels, the utility function based on the Davies-Bouldin index is proposed in this paper. The Banknote authentication, Phishing, Diabetic, Magic04, Credit card clients, Covertype, Phone accelerometer, and NSL-KDD datasets are used to assess the efficiency of the proposed consensus approach. The proposed approach is evaluated using the Euclidean, Minkowski, squared Euclidean, cosine, and Chebychev distance metrics. It is compared with single clustering algorithms (DBSCAN, OPTICS, CLARANS, k-means, and shared nearby neighbor clustering). The experimental results show the effectiveness of the proposed approach to the Big data clustering in comparison to single clustering methods. The proposed weighted consensus clustering using the squared Euclidean distance metric achieves the highest accuracy, which is a very promising result for Big data clustering. It can be applied to expert systems to help experts make group decisions based on several alternatives. The paper also provides directions for future research on consensus clustering in this area. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] A Distributed Weighted Possibilistic c-Means Algorithm for Clustering Incomplete Big Sensor Data
    Zhang, Qingchen
    Chen, Zhikui
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2014,
  • [42] Fuzzy Weighted Clustering Method for Numerical Attributes of Communication Big Data Based on Cloud Computing
    Ding, Haitao
    Sun, Chu
    Zeng, Jianqiu
    SYMMETRY-BASEL, 2020, 12 (04):
  • [43] Fuzzy Divergence Weighted Ensemble Clustering With Spectral Learning Based on Random Projections for Big Data
    Lahmar, Ines
    Zaier, Aida
    Yahia, Mohamed
    Ali, Tarig
    Boaullegue, Ridha
    IEEE ACCESS, 2024, 12 : 20197 - 20208
  • [44] Deep Learning Model and Its Application in Big Data
    Zhou, Yuanming
    Zhao, Shifeng
    Wang, Xuesong
    Liu, Wei
    DESIGN, USER EXPERIENCE, AND USABILITY: THEORY AND PRACTICE, DUXU 2018, PT I, 2018, 10918 : 795 - 806
  • [45] OpenStack Platform and its Application in Big Data Processing
    Shao, Cen
    Liang, Bo
    Wang, Feng
    Deng, Hui
    Dai, Wei
    Wei, Shoulin
    Zhang, Xiaoli
    Yuan, Zhi
    2015 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKS AND INTELLIGENT SYSTEMS (ICINIS), 2015, : 98 - 101
  • [46] Literature review on Big Data and Its Application Fields
    Fan, Xiani
    Li, Zeping
    Zhou, Li
    2018 INTERNATIONAL SEMINAR ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (SCSET 2018), 2019, 1176
  • [47] Multivariate functional clustering and its application to typhoon data
    Misumi T.
    Matsui H.
    Konishi S.
    Behaviormetrika, 2019, 46 (1) : 163 - 175
  • [48] A novel memetic algorithm and its application to data clustering
    Ni, JiaCheng
    Li, Li
    Qiao, Fei
    Wu, QiDi
    MEMETIC COMPUTING, 2013, 5 (01) : 65 - 78
  • [49] Fuzzy Clustering with ε-Hyperballs and Its Application to Data Classification
    Jezewski, Michal
    Czabanski, Robert
    Leski, Jacek
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2017, PT II, 2017, 10246 : 84 - 93
  • [50] A novel memetic algorithm and its application to data clustering
    JiaCheng Ni
    Li Li
    Fei Qiao
    QiDi Wu
    Memetic Computing, 2013, 5 : 65 - 78