A New Measure of Stability of Clustering Solutions: Application to Data Partitioning

Authors
Saha, Sriparna [1]
Bandyopadhyay, Sanghamitra [1]
Affiliations
[1] Indian Stat Inst, Machine Intelligence Unit, Kolkata, India
Keywords
clustering; multiobjective optimization (MOO); symmetry; stability; algorithm
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper first proposes a new measure of the stability of clustering solutions over different bootstrap samples of a data set. A multiobjective optimization based clustering technique is then developed that optimizes the measures of symmetry and stability simultaneously, automatically determining the appropriate number of clusters and the appropriate partitioning for data sets having symmetrically shaped clusters. The proposed algorithm uses a recently developed simulated annealing based multiobjective optimization technique, AMOSA, as the underlying optimization method. Points are assigned to clusters based on a recently developed point symmetry based distance rather than the Euclidean distance. Results on several artificial and real-life data sets show that the proposed technique is well suited to detecting the number of clusters in data sets having point-symmetric clusters.
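The abstract does not give the formula for the point symmetry based distance, so the following is only a minimal sketch under stated assumptions. It uses one common formulation from the point-symmetry clustering literature: reflect the point through a candidate cluster center, average the distances from the reflected point to its nearest data points, and scale by the Euclidean distance to the center. The function names, the `knear` parameter, and the toy data are hypothetical and are not taken from the paper.

```python
import math

def point_symmetry_distance(x, center, data, knear=2):
    """Illustrative point-symmetry-based distance (assumed form, not from the abstract)."""
    # Reflect x through the candidate cluster center: x* = 2c - x.
    reflected = [2 * c - xi for xi, c in zip(x, center)]
    # Distances from the reflected point to its knear nearest data points.
    nearest = sorted(math.dist(reflected, p) for p in data)[:knear]
    # A small average means the data set is nearly symmetric about the center.
    symmetry = sum(nearest) / len(nearest)
    # Scale by the Euclidean distance so that far-away centers are penalized.
    return symmetry * math.dist(x, center)

def assign(x, centers, data):
    """Return the index of the center with the smallest symmetry-based distance."""
    return min(range(len(centers)),
               key=lambda k: point_symmetry_distance(x, centers[k], data))

# Toy data (hypothetical): four points placed symmetrically around the origin.
data = [(1, 0), (-1, 0), (0, 1), (0, -1)]
centers = [(0, 0), (5, 0)]
label = assign((1, 0), centers, data)
```

In this toy case the point (1, 0) has an exact mirror image about the origin, so the symmetry term for the first center is small and the point is assigned to it. A plain Euclidean assignment would differ only in dropping the symmetry term.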
Pages: 181-186
Page count: 6