Iterated Watersheds, A Connected Variation of K-Means for Clustering GIS Data

被引:7
|
作者
Soor, Sampriti [1 ]
Challa, Aditya [1 ]
Danda, Sravan [1 ]
Sagar, B. S. Daya [1 ]
Najman, Laurent [2 ]
机构
[1] Indian Stat Inst, Syst Sci & Informat Unit, Bangalore 560059, Karnataka, India
[2] Univ Paris Est, ESIEE Paris, ENPC, LIGM UMR 8049,CNRS,UPEMLV, F-93162 Noisy Le Grand, France
关键词
Clustering algorithms; Partitioning algorithms; Cost function; Approximation algorithms; Image segmentation; Roads; Graph clustering; K-means; E-governance; watersheds; IMAGE; ALGORITHMS;
D O I
10.1109/TETC.2019.2910147
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we propose a novel algorithm to obtain a solution to the clustering problem with an additional constraint of connectivity. This is achieved by suitably modifying K-Means algorithm to include connectivity constraints. The modified algorithm involves repeated application of watershed transform, and hence is referred to as iterated watersheds. Detailed analysis of the algorithm is performed using toy examples. Iterated watersheds is compared with several image segmentation algorithms. It has been shown that iterated watersheds performs better than methods such as spectral clustering, isoperimetric partitioning, and K-Means on various measures. To illustrate the applicability of iterated watersheds - a simple problem of placing emergency stations and suitable cost function is considered. Using real world road networks of various cities, iterated watersheds is compared with K-Means and greedy K-center methods. It is observed that iterated watersheds result in 4 - 66 percent improvement over K-Means and in 31 - 72 percent improvement over Greedy K-Centers in experiments on road networks of various cities.
引用
收藏
页码:626 / 636
页数:11
相关论文
共 50 条
  • [21] Optimized data fusion for K-means Laplacian clustering
    Yu, Shi
    Liu, Xinhai
    Tranchevent, Leon-Charles
    Glanzel, Wolfgang
    Suykens, Johan A. K.
    De Moor, Bart
    Moreau, Yves
    BIOINFORMATICS, 2011, 27 (01) : 118 - 126
  • [22] Parallelization of K-Means Clustering Algorithm for Data Mining
    Jiang, Hao
    Yu, Liyan
    4TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA 2017), 2017, 12
  • [23] K-means Clustering with Feature Selection for Stream Data
    Wang, Xiao-dong
    Chen, Rung-Ching
    Yan, Fei
    Hendry
    2018 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2018), 2018, : 453 - 456
  • [24] Online k-means Clustering on Arbitrary Data Streams
    Bhattacharjee, Robi
    Imola, Jacob John
    Moshkovitz, Michal
    Dasgupta, Sanjoy
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 204 - 236
  • [25] Optimized Data Fusion for Kernel k-Means Clustering
    Yu, Shi
    Tranchevent, Leon-Charles
    Liu, Xinhai
    Glanzel, Wolfgang
    Suykens, Johan A. K.
    De Moor, Bart
    Moreau, Yves
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (05) : 1031 - 1039
  • [26] Data clustering: 50 years beyond K-means
    Jain, Anil K.
    PATTERN RECOGNITION LETTERS, 2010, 31 (08) : 651 - 666
  • [27] Combining PSO and k-means to Enhance Data Clustering
    Ahmadyfard, Alireza
    Modares, Hamidreza
    2008 INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS, VOLS 1 AND 2, 2008, : 688 - 691
  • [28] An extension of the K-means algorithm to clustering skewed data
    Melnykov, Volodymyr
    Zhu, Xuwen
    COMPUTATIONAL STATISTICS, 2019, 34 (01) : 373 - 394
  • [29] On the quality of k-means clustering based on grouped data
    Kaeaerik, Meelis
    Paerna, Kalev
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2009, 139 (11) : 3836 - 3841
  • [30] How to Use K-means for Big Data Clustering?
    Mussabayev, Rustam
    Mladenovic, Nenad
    Jarboui, Bassem
    Mussabayev, Ravil
    PATTERN RECOGNITION, 2023, 137