Iterated Watersheds, A Connected Variation of K-Means for Clustering GIS Data

Cited by: 7

Authors:

Soor, Sampriti [1]

Challa, Aditya [1]

Danda, Sravan [1]

Sagar, B. S. Daya [1]

Najman, Laurent [2]

Affiliations:

[1] Indian Stat Inst, Syst Sci & Informat Unit, Bangalore 560059, Karnataka, India

[2] Univ Paris Est, ESIEE Paris, ENPC, LIGM UMR 8049,CNRS,UPEMLV, F-93162 Noisy Le Grand, France

Source:

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING | 2021 / Vol. 9 / Issue 02

Keywords:

Clustering algorithms; Partitioning algorithms; Cost function; Approximation algorithms; Image segmentation; Roads; Graph clustering; K-means; E-governance; watersheds; IMAGE; ALGORITHMS;

DOI:

10.1109/TETC.2019.2910147

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

In this article, we propose a novel algorithm for the clustering problem with an additional connectivity constraint. It is obtained by modifying the K-Means algorithm so that every cluster must be connected. The modified algorithm applies the watershed transform repeatedly and is therefore referred to as iterated watersheds. The algorithm is analyzed in detail using toy examples. Iterated watersheds is compared with several image segmentation algorithms and is shown to perform better than methods such as spectral clustering, isoperimetric partitioning, and K-Means on various measures. To illustrate its applicability, a simple problem of placing emergency stations, together with a suitable cost function, is considered. On real-world road networks of various cities, iterated watersheds is compared with K-Means and greedy K-center methods: it yields a 4 to 66 percent improvement over K-Means and a 31 to 72 percent improvement over greedy K-center.
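
The alternation described in the abstract (a seeded labeling step that keeps clusters connected, followed by a K-Means-style center update) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the labeling step uses a seeded minimax-path propagation on an edge-weighted graph as a stand-in for the watershed transform, the center update picks the node with the smallest total within-cluster shortest-path distance, and all function names (`seeded_labels`, `update_center`, `iterated_watersheds`) and the toy graph are hypothetical. The graph is assumed to be connected.

```python
import heapq


def seeded_labels(graph, seeds):
    """Assign each node to the seed reached with the lowest minimax path cost.

    Assumption: this seeded propagation stands in for the watershed step.
    Labels spread only from already-labeled neighbors, so clusters stay connected.
    """
    best = {node: float("inf") for node in graph}
    label = {}
    heap = []
    for idx, seed in enumerate(seeds):
        best[seed] = 0.0
        heapq.heappush(heap, (0.0, idx, seed))
    while heap:
        cost, idx, node = heapq.heappop(heap)
        if node in label:
            continue  # already claimed by a cheaper path
        label[node] = idx
        for nbr, w in graph[node].items():
            new_cost = max(cost, w)  # path cost = heaviest edge on the path
            if nbr not in label and new_cost < best[nbr]:
                best[nbr] = new_cost
                heapq.heappush(heap, (new_cost, idx, nbr))
    return label


def cluster_distances(graph, members, source):
    """Additive shortest-path distances from source, using only nodes in members."""
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, node = heapq.heappop(heap)
        if d > dist[node]:
            continue
        for nbr, w in graph[node].items():
            if nbr in members and d + w < dist.get(nbr, float("inf")):
                dist[nbr] = d + w
                heapq.heappush(heap, (d + w, nbr))
    return dist


def update_center(graph, members):
    """Pick the member with the smallest total distance to its cluster (assumed cost)."""
    def total(node):
        return sum(cluster_distances(graph, members, node).values())
    return min(sorted(members), key=total)


def iterated_watersheds(graph, seeds, max_iter=20):
    """Alternate labeling and center updates until the centers stop moving."""
    seeds = list(seeds)
    for _ in range(max_iter):
        label = seeded_labels(graph, seeds)
        clusters = [set() for _ in seeds]
        for node, idx in label.items():
            clusters[idx].add(node)
        new_seeds = [update_center(graph, c) for c in clusters]
        if new_seeds == seeds:
            break
        seeds = new_seeds
    return seeds, seeded_labels(graph, seeds)


# Toy path graph 0-1-2-3-4-5 with one heavy edge between nodes 2 and 3.
edges = [(0, 1, 1.0), (1, 2, 1.0), (2, 3, 5.0), (3, 4, 1.0), (4, 5, 1.0)]
graph = {n: {} for n in range(6)}
for u, v, w in edges:
    graph[u][v] = w
    graph[v][u] = w

final_seeds, final_labels = iterated_watersheds(graph, seeds=[0, 1])
print(final_seeds, final_labels)
```

In this toy run the two initial seeds start next to each other, and the alternation moves them apart until the heavy edge separates the two clusters. A real road-network experiment would need the cost function and watershed definition from the paper itself.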


Pages: 626-636

Number of pages: 11
