Parameterized Complexity of Categorical Clustering with Size Constraints

被引：1

作者：

Fomin, Fedor, V ^{[1
]}

Golovach, Petr A. ^{[1
]}

Purohit, Nidhi ^{[1
]}

机构：

[1] Univ Bergen, Dept Informat, Bergen, Norway

来源：

ALGORITHMS AND DATA STRUCTURES, WADS 2021 | 2021年 / 12808卷

基金：

欧洲研究理事会;

关键词：

Categorical clustering; Capacitated clustering; Parameterized complexity; APPROXIMATION ALGORITHM;

D O I：

10.1007/978-3-030-83508-8_28

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In the Categorical Clustering problem, we are given a set of vectors (matrix) A = {a(1), . . . , a(n)} over Sigma(m), where Sigma is a finite alphabet, and integers k and B. The task is to partition A into k clusters such that the median objective of the clustering in the Hamming norm is at most B. That is, we seek a partition {I-1, . . . , I-k} of {1, . . . , n} and vectors c(1), . . . , c(k) is an element of Sigma(m) such that Sigma(k)(i=1) Sigma(j is an element of Ii) d(H)(c(i), a(j)) <= B, where d(H)(a, b) is the Hamming distance between vectors a and b. Fomin, Golovach, and Panolan [ICALP 2018] proved that the problem is fixed-parameter tractable (for binary case Sigma = {0, 1}) by giving an algorithm that solves the problem in time 2(O(B logB)) . (mn)(O(1)). We extend this algorithmic result to a popular capacitated clustering model, where in addition the sizes of the clusters should satisfy certain constraints. More precisely, in CAPACITATED CLUSTERING, in addition, we are given two non-negative integers p and q, and seek a clustering with p <= vertical bar I-i vertical bar <= q for all i is an element of {1, . . . , k}. Our main theorem is that CAPACITATED CLUSTERING is solvable in time 2(O(B logB))vertical bar Sigma vertical bar(B) . (mn)(O(1)). The theorem not only extends the previous algorithmic results to a significantly more general model, it also implies algorithms for several other variants of CATEGORICAL CLUSTERING with constraints on cluster sizes.

引用

页码：385 / 398

页数：14

共 50 条

[1] Parameterized complexity of categorical clustering with size constraints
Fomin F.V.
Golovach P.A.
Purohit N.
Journal of Computer and System Sciences, 2023, 136 : 171 - 194
[2] Parameterized Complexity of Feature Selection for Categorical Data Clustering
Bandyapadhyay, Sayan
Fomin, Fedor V.
Golovach, Petr A.
Simonov, Kirill
ACM TRANSACTIONS ON COMPUTATION THEORY, 2023, 15 (3-4)
[3] On the Complexity of Clustering with Relaxed Size Constraints
Goldwurm, Massimiliano
Lin, Jianyi
Sacca, Francesco
ALGORITHMIC ASPECTS IN INFORMATION AND MANAGEMENT, 2016, 9778 : 26 - 38
[4] On the Parameterized Complexity of Consensus Clustering
Doernfelder, Martin
Guo, Jiong
Komusiewicz, Christian
Weller, Mathias
ALGORITHMS AND COMPUTATION, 2011, 7074 : 624 - +
[5] On the parameterized complexity of consensus clustering
Doernfelder, Martin
Guo, Jiong
Komusiewicz, Christian
Weller, Mathias
THEORETICAL COMPUTER SCIENCE, 2014, 542 : 71 - 82
[6] On the complexity of clustering with relaxed size constraints in fixed dimension
Goldwurm, Massimiliano
Lin, Jianyi
Sacca, Francesco
THEORETICAL COMPUTER SCIENCE, 2018, 717 : 37 - 46
[7] The Parameterized Complexity of Clustering Incomplete Data
Eiben, Eduard
Ganian, Robert
Kanj, Iyad
Ordyniak, Sebastian
Szeider, Stefan
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7296 - 7304
[8] On the Parameterized Complexity of Labelled Correlation Clustering Problem
Liu, Xianmin
Li, Jianzhong
Gao, Hong
COMPUTING AND COMBINATORICS, COCOON 2014, 2014, 8591 : 276 - 287
[9] On the parameterized complexity of clustering problems for incomplete data
Eiben, Eduard
Ganian, Robert
Kanj, Iyad
Ordyniak, Sebastian
Szeider, Stefan
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2023, 134 : 1 - 19
[10] The complexity of irredundant sets parameterized by size
Downey, RG
Fellows, MR
Raman, V
DISCRETE APPLIED MATHEMATICS, 2000, 100 (03) : 155 - 167

← 1 2 3 4 5 →