Relational data clustering with incomplete data

Cited by: 0
Authors
Hathaway, RJ [1]
Overstreet, DD [1]
Murphy, TE [1]
Bezdek, JC [1]
Affiliations
[1] Georgia So Univ, Dept Math, Statesboro, GA 30460 USA
Keywords
pattern recognition; clustering; c-means clustering; relational data; incomplete data; missing data; dissimilarity data
DOI
10.1117/12.421178
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We consider the problem of clustering a set of objects represented by relational data in the form of a dissimilarity matrix with missing values. Three methods are developed to estimate the missing values, all using simple triangle inequality-based approximation schemes. With few exceptions, any relational clustering algorithm can then be applied to the completed data matrix to obtain clusters. We illustrate our approach by clustering incomplete data built from several data sets. The primary clustering method chosen for our numerical experiments is the non-Euclidean relational fuzzy c-means algorithm. Our examples show that satisfactory clusters can still be obtained even when roughly half of the distance values are missing before completion.
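The abstract does not spell out the three estimation schemes, so the following is only a minimal sketch of one plausible reading of "triangle inequality-based approximation", not the authors' exact method. For a missing entry d_ij, every pivot object k with both d_ik and d_kj observed gives an upper bound d_ik + d_kj and a lower bound |d_ik - d_kj|. The sketch fills the gap with the tightest upper bound, the tightest lower bound, or their average. The function name `complete_dissimilarities`, the `mode` parameter, and the use of NaN to mark missing values are all assumptions made for illustration.

```python
import numpy as np

def complete_dissimilarities(D, mode="average"):
    """Fill NaN entries of a symmetric dissimilarity matrix.

    Hypothetical helper: estimates each missing d_ij from triangle-inequality
    bounds computed over pivots k whose d_ik and d_kj are both observed.
    """
    D = np.asarray(D, dtype=float)
    n = D.shape[0]
    known = ~np.isnan(D)      # mask of observed entries in the ORIGINAL matrix
    out = D.copy()
    for i in range(n):
        for j in range(i + 1, n):
            if known[i, j]:
                continue
            # Pivots must be distinct from i and j and observed on both sides.
            pivots = [k for k in range(n)
                      if k != i and k != j and known[i, k] and known[k, j]]
            if not pivots:
                continue      # no usable pivot: leave the entry missing
            upper = min(D[i, k] + D[k, j] for k in pivots)      # d_ij <= d_ik + d_kj
            lower = max(abs(D[i, k] - D[k, j]) for k in pivots)  # d_ij >= |d_ik - d_kj|
            if mode == "upper":
                estimate = upper
            elif mode == "lower":
                estimate = lower
            elif mode == "average":
                estimate = 0.5 * (lower + upper)
            else:
                raise ValueError(f"unknown mode: {mode!r}")
            out[i, j] = out[j, i] = estimate
    return out

# Three points on a line at positions 0, 1 and 3; d(0, 2) is treated as missing.
D = np.array([[0.0, 1.0, np.nan],
              [1.0, 0.0, 2.0],
              [np.nan, 2.0, 0.0]])
completed = complete_dissimilarities(D, mode="average")
```

The completed matrix could then be passed to any relational clustering routine. Estimates are computed only from originally observed entries, so the result does not depend on the order in which gaps are filled.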
Pages: 273-280
Number of pages: 8
Related papers
50 in total
  • [1] Clustering relational data
    Ferligoj, Anuska
    [J]. PROCEEDINGS OF THE ITI 2008 30TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2008, : 13 - 18
  • [2] On Fuzzy Clustering for Incomplete Spherical Data and for Incomplete Multivariate Categorical Data
    Kanzawa, Yuchi
    [J]. 2018 JOINT 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 19TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2018, : 638 - 643
  • [3] Clustering incomplete relational data using the non-Euclidean relational fuzzy c-means algorithm
    Hathaway, RJ
    Bezdek, JC
    [J]. PATTERN RECOGNITION LETTERS, 2002, 23 (1-3) : 151 - 160
  • [4] The Parameterized Complexity of Clustering Incomplete Data
    Eiben, Eduard
    Ganian, Robert
    Kanj, Iyad
    Ordyniak, Sebastian
    Szeider, Stefan
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7296 - 7304
  • [5] Fusion Subspace Clustering for Incomplete Data
    Mahmood, Usman
    Pimentel-Alarcon, Daniel
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [6] MDL hierarchical clustering with incomplete data
    Lai, Po-Hsiang
    O'Sullivan, Joseph A.
    [J]. 2010 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2010, : 369 - 373
  • [7] Integrating Incomplete Information into the Relational Data Model
    Ribeiro, Jorge
    Machado, Jose
    Abelha, Antonio
    Fernandez-Delgado, Manuel
    Neves, Jose
    [J]. WORLD CONGRESS ON ENGINEERING, WCE 2010, VOL I, 2010, : 57 - 62
  • [8] Evolutionary fuzzy clustering of relational data
    Horta, Danilo
    de Andrade, Ivan C.
    Campello, Ricardo J. G. B.
    [J]. THEORETICAL COMPUTER SCIENCE, 2011, 412 (42) : 5854 - 5870
  • [9] Evolutionary Clustering Algorithms for Relational Data
    Banerjee, Amit
    Abu-Mahfouz, Issam
    [J]. CYBER PHYSICAL SYSTEMS AND DEEP LEARNING, 2018, 140 : 276 - 283
  • [10] Affinity Propagation Clustering with Incomplete Data
    Lu, Cheng
    Song, Shiji
    Wu, Cheng
    [J]. COMPUTATIONAL INTELLIGENCE, NETWORKED SYSTEMS AND THEIR APPLICATIONS, 2014, 462 : 239 - 248