Clustering Uncertain Data Objects Using Jeffreys-Divergence and Maximum Bipartite Matching Based Similarity Measure

Cited by: 13
Authors
Sharma, Krishna Kumar [1 ,2 ]
Seal, Ayan [1 ,3 ]
Yazidi, Anis [4 ,5 ,6 ]
Selamat, Ali [3 ,7 ]
Krejcar, Ondrej [3 ,7 ]
Affiliations
[1] PDPM Indian Inst Informat Technol Design & Mfg Ja, Dept Comp Sci & Engn, Jabalpur 482005, India
[2] Univ Kota, Dept Comp Sci & Informat, Kota 324005, India
[3] Univ Hradec Kralove, Fac Informat & Management, Ctr Basic & Appl Res, Hradec Kralove 50003, Czech Republic
[4] Oslo Metropolitan Univ, Dept Comp Sci, N-460167 Oslo, Norway
[5] Univ Teknol Malaysia, Malaysia Japan Int Inst Technol, Kuala Lumpur 54100, Malaysia
[6] Norwegian Univ Sci & Technol, Dept Comp Sci, N-7491 Trondheim, Norway
[7] Oslo Univ Hosp, Dept Plast & Reconstruct Surg, N-0424 Oslo, Norway
Source
IEEE ACCESS | 2021 / Vol. 9
Keywords
Uncertain data clustering; probability density estimation; bipartite matching; INTEGRATION; SELECTION;
DOI
10.1109/ACCESS.2021.3083969
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Subject classification code
0812 ;
Abstract
In recent years, uncertain data clustering has become a subject of active research in many fields, such as pattern recognition and machine learning. To handle uncertainty in data, researchers have sought to replace the traditional distance or similarity measures in existing centralized clustering algorithms with new metrics. However, how the uncertain data are represented plays a crucial role in clustering them. In this paper, Monte-Carlo integration is adopted and modified to express uncertain data in a probabilistic form. Three similarity measures, one of them novel, are then used to determine the closeness between two probability distributions. These similarity measures are derived from the notions of Kullback-Leibler divergence and Jeffreys divergence. Finally, the density-based spatial clustering of applications with noise (DBSCAN) and k-medoids algorithms are modified and applied to one synthetic database and three real-world uncertain databases. The results indicate that the proposed clustering technique outperforms some of the existing algorithms.
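As a rough illustration of the quantities named in the abstract, the sketch below estimates the Kullback-Leibler divergence by Monte-Carlo integration and symmetrizes it into the Jeffreys divergence, J(p, q) = KL(p || q) + KL(q || p). It is not the paper's method: the choice of one-dimensional Gaussians as the uncertain objects, the sample size, and all function names are assumptions made here for illustration, and the paper's modified integration scheme and its novel similarity measure are not reproduced.

```python
import math
import random


def gaussian_log_pdf(x, mu, sigma):
    """Log-density of a 1-D Gaussian N(mu, sigma^2) at x."""
    z = (x - mu) / sigma
    return -0.5 * z * z - math.log(sigma) - 0.5 * math.log(2.0 * math.pi)


def kl_monte_carlo(p, q, n_samples=20000, seed=0):
    """Estimate KL(p || q) as the mean of log p(x) - log q(x) over x ~ p.

    p and q are (mu, sigma) tuples; this parameterisation is an assumption.
    """
    rng = random.Random(seed)
    mu_p, sigma_p = p
    mu_q, sigma_q = q
    total = 0.0
    for _ in range(n_samples):
        x = rng.gauss(mu_p, sigma_p)
        total += gaussian_log_pdf(x, mu_p, sigma_p) - gaussian_log_pdf(x, mu_q, sigma_q)
    return total / n_samples


def jeffreys_monte_carlo(p, q, n_samples=20000, seed=0):
    """Jeffreys divergence: the sum of the two directed KL estimates."""
    return (kl_monte_carlo(p, q, n_samples, seed)
            + kl_monte_carlo(q, p, n_samples, seed + 1))


def kl_closed_form(p, q):
    """Exact KL(p || q) for 1-D Gaussians, used only as a sanity check."""
    mu_p, sigma_p = p
    mu_q, sigma_q = q
    return (math.log(sigma_q / sigma_p)
            + (sigma_p ** 2 + (mu_p - mu_q) ** 2) / (2.0 * sigma_q ** 2)
            - 0.5)


def jeffreys_closed_form(p, q):
    return kl_closed_form(p, q) + kl_closed_form(q, p)


if __name__ == "__main__":
    p, q = (0.0, 1.0), (1.0, 1.5)  # two hypothetical uncertain objects
    print("Monte-Carlo estimate:", jeffreys_monte_carlo(p, q))
    print("Closed form:         ", jeffreys_closed_form(p, q))
```

Unlike the directed KL divergence, the Jeffreys divergence is symmetric in its arguments, which makes it a more natural starting point for a pairwise similarity measure between uncertain objects.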
Pages: 79505 - 79519
Number of pages: 15
Related papers
50 in total
  • [41] Tensor-Based Reliable Multiview Similarity Learning for Robust Spectral Clustering on Uncertain Data
    Li, Ao
    Chen, Jiajia
    Chen, Deyun
    Yu, Xiaoyang
    Yuan, Mengke
    Xu, Shibiao
    Sun, Guanglu
    IEEE TRANSACTIONS ON RELIABILITY, 2021, 70 (03) : 916 - 930
  • [42] Fuzzy c-medoids Method based on JS-divergence for Uncertain Data Clustering
    Wang, Yingxu
    Dong, Jiwen
    Zhou, Jin
    Wang, Dong
    Wang, Lin
    Han, Shiyuan
    Chen, Yuehui
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION, CYBERNETICS AND COMPUTATIONAL SOCIAL SYSTEMS (ICCSS), 2017, : 312 - 315
  • [43] Texture image retrieval based on a Gaussian Mixture Model and similarity measure using a Kullback divergence
    Yuan, H
    Zhang, XP
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1-3, 2004, : 1867 - 1870
  • [44] ONLINE FUZZY CLUSTERING OF INCOMPLETE DATA USING CREDIBILISTIC APPROACH AND SIMILARITY MEASURE OF SPECIAL TYPE
    Bodyanskiy, Ye, V
    Shafronenko, A. Yu
    Klymova, I. N.
    RADIO ELECTRONICS COMPUTER SCIENCE CONTROL, 2021, (01) : 97 - 104
  • [45] SDR: A Novel Similarity Measure Using Curve Fitting Method for Time Series Data Clustering
    Yang, Huahui
    Meng, Chen
    Wang, Cheng
    Yao, Yunzhi
    2019 9TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST2019), 2019, : 464 - 469
  • [46] Spectral Clustering Using Robust Similarity Measure Based on Closeness of Shared Nearest Neighbors
    Ye, Xiucai
    Sakurai, Tetsuya
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [47] Improved Spectral Clustering using PCA based similarity measure on different Laplacian Graphs
    Kavitha, K. R.
    Sandeep, S.
    Praveen, P. R.
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH, 2016, : 79 - 84
  • [48] Clustering of temporal gene expression data by regularized spline regression and an energy based similarity measure
    Zhang, Wei-Feng
    Liu, Chao-Chun
    Yan, Hong
    PATTERN RECOGNITION, 2010, 43 (12) : 3969 - 3976
  • [49] Data compression using a sort-based context similarity measure
    Yokoo, H
    COMPUTER JOURNAL, 1997, 40 (2-3) : 94 - 102
  • [50] A graph-based approach to corner matching using mutual information as a local similarity measure
    Lourakis, MIA
    Argyros, AA
    Marias, K
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 827 - 830