Clustering Uncertain Data Objects Using Jeffreys-Divergence and Maximum Bipartite Matching Based Similarity Measure

被引:13
|
作者
Sharma, Krishna Kumar [1 ,2 ]
Seal, Ayan [1 ,3 ]
Yazidi, Anis [4 ,5 ,6 ]
Selamat, Ali [3 ,7 ]
Krejcar, Ondrej [3 ,7 ]
机构
[1] PDPM Indian Inst Informat Technol Design & Mfg Ja, Dept Comp Sci & Engn, Jabalpur 482005, India
[2] Univ Kota, Dept Comp Sci & Informat, Kota 324005, India
[3] Univ Hradec Kralove, Fac Informat & Management, Ctr Basic & Appl Res, Hradec Kralove 50003, Czech Republic
[4] Oslo Metropolitan Univ, Dept Comp Sci, N-460167 Oslo, Norway
[5] Univ Teknol Malaysia, Malaysia Japan Int Inst Technol, Kuala Lumpur 54100, Malaysia
[6] Norwegian Univ Sci & Technol, Dept Comp Sci, N-7491 Trondheim, Norway
[7] Oslo Univ Hosp, Dept Plast & Reconstruct Surg, N-0424 Oslo, Norway
来源
IEEE ACCESS | 2021年 / 9卷
关键词
Uncertain data clustering; probability density estimation; bipartite matching; INTEGRATION; SELECTION;
D O I
10.1109/ACCESS.2021.3083969
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, uncertain data clustering has become the subject of active research in many fields, for example, pattern recognition, and machine learning. Nowadays, researchers have committed themselves to substitute the traditional distance or similarity measures with new metrics in the existing centralized clustering algorithms in order to tackle uncertainty in data. However, in order to perform uncertain data clustering, representation plays an imperative role. In this paper, a Monte-Carlo integration is adopted and modified to express uncertain data in a probabilistic form. Then three similarity measures are used to determine the closeness between two probability distributions including one novel measure. These similarity measures are derived from the notion of Kullback-Leibler divergence and Jeffreys divergence. Finally, density-based spatial clustering of applications with noise and k-medoids algorithms are modified and implemented on one synthetic database and three real-world uncertain databases. The obtained outcomes confirm that the proposed clustering technique defeats some of the existing algorithms.
引用
下载
收藏
页码:79505 / 79519
页数:15
相关论文
共 50 条
  • [1] Clustering Uncertain Data Objects Using Jeffreys-Divergence and Maximum Bipartite Matching Based Similarity Measure
    Sharma, Krishna Kumar
    Seal, Ayan
    Yazidi, Anis
    Selamat, Ali
    Krejcar, Ondrej
    IEEE Access, 2021, 9 : 79505 - 79519
  • [2] Fuzzy c-means clustering using Jeffreys-divergence based similarity measure
    Seal, Ayan
    Karlekar, Aditya
    Krejcar, Ondrej
    Gonzalo-Martin, Consuelo
    APPLIED SOFT COMPUTING, 2020, 88
  • [3] Performance and Convergence Analysis of Modified C-Means Using Jeffreys-Divergence for Clustering
    Seal, Ayan
    Karlekar, Aditya
    Krejcar, Ondrej
    Herrera-Viedma, Enrique
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2021, 7 (02): : 141 - 149
  • [4] Image similarity measure using max weighted bipartite matching
    Wan, Hualin
    Hu, Hong
    Shi, Zhongzhi
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2002, 14 (11): : 1066 - 1069
  • [5] Objects similarity measure based on skeleton tree descriptor matching
    Liu, WY
    Liu, JT
    JOURNAL OF INFRARED AND MILLIMETER WAVES, 2005, 24 (06) : 432 - 436
  • [6] MatchSim: a novel similarity measure based on maximum neighborhood matching
    Lin, Zhenjiang
    Lyu, Michael R.
    King, Irwin
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 32 (01) : 141 - 166
  • [7] MatchSim: a novel similarity measure based on maximum neighborhood matching
    Zhenjiang Lin
    Michael R. Lyu
    Irwin King
    Knowledge and Information Systems, 2012, 32 : 141 - 166
  • [8] Clustering Uncertain Data Based on Probability Distribution Similarity
    Jiang, Bin
    Pei, Jian
    Tao, Yufei
    Lin, Xuemin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (04) : 751 - 763
  • [9] Weighted FCM Clustering Algorithm Based on Jeffrey Divergence Similarity Measure
    Wu Chenwen
    Ma Ning
    Jiang Yufan
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (08)
  • [10] A Matching Pursuit Based Similarity Measure for Fuzzy Clustering and Classification of Signals
    Mazhar, Raazia
    Gader, Paul D.
    Wilson, Joseph N.
    2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 1952 - 1957