A semi-supervised probabilistic model for clustering large databases of complex images

被引:1
|
作者
Chandran, S. Nisha [1 ]
Gangodkar, Durgaprasad [1 ]
Mittal, Ankush [1 ]
机构
[1] Graph Era Univ, Dept CSE, Bell Rd, Dehra Dun 248001, Uttar Pradesh, India
基金
美国国家科学基金会;
关键词
Image clustering; Kullback-Leibler distance; Gaussian mixture modeling;
D O I
10.1007/s11042-017-4664-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image content clustering is an effective way to organize large databases thereby making the content based image retrieval process much easier. However, clustering of images with varied background and foreground is quite challenging. In this paper, we propose a novel image content clustering paradigm suitable for clustering large and diverse image databases. In our approach images are represented in a continuous domain based on a probabilistic Gaussian Mixture Model (GMM) with the images modeled as mixture of Gaussian distributions in the selected feature space. The distance metric between the Gaussian distributions is defined in the sense of Kullback-Leibler (KL) divergence. The clustering is done using a semi-supervised learning framework where labeled data in the form of cluster templates is used to classify the unlabelled data. The clusters are formed around initially chosen seeds and are updated in the due course based on user inputs. In our clustering approach the user interaction is done in a structured way as to get maximum inputs from the user in a limited time. We propose two methods to carry out the structured user interaction using which the cluster templates are updated to improve the quality of the clusters formed. The proposed method is experimentally evaluated on benchmark datasets that are specifically chosen to include a wide variation of images around a common theme that is typically encountered in applications like photo-summarization and poses a major semantic gap challenge to conventional clustering approaches. The experimental results presented demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:21937 / 21959
页数:23
相关论文
共 50 条
  • [41] Input validation for semi-supervised clustering
    Yip, Kevin Y.
    Ng, Michael K.
    Cheung, David W.
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 479 - 483
  • [42] A survey on semi-supervised graph clustering
    Daneshfar, Fatemeh
    Soleymanbaigi, Sayvan
    Yamini, Pedram
    Amini, Mohammad Sadra
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133 (133)
  • [43] Research Progress on Semi-Supervised Clustering
    Qin, Yue
    Ding, Shifei
    Wang, Lijuan
    Wang, Yanru
    COGNITIVE COMPUTATION, 2019, 11 (05) : 599 - 612
  • [44] Semi-supervised deep density clustering
    Xu, Xiao
    Hou, Haiwei
    Ding, Shifei
    APPLIED SOFT COMPUTING, 2023, 148
  • [45] Semi-supervised Linear Discriminant Clustering
    Liu, Chien-Liang
    Hsaio, Wen-Hoar
    Lee, Chia-Hoang
    Gou, Fu-Sheng
    IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (07) : 989 - 1000
  • [46] Composite kernels for semi-supervised clustering
    Carlotta Domeniconi
    Jing Peng
    Bojun Yan
    Knowledge and Information Systems, 2011, 28 : 99 - 116
  • [47] A SUPERVISORY APPROACH TO SEMI-SUPERVISED CLUSTERING
    Conroy, Bryan
    Xi, Yongxin Taylor
    Ramadge, Peter
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1858 - 1861
  • [48] Weighted Semi-supervised Fuzzy Clustering
    Kong, Yi-qing
    Wang, Shi-tong
    FUZZY INFORMATION AND ENGINEERING, VOL 1, 2009, 54 : 465 - 470
  • [49] SemiSync: Semi-supervised Clustering by Synchronization
    Zhang, Zhong
    Kang, Didi
    Gao, Chongming
    Shao, Junming
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 358 - 362
  • [50] Categorization Using Semi-Supervised Clustering
    Hu, Jianying
    Singh, Moninder
    Mojsilovic, Aleksandra
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3666 - 3669