A semi-supervised probabilistic model for clustering large databases of complex images

被引:1
|
作者
Chandran, S. Nisha [1 ]
Gangodkar, Durgaprasad [1 ]
Mittal, Ankush [1 ]
机构
[1] Graph Era Univ, Dept CSE, Bell Rd, Dehra Dun 248001, Uttar Pradesh, India
基金
美国国家科学基金会;
关键词
Image clustering; Kullback-Leibler distance; Gaussian mixture modeling;
D O I
10.1007/s11042-017-4664-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image content clustering is an effective way to organize large databases thereby making the content based image retrieval process much easier. However, clustering of images with varied background and foreground is quite challenging. In this paper, we propose a novel image content clustering paradigm suitable for clustering large and diverse image databases. In our approach images are represented in a continuous domain based on a probabilistic Gaussian Mixture Model (GMM) with the images modeled as mixture of Gaussian distributions in the selected feature space. The distance metric between the Gaussian distributions is defined in the sense of Kullback-Leibler (KL) divergence. The clustering is done using a semi-supervised learning framework where labeled data in the form of cluster templates is used to classify the unlabelled data. The clusters are formed around initially chosen seeds and are updated in the due course based on user inputs. In our clustering approach the user interaction is done in a structured way as to get maximum inputs from the user in a limited time. We propose two methods to carry out the structured user interaction using which the cluster templates are updated to improve the quality of the clusters formed. The proposed method is experimentally evaluated on benchmark datasets that are specifically chosen to include a wide variation of images around a common theme that is typically encountered in applications like photo-summarization and poses a major semantic gap challenge to conventional clustering approaches. The experimental results presented demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:21937 / 21959
页数:23
相关论文
共 50 条
  • [21] Research Progress on Semi-Supervised Clustering
    Yue Qin
    Shifei Ding
    Lijuan Wang
    Yanru Wang
    Cognitive Computation, 2019, 11 : 599 - 612
  • [22] Semi-supervised spectral clustering ensemble
    1600, ICIC Express Letters Office (10):
  • [23] Image Annotation with Semi-Supervised Clustering
    Sayar, Ahmet
    Yannan-Vural, Fatos T.
    2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 517 - 520
  • [24] Semi-supervised clustering of unknown expressions
    Jalal, Ahsan
    Tariq, Usman
    PATTERN RECOGNITION LETTERS, 2019, 120 : 46 - 53
  • [25] Semi-supervised clustering ensemble based on genetic algorithm model
    Bi, Sheng
    Li, Xiangli
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 55851 - 55865
  • [26] Manifold Regularized Gaussian Mixture Model for Semi-supervised Clustering
    Gan, Haitao
    Sang, Nong
    Huang, Rui
    Chen, Xi
    2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 361 - 365
  • [27] Composite kernels for semi-supervised clustering
    Domeniconi, Carlotta
    Peng, Jing
    Yan, Bojun
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 28 (01) : 99 - 116
  • [28] Fast semi-supervised evidential clustering
    Antoine, Violaine
    Guerrero, Jose A.
    Xie, Jiarui
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2021, 133 (133) : 116 - 132
  • [29] Image Annotation With Semi-Supervised Clustering
    Sayar, Ahmet
    Vural, Fatos T. Yarman
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 12 - +
  • [30] Semi-supervised deep embedded clustering
    Ren, Yazhou
    Hu, Kangrong
    Dai, Xinyi
    Pan, Lili
    Hoi, Steven C. H.
    Xu, Zenglin
    NEUROCOMPUTING, 2019, 325 : 121 - 130