A semi-supervised probabilistic model for clustering large databases of complex images

Cited by: 1
Authors
Chandran, S. Nisha [1]
Gangodkar, Durgaprasad [1]
Mittal, Ankush [1]
Affiliations
[1] Graph Era Univ, Dept CSE, Bell Rd, Dehra Dun 248001, Uttar Pradesh, India
Funding
National Science Foundation (USA);
Keywords
Image clustering; Kullback-Leibler distance; Gaussian mixture modeling;
DOI
10.1007/s11042-017-4664-3
Chinese Library Classification number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
Image content clustering is an effective way to organize large databases, thereby making content-based image retrieval much easier. However, clustering images with varied backgrounds and foregrounds is quite challenging. In this paper, we propose a novel image content clustering paradigm suitable for large and diverse image databases. In our approach, images are represented in a continuous domain using a probabilistic Gaussian Mixture Model (GMM), with each image modeled as a mixture of Gaussian distributions in the selected feature space. The distance between Gaussian distributions is defined in the sense of the Kullback-Leibler (KL) divergence. Clustering is done in a semi-supervised learning framework in which labeled data, in the form of cluster templates, is used to classify the unlabeled data. The clusters are formed around initially chosen seeds and are updated in due course based on user input. User interaction is structured so as to obtain the maximum input from the user in a limited time. We propose two methods for carrying out this structured user interaction, through which the cluster templates are updated to improve the quality of the clusters formed. The proposed method is experimentally evaluated on benchmark datasets specifically chosen to include a wide variation of images around a common theme, as is typically encountered in applications such as photo summarization, and which poses a major semantic gap challenge to conventional clustering approaches. The experimental results demonstrate the effectiveness of the proposed approach.
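The abstract's core idea, comparing images modeled as Gaussian mixtures via KL divergence and assigning each image to the closest cluster template, can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not say how the divergence between two mixtures is computed, so the sketch assumes a matching-based approximation (each component is matched to its best counterpart) and a symmetrized distance. The mixture representation as a list of `(weight, mean, covariance)` tuples and the function names `kl_gaussian`, `gmm_kl_matching`, `symmetric_gmm_distance`, and `nearest_template` are hypothetical.

```python
import numpy as np

def kl_gaussian(mu0, cov0, mu1, cov1):
    """Closed-form KL(N0 || N1) between two multivariate Gaussians."""
    mu0, mu1 = np.asarray(mu0, float), np.asarray(mu1, float)
    cov0, cov1 = np.asarray(cov0, float), np.asarray(cov1, float)
    d = mu0.shape[0]
    cov1_inv = np.linalg.inv(cov1)
    diff = mu1 - mu0
    _, logdet0 = np.linalg.slogdet(cov0)
    _, logdet1 = np.linalg.slogdet(cov1)
    return 0.5 * (np.trace(cov1_inv @ cov0) + diff @ cov1_inv @ diff
                  - d + logdet1 - logdet0)

def gmm_kl_matching(gmm_f, gmm_g):
    """Approximate KL(f || g) between two mixtures (assumed technique).

    Each component of f is matched to the component of g that minimizes
    KL(f_a || g_b) - log(w_b); the matched costs are averaged with f's weights.
    """
    total = 0.0
    for w_a, mu_a, cov_a in gmm_f:
        best = min(kl_gaussian(mu_a, cov_a, mu_b, cov_b) - np.log(w_b)
                   for w_b, mu_b, cov_b in gmm_g)
        total += w_a * (best + np.log(w_a))
    return total

def symmetric_gmm_distance(gmm_a, gmm_b):
    """Symmetrized version of the approximation above (an assumption)."""
    return gmm_kl_matching(gmm_a, gmm_b) + gmm_kl_matching(gmm_b, gmm_a)

def nearest_template(image_gmm, templates):
    """Return the index of the closest cluster template and all distances."""
    dists = [symmetric_gmm_distance(image_gmm, t) for t in templates]
    return int(np.argmin(dists)), dists

# Toy 2-D feature space with two hypothetical cluster templates.
I2 = np.eye(2)
template_a = [(0.5, [0.0, 0.0], I2), (0.5, [5.0, 5.0], I2)]
template_b = [(0.5, [20.0, 20.0], I2), (0.5, [30.0, 30.0], I2)]
image = [(0.6, [0.5, 0.0], I2), (0.4, [5.0, 4.5], I2)]

label, distances = nearest_template(image, [template_a, template_b])
print(label, distances)
```

In a semi-supervised loop of the kind the abstract describes, user feedback would then be used to revise the templates before images are reassigned; that update step is not shown here because the abstract gives no details about it.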
Pages: 21937-21959
Number of pages: 23
Related papers
50 in total
  • [1] A semi-supervised probabilistic model for clustering large databases of complex images
    S. Nisha Chandran
    Durgaprasad Gangodkar
    Ankush Mittal
    Multimedia Tools and Applications, 2017, 76 : 21937 - 21959
  • [2] A Kernel Probabilistic Model for Semi-supervised Co-clustering Ensemble
    Zhang, Yinghui
    JOURNAL OF INTELLIGENT SYSTEMS, 2020, 29 (01) : 143 - 153
  • [3] Semi-supervised Probabilistic Distance Clustering and the Uncertainty of Classification
    Iyigun, Cem
    Ben-Israel, Adi
    ADVANCES IN DATA ANALYSIS, DATA HANDLING AND BUSINESS INTELLIGENCE, 2010, : 3 - 20
  • [4] A semi-supervised tool for clustering accounting databases with applications to internal controls
    Argyrou, Argyris
    Andreev, Andriy
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (09) : 11176 - 11181
  • [5] Semi-Supervised Learning on Large Complex Simulations
    Korecki, John N.
    Banfield, Robert E.
    Hall, Lawrence O.
    Bowyer, Kevin W.
    Kegelmeyer, W. Philip
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2396 - 2399
  • [6] A new interactive semi-supervised clustering model for large image database indexing
    Hien Phuong Lai
    Visani, Muriel
    Boucher, Alain
    Ogier, Jean-Marc
    PATTERN RECOGNITION LETTERS, 2014, 37 : 94 - 106
  • [7] Urdu Documents Clustering with Unsupervised and Semi-Supervised Probabilistic Topic Modeling
    Mustafa, Mubashar
    Zeng, Feng
    Ghulam, Hussain
    Muhammad Arslan, Hafiz
    INFORMATION, 2020, 11 (11) : 1 - 16
  • [8] Semi-supervised clustering methods
    Bair, Eric
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2013, 5 (05) : 349 - 361
  • [9] SEMI-SUPERVISED SPECTRAL CLUSTERING
    Mai, Xiaoyi
    Couillet, Romain
    2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 2012 - 2016
  • [10] A review on semi-supervised clustering
    Cai, Jianghui
    Hao, Jing
    Yang, Haifeng
    Zhao, Xujun
    Yang, Yuqing
    INFORMATION SCIENCES, 2023, 632 : 164 - 200