Heterogeneous Image Feature Integration via Multi-Modal Spectral Clustering

被引:0
|
作者
Cai, Xiao [1 ]
Nie, Feiping [1 ]
Huang, Heng [1 ]
Kamangar, Farhad [1 ]
机构
[1] Univ Texas Arlington, Comp Sci & Engn Dept, Arlington, TX 76019 USA
关键词
CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, more and more visual descriptors have been proposed to describe objects and scenes appearing in images. Different features describe different aspects of the visual characteristics. How to combine these heterogeneous features has become an increasing critical problem. In this paper, we propose a novel approach to unsupervised integrate such heterogeneous features by performing multi-modal spectral clustering on unlabeled images and unsegmented images. Considering each type of feature as one modal, our new multi-modal spectral clustering (MMSC) algorithm is to learn a commonly shared graph Laplacian matrix by unifying different modals (image features). A non-negative relaxation is also added in our method to improve the robustness and efficiency of image clustering. We applied our MMSC method to integrate five types of popularly used image features, including SIFT, HOG, GIST, LBP, CENTRIST and evaluated the performance by two benchmark data sets: Caltech-101 and MSRC-v1. Compared with existing unsupervised scene and object categorization methods, our approach always achieves superior performances measured by three standard clustering evaluation metrices.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Volumetric feature points integration with bio-structure-informed guidance for deformable multi-modal CT image registration
    Zhang, Chulong
    He, Wenfeng
    Liu, Lin
    Dai, Jingjing
    Salim Ahmad, Isah
    Xie, Yaoqin
    Liang, Xiaokun
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (24):
  • [42] Heterogeneous Feature Selection With Multi-Modal Deep Neural Networks and Sparse Group LASSO
    Zhao, Lei
    Hu, Qinghua
    Wang, Wenwu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (11) : 1936 - 1948
  • [43] A multi-modal prostate segmentation scheme by combining spectral clustering and Active Shape Models
    Toth, Robert
    Tiwari, Pallavi
    Rosen, Mark
    Kalyanpur, Arjun
    Pungavkar, Sona
    Madabhushi, Anant
    MEDICAL IMAGING 2008: IMAGE PROCESSING, PTS 1-3, 2008, 6914
  • [44] CEFusion: Multi-Modal medical image fusion via cross encoder
    Zhu, Ya
    Wang, Xue
    Chen, Luping
    Nie, Rencan
    IET Image Processing, 2023, 16 (12) : 3177 - 3189
  • [45] CEFusion: Multi-Modal medical image fusion via cross encoder
    Zhu, Ya
    Wang, Xue
    Chen, Luping
    Nie, Rencan
    IET IMAGE PROCESSING, 2022, 16 (12) : 3177 - 3189
  • [46] Semantically Multi-modal Image Synthesis
    Zhu, Zhen
    Xu, Zhiliang
    You, Ansheng
    Bai, Xiang
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5466 - 5475
  • [47] Unsupervised Multi-modal Medical Image Registration via Invertible Translation
    Guo, Mengjie
    COMPUTER VISION - ECCV 2024, PT XXXI, 2025, 15089 : 22 - 38
  • [48] Multi-modal semantic image segmentation
    Pemasiri, Akila
    Kien Nguyen
    Sridharan, Sridha
    Fookes, Clinton
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 202
  • [49] RGB-D Scene Classification via Multi-modal Feature Learning
    Cai, Ziyun
    Shao, Ling
    COGNITIVE COMPUTATION, 2019, 11 (06) : 825 - 840
  • [50] RGB-D Scene Classification via Multi-modal Feature Learning
    Ziyun Cai
    Ling Shao
    Cognitive Computation, 2019, 11 : 825 - 840