Locality constrained encoding of frequency and spatial information for image classification

被引:7
|
作者
Pan, Yongsheng [1 ]
Xia, Yong [1 ,2 ]
Song, Yang [3 ]
Cai, Weidong [3 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci & Engn, Shaanxi Key Lab Speech & Image Informat Proc SAII, Xian 710072, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci & Engn, CMCC, Xian 710072, Peoples R China
[3] Univ Sydney, Sch Informat Technol, Biomed & Multimedia Informat Technol BMIT Res Grp, Camperdown, NSW 2006, Australia
基金
中国国家自然科学基金; 澳大利亚研究理事会;
关键词
Image classification; Bag-of-features (BoF); Image decomposition; Wavelet transform; Spatial pyramid matching (SPM); REPRESENTATION; RETRIEVAL; FEATURES; MODEL;
D O I
10.1007/s11042-018-5712-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The bag-of-feature (BoF) model provides a way to construct high-level representation for image classification. Although spatial pyramid matching (SPM) has been incorporated into many of its extensions, these models intrinsically lack the mechanism to utilize frequency domain information. In this paper, we propose the locality-constrained encoding of frequency and spatial information (LEFSI) algorithm, in which an image is decomposed into multiple frequency components and each component is further decomposed into subregions using SPM. The scale-invariant feature transform (SIFT) descriptors are first calculated in each subregion, and then converted into a global descriptor by using the codebook generated on a category-by-category basis and locality-constrained linear coding (LLC). The image feature is defined as the concatenation of global descriptors constructed in all subregions. We evaluated this algorithm against several state-of-the-art models on six benchmark datasets. Our results suggest that the proposed LEFSI algorithm can describe images more effectively and provide more accurate image classification.
引用
收藏
页码:24891 / 24907
页数:17
相关论文
共 50 条
  • [31] Non-negative Locality-Constrained Linear Coding for Image Classification
    Liu, GuoJun
    Liu, Yang
    Guo, Maozu
    Liu, Peina
    Wang, ChunYu
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: IMAGE AND VIDEO DATA ENGINEERING, ISCIDE 2015, PT I, 2015, 9242 : 462 - 471
  • [32] OPTICAL PSEUDOCOLOR ENCODING OF SPATIAL-FREQUENCY INFORMATION
    SZPONER, K
    ZAJAC, M
    OPTICA APPLICATA, 1981, 11 (04) : 617 - 625
  • [33] OPTICAL PSEUDOCOLOR ENCODING OF SPATIAL-FREQUENCY INFORMATION
    BESCOS, J
    STRAND, TC
    APPLIED OPTICS, 1978, 17 (16): : 2524 - &
  • [34] Large Deformation Image Classification Using Generalized Locality-Constrained Linear Coding
    Zhang, Pei
    Wee, Chong-Yaw
    Niethammer, Marc
    Shen, Dinggang
    Yap, Pew-Thian
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION (MICCAI 2013), PT I, 2013, 8149 : 292 - 299
  • [35] Spatial-spectral locality constrained elastic net hypergraph for hyperspectral image clustering
    Wang, Sujuan
    Sun, Yubao
    Hang, Renlong
    Liu, Qingshan
    Yuan, Xiaotong
    Liu, Guangcan
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2017, 38 (23) : 7374 - 7388
  • [36] Locality-Constrained Multi-Task Joint Sparse Representation for Image Classification
    Guo, Lihua
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (09): : 2177 - 2181
  • [37] Image classification method rationally utilizing spatial information of the image
    Chao Wu
    Yaqian Li
    Zhibiao Zhao
    Bin Liu
    Multimedia Tools and Applications, 2019, 78 : 19181 - 19199
  • [38] Image classification method rationally utilizing spatial information of the image
    Wu, Chao
    Li, Yaqian
    Zhao, Zhibiao
    Liu, Bin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (14) : 19181 - 19199
  • [39] Spatial correlation constrained sparse representation for hyperspectral image classification
    Wu, Z.-B. (fen_jin@163.com), 2012, Science Press (34):
  • [40] GEOMETRY-CONSTRAINED SPATIAL PYRAMID ADAPTATION FOR IMAGE CLASSIFICATION
    Tasli, H. Emrah
    Sicre, Ronan
    Gevers, Theo
    Alatan, A. Aydin
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1051 - 1055