A Review of Codebook Models in Patch-Based Visual Object Recognition

被引:12
|
作者
Ramanan, Amirthalingam [1 ]
Niranjan, Mahesan [1 ,2 ]
机构
[1] Univ Southampton, Sch Elect & Comp Sci, Southampton, Hants, England
[2] Univ Southampton, Informat Signals Images & Syst ISIS Res Grp, Southampton, Hants, England
关键词
Bag-of-features; Cluster analysis; Object recognition; Visual codebook; SIFT; CATEGORIZATION; FEATURES; TEXTURE; SPARSE; SCALE;
D O I
10.1007/s11265-011-0622-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The codebook model-based approach, while ignoring any structural aspect in vision, nonetheless provides state-of-the-art performances on current datasets. The key role of a visual codebook is to provide a way to map the low-level features into a fixed-length vector in histogram space to which standard classifiers can be directly applied. The discriminative power of such a visual codebook determines the quality of the codebook model, whereas the size of the codebook controls the complexity of the model. Thus, the construction of a codebook is an important step which is usually done by cluster analysis. However, clustering is a process that retains regions of high density in a distribution and it follows that the resulting codebook need not have discriminant properties. This is also recognised as a computational bottleneck of such systems. In our recent work, we proposed a resource-allocating codebook, to constructing a discriminant codebook in a one-pass design procedure that slightly outperforms more traditional approaches at drastically reduced computing times. In this review we survey several approaches that have been proposed over the last decade with their use of feature detectors, descriptors, codebook construction schemes, choice of classifiers in recognising objects, and datasets that were used in evaluating the proposed methods.
引用
收藏
页码:333 / 352
页数:20
相关论文
共 50 条
  • [1] A Review of Codebook Models in Patch-Based Visual Object Recognition
    Amirthalingam Ramanan
    Mahesan Niranjan
    [J]. Journal of Signal Processing Systems, 2012, 68 : 333 - 352
  • [2] Resource-Allocating Codebook for Patch-based Face Recognition
    Ramanan, Amirthalingam
    Niranjan, Mahesan
    [J]. 2009 INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, 2009, : 268 - 271
  • [3] Patch-Based Separable Transformer for Visual Recognition
    Sun, Shuyang
    Yue, Xiaoyu
    Zhao, Hengshuang
    Torr, Philip H. S.
    Bai, Song
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 9241 - 9247
  • [4] DPT: Deformable Patch-based Transformer for Visual Recognition
    Chen, Zhiyang
    Zhu, Yousong
    Zhao, Chaoyang
    Hu, Guosheng
    Zeng, Wei
    Wang, Jinqiao
    Tang, Ming
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2899 - 2907
  • [5] One-pass Keypoint Selection to Construct Codebook for Patch-based Object Classification
    Vinoharan, Veerapathirapillai
    Ramanan, Amirthalingam
    [J]. 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY (ICIAFS' 2018), 2018,
  • [6] Experiments with patch-based object classification
    Wijnhoven, R. G. J.
    de With, P. H. N.
    [J]. 2007 IEEE CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, 2007, : 105 - +
  • [7] Patch-based Within-Object Classification
    Aghajanian, Jania
    Warrell, Jonathan
    Prince, Simon J. D.
    Li, Peng
    Rohn, Jennifer L.
    Baum, Buzz
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 1125 - 1132
  • [8] Properties of patch based approaches for the recognition of visual object classes
    Teynor, Alexandra
    Rahtu, Esa
    Setia, Lokesh
    Burkhardt, Hans
    [J]. PATTERN RECOGNITION, PROCEEDINGS, 2006, 4174 : 284 - 293
  • [9] Models for Patch-Based Image Restoration
    Mithun Das Gupta
    Shyamsundar Rajaram
    Nemanja Petrovic
    Thomas S. Huang
    [J]. EURASIP Journal on Image and Video Processing, 2009
  • [10] Models for Patch-Based Image Restoration
    Das Gupta, Mithun
    Rajaram, Shyamsundar
    Petrovic, Nemanja
    Huang, Thomas S.
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2009,