Regionlets for Generic Object Detection

Cited by: 77
Authors
Wang, Xiaoyu [1 ]
Yang, Ming [2 ]
Zhu, Shenghuo [3 ]
Lin, Yuanqing [1 ]
Affiliations
[1] NEC Labs Amer, Dept Media Analyt, Cupertino, CA 95014 USA
[2] Facebook Inc, AI Res, Menlo Pk, CA USA
[3] Alibaba Grp, Hangzhou, Peoples R China
Keywords
Object detection; regionlet; boosting; object proposals; selective search; deep convolutional neural network; RECOGNITION; HISTOGRAMS; GRADIENTS;
DOI
10.1109/TPAMI.2015.2389830
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Generic object detection must handle varying degrees of variation, caused by viewpoint changes or deformations in distinct object classes, while keeping computation tractable. This demands descriptive and flexible object representations that can be evaluated efficiently at many locations. We propose to model an object class with a cascaded boosting classifier that integrates various types of features from competing local regions, each of which may consist of a group of subregions called regionlets. A regionlet is a base feature extraction region defined proportionally to a detection window at an arbitrary resolution (i.e., size and aspect ratio). Regionlets are organized in small groups with stable relative positions so that they can delineate fine-grained spatial layouts inside objects. The features within one group are aggregated into a one-dimensional feature so that the representation tolerates deformations. The most discriminative regionlets for each object class are selected through a boosting learning procedure. Our regionlet approach achieves very competitive performance on popular multi-class detection benchmark datasets with a single method, without any context. It achieves a detection mean average precision of 41.7 percent on the PASCAL VOC 2007 dataset and 39.7 percent on VOC 2010 for 20 object categories. We further develop support pixel integral images to efficiently augment regionlet features with the responses learned by deep convolutional neural networks. Our regionlet-based method won second place in the ImageNet Large Scale Visual Object Recognition Challenge (ILSVRC 2013).
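The two ideas in the abstract that are easiest to misread are "defined proportionally to a detection window" and "aggregated into a one-dimensional feature within one group". The sketch below illustrates both under stated assumptions: regionlets are stored as fractional boxes, the per-regionlet feature is a plain mean intensity, and the aggregation is a maximum. None of these choices is specified in the abstract, and the function names are hypothetical, so this is an illustration of the idea and not the authors' implementation.

```python
from typing import List, Sequence, Tuple

# (x0, y0, x1, y1); fractions of the window for regionlets, pixels for windows.
Box = Tuple[float, float, float, float]


def to_absolute(rel: Box, window: Box) -> Box:
    """Map a regionlet given as window fractions to pixel coordinates."""
    wx0, wy0, wx1, wy1 = window
    w, h = wx1 - wx0, wy1 - wy0
    rx0, ry0, rx1, ry1 = rel
    return (wx0 + rx0 * w, wy0 + ry0 * h, wx0 + rx1 * w, wy0 + ry1 * h)


def mean_intensity(image: Sequence[Sequence[float]], box: Box) -> float:
    """Toy per-regionlet feature (assumption): mean pixel value in the box.

    Assumes the rounded box covers at least one pixel.
    """
    x0, y0, x1, y1 = (int(round(v)) for v in box)
    pixels = [image[y][x] for y in range(y0, y1) for x in range(x0, x1)]
    return sum(pixels) / len(pixels)


def group_feature(image: Sequence[Sequence[float]], window: Box,
                  regionlets: List[Box]) -> float:
    """Collapse one group of regionlets into a single scalar.

    Taking the maximum is an assumption made here for illustration.
    """
    return max(mean_intensity(image, to_absolute(r, window)) for r in regionlets)
```

Because the regionlets are fractions of the window, the same group definition applies unchanged to windows of any size or aspect ratio, and collapsing the group to one value means the response does not depend on which regionlet in the group fired.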
Pages: 2071-2084
Page count: 14
Related papers
50 in total
  • [1] Regionlets for Generic Object Detection
    Wang, Xiaoyu
    Yang, Ming
    Zhu, Shenghuo
    Lin, Yuanqing
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 17 - 24
  • [2] Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection
    Xu, Hongyu
    Lv, Xutao
    Wang, Xiaoyu
    Ren, Zhou
    Bodla, Navaneeth
    Chellappa, Rama
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (06) : 1914 - 1927
  • [3] Deep Regionlets for Object Detection
    Xu, Hongyu
    Lv, Xutao
    Wang, Xiaoyu
    Ren, Zhou
    Bodla, Navaneeth
    Chellappa, Rama
    [J]. COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 : 827 - 844
  • [4] Accurate Object Detection with Location Relaxation and Regionlets Re-localization
    Long, Chengjiang
    Wang, Xiaoyu
    Hua, Gang
    Yang, Ming
    Lin, Yuanqing
    [J]. COMPUTER VISION - ACCV 2014, PT I, 2015, 9003 : 260 - 275
  • [5] Analysis of Regionlets for Pedestrian Detection
    Salscheider, Niels Ole
    Rehder, Eike
    Lauer, Martin
    [J]. ICPRAM: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2017, : 26 - 32
  • [6] Deep Learning for Generic Object Detection: A Survey
    Liu, Li
    Ouyang, Wanli
    Wang, Xiaogang
    Fieguth, Paul
    Chen, Jie
    Liu, Xinwang
    Pietikainen, Matti
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (02) : 261 - 318
  • [7] Local structured representation for generic object detection
    Junge Zhang
    Kaiqi Huang
    Tieniu Tan
    Zhaoxiang Zhang
    [J]. Frontiers of Computer Science, 2017, 11 : 632 - 648
  • [8] Deep Learning for Generic Object Detection: A Survey
    Li Liu
    Wanli Ouyang
    Xiaogang Wang
    Paul Fieguth
    Jie Chen
    Xinwang Liu
    Matti Pietikäinen
    [J]. International Journal of Computer Vision, 2020, 128 : 261 - 318
  • [9] Local structured representation for generic object detection
    Zhang, Junge
    Huang, Kaiqi
    Tan, Tieniu
    Zhang, Zhaoxiang
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2017, 11 (04) : 632 - 648
  • [10] Occlusion Handling in Generic Object Detection: A Review
    Saleh, Kaziwa
    Szenasi, Sandor
    Vamossy, Zoltan
    [J]. 2021 IEEE 19TH WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI 2021), 2021, : 477 - 484