Regionlets for Generic Object Detection

被引：77

作者：

Wang, Xiaoyu ^{[1
]}

Yang, Ming ^{[2
]}

Zhu, Shenghuo ^{[3
]}

Lin, Yuanqing ^{[1
]}

机构：

[1] NEC Labs Amer, Dept Media Analyt, Cupertino, CA 95014 USA

[2] Facebook Inc, AI Res, Menlo Pk, CA USA

[3] Alibaba Grp, Hangzhou, Peoples R China

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2015年 / 37卷 / 10期

关键词：

Object detection; regionlet; boosting; object proposals; selective search; deep convolutional neural network; RECOGNITION; HISTOGRAMS; GRADIENTS;

D O I：

10.1109/TPAMI.2015.2389830

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Generic object detection is confronted by dealing with different degrees of variations, caused by viewpoints or deformations in distinct object classes, with tractable computations. This demands for descriptive and flexible object representations which can be efficiently evaluated in many locations. We propose to model an object class with a cascaded boosting classifier which integrates various types of features from competing local regions, each of which may consist of a group of subregions, named as regionlets. A regionlet is a base feature extraction region defined proportionally to a detection window at an arbitrary resolution (i.e., size and aspect ratio). These regionlets are organized in small groups with stable relative positions to be descriptive to delineate fine-grained spatial layouts inside objects. Their features are aggregated into a one-dimensional feature within one group so as to be flexible to tolerate deformations. The most discriminative regionlets for each object class are selected through a boosting learning procedure. Our regionlet approach achieves very competitive performance on popular multi-class detection benchmark datasets with a single method, without any context. It achieves a detection mean average precision of 41.7 percent on the PASCAL VOC 2007 dataset, and 39.7 percent on the VOC 2010 for 20 object categories. We further develop support pixel integral images to efficiently augment regionlet features with the responses learned by deep convolutional neural networks. Our regionlet based method won second place in the ImageNet Large Scale Visual Object Recognition Challenge (ILSVRC 2013).

引用

页码：2071 / 2084

页数：14

共 50 条

[1] Regionlets for Generic Object Detection
Wang, Xiaoyu
Yang, Ming
Zhu, Shenghuo
Lin, Yuanqing
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 17 - 24
[2] Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection
Xu, Hongyu
Lv, Xutao
Wang, Xiaoyu
Ren, Zhou
Bodla, Navaneeth
Chellappa, Rama
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (06) : 1914 - 1927
[3] Deep Regionlets for Object Detection
Xu, Hongyu
Lv, Xutao
Wang, Xiaoyu
Ren, Zhou
Bodla, Navaneeth
Chellappa, Rama
[J]. COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 : 827 - 844
[4] Accurate Object Detection with Location Relaxation and Regionlets Re-localization
Long, Chengjiang
Wang, Xiaoyu
Hua, Gang
Yang, Ming
Lin, Yuanqing
[J]. COMPUTER VISION - ACCV 2014, PT I, 2015, 9003 : 260 - 275
[5] Analysis of Regionlets for Pedestrian Detection
Salscheider, Niels Ole
Rehder, Eike
Lauer, Martin
[J]. ICPRAM: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2017, : 26 - 32
[6] Deep Learning for Generic Object Detection: A Survey
Liu, Li
Ouyang, Wanli
Wang, Xiaogang
Fieguth, Paul
Chen, Jie
Liu, Xinwang
Pietikainen, Matti
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (02) : 261 - 318
[7] Local structured representation for generic object detection
Junge Zhang
Kaiqi Huang
Tieniu Tan
Zhaoxiang Zhang
[J]. Frontiers of Computer Science, 2017, 11 : 632 - 648
[8] Deep Learning for Generic Object Detection: A Survey
Li Liu
Wanli Ouyang
Xiaogang Wang
Paul Fieguth
Jie Chen
Xinwang Liu
Matti Pietikäinen
[J]. International Journal of Computer Vision, 2020, 128 : 261 - 318
[9] Local structured representation for generic object detection
Zhang, Junge
Huang, Kaiqi
Tan, Tieniu
Zhang, Zhaoxiang
[J]. FRONTIERS OF COMPUTER SCIENCE, 2017, 11 (04) : 632 - 648
[10] Occlusion Handling in Generic Object Detection: A Review
Saleh, Kaziwa
Szenasi, Sandor
Vamossy, Zoltan
[J]. 2021 IEEE 19TH WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI 2021), 2021, : 477 - 484

← 1 2 3 4 5 →