Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection

被引:34
|
作者
Wang, Keze [1 ,3 ]
Lini, Liang [1 ]
Zuo, Wangmeng [2 ]
Gu, Shuhang [3 ]
Zhang, Lei [3 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China
[3] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
10.1109/CVPR.2016.235
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature representation and object category classification are two key components of most object detection methods. While significant improvements have been achieved for deep feature representation learning, traditional SVM/softmax classifiers remain the dominant methods for the final object category classification. However, SVM/softmax classifiers lack the capacity of explicitly exploiting the complex structure of deep features, as they are purely discriminative methods. The recently proposed discriminative dictionary pair learning (DPL) model involves a fidelity term to minimize the reconstruction loss and a discrimination term to enhance the discriminative capability of the learned dictionary pair, and thus is appropriate for balancing the representation and discrimination to boost object detection performance. In this paper, we propose a novel object detection system by unifying DPL with the convolutional feature learning. Specifically, we incorporate DPL as a Dictionary Pair Classifier Layer (DPCL) into the deep architecture, and develop an end-to-end learning algorithm for optimizing the dictionary pairs and the neural networks simultaneously. Moreover, we design a multi-task loss for guiding our model to accomplish the three correlated tasks: objectness estimation, categoryness computation, and bounding box regression. From the extensive experiments on PASCAL VOC 2007/2012 benchmarks, our approach demonstrates the effectiveness to substantially improve the performances over the popular existing object detection frameworks (e.g., R-CNN [13] and FRCN [12]), and achieves new state-of-the-arts.
引用
收藏
页码:2138 / 2146
页数:9
相关论文
共 50 条
  • [21] Block dictionary learning-driven convolutional neural networks for fewshot face recognition
    Qiao Du
    Feipeng Da
    [J]. The Visual Computer, 2021, 37 : 663 - 672
  • [22] Block dictionary learning-driven convolutional neural networks for fewshot face recognition
    Du, Qiao
    Da, Feipeng
    [J]. VISUAL COMPUTER, 2021, 37 (04): : 663 - 672
  • [23] Object detection and feature base learning with sparse convolutional neural networks
    Gepperth, Alexander R. T.
    [J]. ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, PROCEEDINGS, 2006, 4087 : 221 - 232
  • [24] Using convolutional neural networks for image semantic segmentation and object detection
    Li, Shuangmei
    Huang, Chengning
    [J]. Systems and Soft Computing, 2024, 6
  • [25] Improved Object Detection With Iterative Localization Refinement in Convolutional Neural Networks
    Cheng, Kai-Wen
    Chen, Yie-Tarng
    Fang, Wen-Hsien
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (09) : 2261 - 2275
  • [26] Object Detection utilizing Modified Auto Encoder and Convolutional Neural Networks
    Nourmohammadi-Khiarak, Jalil
    Mazaheri, Samaneh
    Moosavi-Tayebi, Rohollah
    Noorbakhsh-Devlagh, Hamid
    [J]. 2018 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2018, : 43 - 49
  • [27] Fully Convolutional Neural Networks for Dynamic Object Detection in Grid Maps
    Piewak, Florian
    Rehfeld, Timo
    Weber, Michael
    Zoellner, J. Marius
    [J]. 2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017), 2017, : 392 - 398
  • [28] ITERATIVE LOCALIZATION REFINEMENT IN CONVOLUTIONAL NEURAL NETWORKS FOR IMPROVED OBJECT DETECTION
    Cheng, Kai-Wen
    Chen, Yie-Tarng
    Fang, Wen-Hsien
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3643 - 3647
  • [29] Object detection using convolutional neural networks for natural disaster recovery
    Salluri, Deva Kumar
    Bade, Kalpana
    Madala, Gargi
    [J]. International Journal of Safety and Security Engineering, 2020, 10 (02) : 285 - 291
  • [30] Subcategory-aware Convolutional Neural Networks for Object Proposals and Detection
    Xiang, Yu
    Choi, Wongun
    Lin, Yuanqing
    Savarese, Silvio
    [J]. 2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 924 - 933