Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection

被引:34
|
作者
Wang, Keze [1 ,3 ]
Lini, Liang [1 ]
Zuo, Wangmeng [2 ]
Gu, Shuhang [3 ]
Zhang, Lei [3 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China
[3] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
10.1109/CVPR.2016.235
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature representation and object category classification are two key components of most object detection methods. While significant improvements have been achieved for deep feature representation learning, traditional SVM/softmax classifiers remain the dominant methods for the final object category classification. However, SVM/softmax classifiers lack the capacity of explicitly exploiting the complex structure of deep features, as they are purely discriminative methods. The recently proposed discriminative dictionary pair learning (DPL) model involves a fidelity term to minimize the reconstruction loss and a discrimination term to enhance the discriminative capability of the learned dictionary pair, and thus is appropriate for balancing the representation and discrimination to boost object detection performance. In this paper, we propose a novel object detection system by unifying DPL with the convolutional feature learning. Specifically, we incorporate DPL as a Dictionary Pair Classifier Layer (DPCL) into the deep architecture, and develop an end-to-end learning algorithm for optimizing the dictionary pairs and the neural networks simultaneously. Moreover, we design a multi-task loss for guiding our model to accomplish the three correlated tasks: objectness estimation, categoryness computation, and bounding box regression. From the extensive experiments on PASCAL VOC 2007/2012 benchmarks, our approach demonstrates the effectiveness to substantially improve the performances over the popular existing object detection frameworks (e.g., R-CNN [13] and FRCN [12]), and achieves new state-of-the-arts.
引用
收藏
页码:2138 / 2146
页数:9
相关论文
共 50 条
  • [41] Object Detection and Depth Estimation Approach Based on Deep Convolutional Neural Networks
    Wang, Huai-Mu
    Lin, Huei-Yung
    Chang, Chin-Chen
    [J]. SENSORS, 2021, 21 (14)
  • [42] Learning Point Processes and Convolutional Neural Networks for Object Detection in Satellite Images
    Mabon, Jules
    Ortner, Mathias
    Zerubia, Josiane
    [J]. REMOTE SENSING, 2024, 16 (06)
  • [43] A New Method Based on Deep Convolutional Neural Networks for Object Detection and Classification
    Liu, Yan
    Zhuxngjie, Zhu
    Zhang, Qiuhui
    Ding, Xiaotian
    Wang, Ruonan
    Han, Senyao
    Li, Chi
    [J]. AATCC JOURNAL OF RESEARCH, 2021, 8 (1_SUPPL): : 38 - 46
  • [44] Multiscale Convolutional Neural Networks for Geospatial Object Detection in VHR Satellite Images
    Yao, Qunli
    Hu, Xian
    Lei, Hong
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (01) : 23 - 27
  • [45] Salient Object Detection Using Cascaded Convolutional Neural Networks and Adversarial Learning
    Tang, Youbao
    Wu, Xiangqian
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (09) : 2237 - 2247
  • [46] Enhanced Object Detection With Deep Convolutional Neural Networks for Advanced Driving Assistance
    Wei, Jian
    He, Jianhua
    Zhou, Yi
    Chen, Kai
    Tang, Zuoyin
    Xiong, Zhiliang
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (04) : 1572 - 1583
  • [47] Adaptive Deep Convolutional Neural Networks for Scene-Specific Object Detection
    Li, Xudong
    Ye, Mao
    Liu, Yiguang
    Zhu, Ce
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (09) : 2538 - 2551
  • [48] Object Detection Using Convolutional Neural Networks in a Coarse-to-Fine Manner
    Li, Xiaobin
    Wang, Shengjin
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (11) : 2037 - 2041
  • [49] Object Detection for Unmanned Aerial Vehicle Camera via Convolutional Neural Networks
    Saetchnikov, Ivan V.
    Tcherniavskaia, Elina A.
    Skakun, Victor V.
    [J]. IEEE Journal on Miniaturization for Air and Space Systems, 2021, 2 (02): : 98 - 103
  • [50] Fusing LiDAR and Color Imagery for Object Detection using Convolutional Neural Networks
    Farahnakian, Fahimeh
    Heikkonen, Jukka
    [J]. PROCEEDINGS OF 2020 23RD INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2020), 2020, : 241 - 247