Rich feature hierarchies for accurate object detection and semantic segmentation

Cited by: 12622
Authors
Girshick, Ross [1]
Donahue, Jeff [1]
Darrell, Trevor [1]
Malik, Jitendra [1]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
DOI
10.1109/CVPR.2014.81
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that typically combine multiple low-level image features with high-level context. In this paper, we propose a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012, achieving a mAP of 53.3%. Our approach combines two key insights: (1) one can apply high-capacity convolutional neural networks (CNNs) to bottom-up region proposals in order to localize and segment objects, and (2) when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost. Since we combine region proposals with CNNs, we call our method R-CNN: Regions with CNN features. Source code for the complete system is available at http://www.cs.berkeley.edu/~rbg/rcnn.
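The abstract states the gain in relative terms, so it may help to see what the two quoted figures imply together. The following is a small arithmetic sketch, not taken from the paper: the function names are hypothetical, and the only inputs are the 53.3% mAP and the "more than 30%" relative improvement quoted above. It shows that the previous best result on VOC 2012 must have been below roughly 41% mAP.

```python
def relative_gain(new_map: float, old_map: float) -> float:
    """Relative improvement of new_map over old_map, as a fraction."""
    return (new_map - old_map) / old_map


def implied_baseline(new_map: float, gain: float) -> float:
    """Baseline mAP that would yield exactly `gain` relative improvement."""
    return new_map / (1.0 + gain)


# Figures quoted in the abstract: 53.3% mAP, more than 30% relative gain.
# A gain of exactly 30% gives an upper bound on the previous best result;
# a larger gain would mean a lower baseline.
upper_bound = implied_baseline(53.3, 0.30)
print(f"previous best mAP is below about {upper_bound:.1f}%")
```

Because the gain is relative, the same 53.3% corresponds to an absolute improvement of more than 12 percentage points under these figures.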
Pages: 580-587
Page count: 8