Rich feature hierarchies for accurate object detection and semantic segmentation

被引:12622
|
作者
Girshick, Ross [1 ]
Donahue, Jeff [1 ]
Darrell, Trevor [1 ]
Malik, Jitendra [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
关键词
D O I
10.1109/CVPR.2014.81
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that typically combine multiple low-level image features with high-level context. In this paper, we propose a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012-achieving a mAP of 53.3%. Our approach combines two key insights: (1) one can apply high-capacity convolutional neural networks (CNNs) to bottom-up region proposals in order to localize and segment objects and (2) when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost. Since we combine region proposals with CNNs, we call our method R-CNN: Regions with CNN features. Source code for the complete system is available at http://www.cs.berkeley.edu/similar to rbg/rcnn.
引用
收藏
页码:580 / 587
页数:8
相关论文
共 50 条
  • [21] The Amalgamation of the Object Detection and Semantic Segmentation for Steel Surface Defect Detection
    Sharma, Mansi
    Lim, Jongtae
    Lee, Hansung
    APPLIED SCIENCES-BASEL, 2022, 12 (12):
  • [22] Establishing effective learning bridge cross multi-scale feature maps for object detection and semantic segmentation
    Wang, Bo
    Feng, Zeyu
    Li, Jun
    Sheng, Qinghong
    Ling, Xiao
    Liu, Xiang
    Wang, Haowen
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2025, 46 (02) : 509 - 537
  • [23] Efficient Task-Specific Feature Re-Fusion for More Accurate Object Detection and Instance Segmentation
    Wang, Cheng
    Fang, Yuxin
    Fang, Jiemin
    Guo, Peng
    Wu, Rui
    Huang, He
    Wang, Xinggang
    Huang, Chang
    Liu, Wenyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5350 - 5360
  • [24] Robust Object Detection and Localization Using Semantic Segmentation Network
    Raghu, A. Francis Alexander
    Ananth, J. P.
    COMPUTER JOURNAL, 2021, 64 (10): : 1531 - 1548
  • [25] Semantic Object Segmentation via Detection in Weakly Labeled Video
    Zhang, Yu
    Chen, Xiaowu
    Li, Jia
    Wang, Chen
    Xia, Changqun
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3641 - 3649
  • [26] Joint Multiclass Object Detection and Semantic Segmentation for Autonomous Driving
    Abdigapporov, Shakhboz
    Miraliev, Shokhrukh
    Kakani, Vijay
    Kim, Hakil
    IEEE ACCESS, 2023, 11 : 37637 - 37649
  • [27] Adaptive Generation of Weakly Supervised Semantic Segmentation for Object Detection
    Shibao Li
    Yixuan Liu
    Yunwu Zhang
    Yi Luo
    Jianhang Liu
    Neural Processing Letters, 2023, 55 : 657 - 670
  • [28] Leveraging Spatial-semantic Information in Object Detection and Segmentation
    Guo Q.-Z.
    Yuan C.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (06): : 2776 - 2788
  • [29] Adaptive Generation of Weakly Supervised Semantic Segmentation for Object Detection
    Li, Shibao
    Liu, Yixuan
    Zhang, Yunwu
    Luo, Yi
    Liu, Jianhang
    NEURAL PROCESSING LETTERS, 2023, 55 (01) : 657 - 670
  • [30] SAFPN: a full semantic feature pyramid network for object detection
    Wang, Gaihua
    Li, Qi
    Wang, Nengyuan
    Liu, Hong
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (04) : 1729 - 1739