Rich feature hierarchies for accurate object detection and semantic segmentation

被引:12622
|
作者
Girshick, Ross [1 ]
Donahue, Jeff [1 ]
Darrell, Trevor [1 ]
Malik, Jitendra [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
关键词
D O I
10.1109/CVPR.2014.81
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that typically combine multiple low-level image features with high-level context. In this paper, we propose a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012-achieving a mAP of 53.3%. Our approach combines two key insights: (1) one can apply high-capacity convolutional neural networks (CNNs) to bottom-up region proposals in order to localize and segment objects and (2) when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost. Since we combine region proposals with CNNs, we call our method R-CNN: Regions with CNN features. Source code for the complete system is available at http://www.cs.berkeley.edu/similar to rbg/rcnn.
引用
收藏
页码:580 / 587
页数:8
相关论文
共 50 条
  • [41] SCAN: Semantic Context Aware Network for Accurate Small Object Detection
    Guan, Linting
    Wu, Yan
    Zhao, Junqiao
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2018, 11 (01) : 951 - 961
  • [42] One For All: A Mutual Enhancement Method for Object Detection and Semantic Segmentation
    Zhang, Shichao
    Zhang, Zhe
    Sun, Libo
    Qin, Wenhu
    APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [43] Using convolutional neural networks for image semantic segmentation and object detection
    Li, Shuangmei
    Huang, Chengning
    SYSTEMS AND SOFT COMPUTING, 2024, 6
  • [44] MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features
    Chen, Liang-Chieh
    Hermans, Alexander
    Papandreou, George
    Schroff, Florian
    Wang, Peng
    Adam, Hartwig
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4013 - 4022
  • [45] Real-time Object Detection and Semantic Segmentation for Autonomous Driving
    Li, Baojun
    Liu, Shun
    Xu, Weichao
    Qiu, Wei
    MIPPR 2017: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2018, 10608
  • [46] Segmentation-based multi-class semantic object detection
    Vieux, Remi
    Benois-Pineau, Jenny
    Domenger, Jean-Philippe
    Braquelaire, Achille
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (02) : 305 - 326
  • [47] Research on Multitask Deep Learning Network for Semantic Segmentation and Object Detection
    Rui, Ting
    Xiao, Feng
    Tang, Jian
    Zhang, Fukai
    Yang, Chengsong
    Liu, Min
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 708 - 718
  • [48] AdvNet: Multi-Task Fusion of Object Detection and Semantic Segmentation
    Liu, Xiaohan
    Wang, Heng
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3359 - 3362
  • [49] Fusing Semantic Segmentation and Object Detection for Visual SLAM in Dynamic Scenes
    Yu, Peilin
    Guo, Chi
    Liu, Yang
    Zhang, Huyin
    PROCEEDINGS OF 27TH ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY, VRST 2021, 2021,
  • [50] Traffic Scene Perception Based on Joint Object Detection and Semantic Segmentation
    Libo Weng
    Yingjie Wang
    Fei Gao
    Neural Processing Letters, 2022, 54 : 5333 - 5349