Region-Based Convolutional Networks for Accurate Object Detection and Segmentation

被引:1844
|
作者
Girshick, Ross [1 ]
Donahue, Jeff [2 ]
Darrell, Trevor [2 ]
Malik, Jitendra [2 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
Object recognition; detection; semantic segmentation; convolutional networks; deep learning; transfer learning; REPRESENTATION; HISTOGRAMS; GRADIENTS; FEATURES; SCENE;
D O I
10.1109/TPAMI.2015.2437384
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection performance, as measured on the canonical PASCAL VOC Challenge datasets, plateaued in the final years of the competition. The best-performing methods were complex ensemble systems that typically combined multiple low-level image features with high-level context. In this paper, we propose a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 50 percent relative to the previous best result on VOC 2012-achieving a mAP of 62.4 percent. Our approach combines two ideas: (1) one can apply high-capacity convolutional networks (CNNs) to bottom-up region proposals in order to localize and segment objects and (2) when labeled training data are scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, boosts performance significantly. Since we combine region proposals with CNNs, we call the resulting model an R-CNN or Region-based Convolutional Network. Source code for the complete system is available at http://www.cs.berkeley.edu/similar to rbg/rcnn.
引用
收藏
页码:142 / 158
页数:17
相关论文
共 50 条
  • [1] R-FCN plus plus : Towards Accurate Region-Based Fully Convolutional Networks for Object Detection
    Li, Zeming
    Chen, Yilun
    Yu, Gang
    Deng, Yangdong
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7073 - 7080
  • [2] Small Object Detection via Precise Region-Based Fully Convolutional Networks
    Zhang, Dengyong
    Hu, Jiawei
    Li, Feng
    Ding, Xiangling
    Sangaiah, Arun Kumar
    Sheng, S. Victor
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 69 (02): : 1503 - 1517
  • [3] A Region-Based Efficient Network for Accurate Object Detection
    Guan, Yurong
    Aamir, Muhammad
    Hu, Zhihua
    Abro, Waheed Ahmed
    Rahman, Ziaur
    Dayo, Zaheer Ahmed
    Akram, Shakeel
    [J]. TRAITEMENT DU SIGNAL, 2021, 38 (02) : 481 - 494
  • [4] R-FCN: Object Detection via Region-based Fully Convolutional Networks
    Dai, Jifeng
    Li, Yi
    He, Kaiming
    Sun, Jian
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [5] RR-FCN: Rotational Region-Based Fully Convolutional Networks for Object Detection
    Zhang, Dingqian
    Zhang, Hui
    Li, Haichang
    Hu, Xiaohui
    [J]. ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2018, 2018, 893 : 58 - 70
  • [6] OBJECT DETECTION AND SEGMENTATION ON A HIERARCHICAL REGION-BASED IMAGE REPRESENTATION
    Vilaplana, Veronica
    Marques, Ferran
    Leon, Miriam
    Gasull, Antoni
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3933 - 3936
  • [7] Automatic image segmentation using Region-Based convolutional networks for Melanoma skin cancer detection
    Tovar-Parra, Karen Dayana
    Calvo-Valverde, Luis Alexander
    Montero-Zeledon, Ernesto
    Murillo-Fernandez, Mac Arturo
    Perez-Hidalgo, Jose Esteban
    Gutierrez-Fallas, Dionisio Alberto
    [J]. TECNOLOGIA EN MARCHA, 2022, 35
  • [8] Graphic Logo Detection with Deep Region-based Convolutional Networks
    Li, Yuanyuan
    Shi, Qiuyue
    Deng, Jiangfan
    Su, Fei
    [J]. 2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [9] A region-based convolutional network for nuclei detection and segmentation in microscopy images
    Liang, Hao
    Cheng, Zhiming
    Zhong, Haiqin
    Qu, Aiping
    Chen, Lingna
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 71
  • [10] Masking Salient Object Detection, a Mask Region-based Convolutional Neural Network Analysis for Segmentation of Salient Objects
    Krinski, Bruno A.
    Ruiz, Daniel, V
    Machado, Guilherme Z.
    Todt, Eduardo
    [J]. 2019 LATIN AMERICAN ROBOTICS SYMPOSIUM, 2019 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR) AND 2019 WORKSHOP ON ROBOTICS IN EDUCATION (LARS-SBR-WRE 2019), 2019, : 55 - 60