Rich feature hierarchies for accurate object detection and semantic segmentation

被引：12622

作者：

Girshick, Ross ^{[1
]}

Donahue, Jeff ^{[1
]}

Darrell, Trevor ^{[1
]}

Malik, Jitendra ^{[1
]}

机构：

[1] Univ Calif Berkeley, Berkeley, CA 94720 USA

来源：

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014年

关键词：

D O I：

10.1109/CVPR.2014.81

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that typically combine multiple low-level image features with high-level context. In this paper, we propose a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012-achieving a mAP of 53.3%. Our approach combines two key insights: (1) one can apply high-capacity convolutional neural networks (CNNs) to bottom-up region proposals in order to localize and segment objects and (2) when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost. Since we combine region proposals with CNNs, we call our method R-CNN: Regions with CNN features. Source code for the complete system is available at http://www.cs.berkeley.edu/similar to rbg/rcnn.

引用

页码：580 / 587

页数：8

共 50 条

[41] SCAN: Semantic Context Aware Network for Accurate Small Object Detection
Guan, Linting
Wu, Yan
Zhao, Junqiao
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2018, 11 (01) : 951 - 961
[42] One For All: A Mutual Enhancement Method for Object Detection and Semantic Segmentation
Zhang, Shichao
Zhang, Zhe
Sun, Libo
Qin, Wenhu
APPLIED SCIENCES-BASEL, 2020, 10 (01):
[43] Using convolutional neural networks for image semantic segmentation and object detection
Li, Shuangmei
Huang, Chengning
SYSTEMS AND SOFT COMPUTING, 2024, 6
[44] MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features
Chen, Liang-Chieh
Hermans, Alexander
Papandreou, George
Schroff, Florian
Wang, Peng
Adam, Hartwig
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4013 - 4022
[45] Real-time Object Detection and Semantic Segmentation for Autonomous Driving
Li, Baojun
Liu, Shun
Xu, Weichao
Qiu, Wei
MIPPR 2017: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2018, 10608
[46] Segmentation-based multi-class semantic object detection
Vieux, Remi
Benois-Pineau, Jenny
Domenger, Jean-Philippe
Braquelaire, Achille
MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (02) : 305 - 326
[47] Research on Multitask Deep Learning Network for Semantic Segmentation and Object Detection
Rui, Ting
Xiao, Feng
Tang, Jian
Zhang, Fukai
Yang, Chengsong
Liu, Min
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 708 - 718
[48] AdvNet: Multi-Task Fusion of Object Detection and Semantic Segmentation
Liu, Xiaohan
Wang, Heng
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3359 - 3362
[49] Fusing Semantic Segmentation and Object Detection for Visual SLAM in Dynamic Scenes
Yu, Peilin
Guo, Chi
Liu, Yang
Zhang, Huyin
PROCEEDINGS OF 27TH ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY, VRST 2021, 2021,
[50] Traffic Scene Perception Based on Joint Object Detection and Semantic Segmentation
Libo Weng
Yingjie Wang
Fei Gao
Neural Processing Letters, 2022, 54 : 5333 - 5349

← 1 2 3 4 5 →