FoveaBox: Beyound Anchor-Based Object Detection

被引:662
|
作者
Kong, Tao [1 ]
Sun, Fuchun [2 ]
Liu, Huaping [2 ]
Jiang, Yuning [1 ]
Li, Lei [1 ]
Shi, Jianbo [3 ]
机构
[1] ByteDance AI Lab, Beijing 100098, Peoples R China
[2] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol BNRis, Dept Comp Sci & Lochnol, Beijing 100084, Peoples R China
[3] Univ Penn, Grasp Lab, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
Object detection; anchor free; foveabox;
D O I
10.1109/TIP.2020.3002345
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present FoveaBox, an accurate, flexible, and completely anchor-free framework for object detection. While almost all state-of-the-art object detectors utilize predefined anchors to enumerate possible locations, scales and aspect ratios for the search of the objects, their performance and generalization ability are also limited to the design of anchors. Instead, FoveaBox directly learns the object existing possibility and the bounding box coordinates without anchor reference. This is achieved by: (a) predicting category-sensitive semantic maps for the object existing possibility, and (b) producing category-agnostic bounding box for each position that potentially contains an object. The scales of target boxes are naturally associated with feature pyramid representations. In FoveaBox, an instance is assigned to adjacent feature levels to make the model more accurate.We demonstrate its effectiveness on standard benchmarks and report extensive experimental analysis. Without bells and whistles, FoveaBox achieves state-of-the-art single model performance on the standard COCO and Pascal VOC object detection benchmark. More importantly, FoveaBox avoids all computation and hyper-parameters related to anchor boxes, which are often sensitive to the final detection performance. We believe the simple and effective approach will serve as a solid baseline and help ease future research for object detection. The code has been made publicly available at https://github.com/taokong/FoveaBox.
引用
收藏
页码:7389 / 7398
页数:10
相关论文
共 50 条
  • [1] Analysis and a Solution of Momentarily Missed Detection for Anchor-based Object Detectors
    Hosoya, Yusuke
    Suganuma, Masanori
    Okatani, Takayuki
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1399 - 1407
  • [2] Efficient Hardware Post Processing of Anchor-Based Object Detection on FPGA
    Zhang, Hui
    Wu, Wei
    Ma, Yufei
    Wang, Zhongfeng
    2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020), 2020, : 580 - 585
  • [3] Anchor-Based Transformer for Temporal LiDAR 3D Object Detection
    Gu, Rongqi
    Wu, Fei
    Liu, Peigen
    Yang, Chu
    Lu, Yaohan
    Chen, Guang
    2024 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS, ICARM 2024, 2024, : 45 - 50
  • [4] SWDet: Anchor-Based Object Detector for Solid Waste Detection in Aerial Images
    Zhou, Liming
    Rao, Xiaohan
    Li, Yahui
    Zuo, Xianyu
    Liu, Yang
    Lin, Yinghao
    Yang, Yong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 306 - 320
  • [5] Adaptive Training Strategies for Small Object Detection Using Anchor-Based Detectors
    Zhang, Shenmeng
    Sun, Yongqing
    Su, Jia
    Gan, Guoxi
    Wen, Zonghui
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 28 - 39
  • [6] ANCHOR-BASED GROUP DETECTION IN CROWD SCENES
    Chen, Mulin
    Wang, Qi
    Li, Xuelong
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1378 - 1382
  • [7] Anchor-Based 6D Object Pose Estimation
    Liu, Zehao
    Wang, Hao
    Liu, Fuchang
    2021 IEEE 7TH INTERNATIONAL CONFERENCE ON VIRTUAL REALITY (ICVR 2021), 2021, : 33 - 40
  • [8] Location-Aware Box Reasoning for Anchor-Based Single-Shot Object Detection
    Ma, Wenchi
    Li, Kaidong
    Wang, Guanghui
    IEEE ACCESS, 2020, 8 : 129300 - 129309
  • [9] Unbiased Teacher v2: Semi-supervised Object Detection for Anchor-free and Anchor-based Detectors
    Liu, Yen-Cheng
    Ma, Chih-Yao
    Kira, Zsolt
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9809 - 9818
  • [10] A fast and efficient green apple object detection model based on Foveabox
    Jia, Weikuan
    Wang, Zhifen
    Zhang, Zhonghua
    Yang, Xinbo
    Hou, Sujuan
    Zheng, Yuanjie
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 5156 - 5169