CenterNet plus plus for Object Detection

被引:5
|
作者
Duan, Kaiwen [1 ]
Bai, Song [2 ]
Xie, Lingxi [3 ]
Qi, Honggang [1 ]
Huang, Qingming [1 ]
Tian, Qi [3 ]
机构
[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 101408, Peoples R China
[2] Univ Oxford, Oxford OX1 2JD, Oxfordshire, England
[3] Huawei Inc, Shenzhen 518129, Peoples R China
基金
中国国家自然科学基金;
关键词
Anchor-free; bottom-up; deep learning; object detection;
D O I
10.1109/TPAMI.2023.3342120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are two mainstream approaches for object detection: top-down and bottom-up. The state-of-the-art approaches are mainly top-down methods. In this paper, we demonstrate that bottom-up approaches show competitive performance compared with top-down approaches and have higher recall rates. Our approach, named CenterNet, detects each object as a triplet of keypoints (top-left and bottom-right corners and the center keypoint). We first group the corners according to some designed cues and confirm the object locations based on the center keypoints. The corner keypoints allow the approach to detect objects of various scales and shapes and the center keypoint reduces the confusion introduced by a large number of false-positive proposals. Our approach is an anchor-free detector because it does not need to define explicit anchor boxes. We adapt our approach to backbones with different structures, including 'hourglass'-like networks and 'pyramid'-like networks, which detect objects in single-resolution and multi-resolution feature maps, respectively. On the MS-COCO dataset, CenterNet with Res2Net-101 and Swin-Transformer achieve average precisions (APs) of 53.7% and 57.1%, respectively, outperforming all existing bottom-up detectors and achieving state-of-the-art performance. We also design a real-time CenterNet model, which achieves a good trade-off between accuracy and speed, with an AP of 43.6% at 30.5 frames per second (FPS).
引用
收藏
页码:3509 / 3521
页数:13
相关论文
共 50 条
  • [31] Fusion plus plus : Volumetric Object-Level SLAM
    McCormac, John
    Clark, Ronald
    Bloesch, Michael
    Davison, Andrew J.
    Leutenegger, Stefan
    [J]. 2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 32 - 41
  • [32] Real-time Object Detection with FPGA Using CenterNet
    Solovyev, Roman A.
    Telpukhov, Dmitry, V
    Romanova, IrMa I.
    Kustov, Alexander G.
    Mkrtchan, Ilya A.
    [J]. PROCEEDINGS OF THE 2021 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (ELCONRUS), 2021, : 2029 - 2034
  • [33] CADC plus plus : Advanced Consensus-Aware Dynamic Convolution for Co-Salient Object Detection
    Zhang, Ni
    Liu, Nian
    Nan, Fang
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 2741 - 2757
  • [34] Application of an improved CenterNet in remote sensing images object detection
    Tian Z.
    Zhang H.
    Wang K.
    Liu S.
    Zou Q.
    Zhao Z.
    Chen Y.
    [J]. National Remote Sensing Bulletin, 2023, 27 (12) : 2706 - 2715
  • [35] Intelligent Vehicle Object Detection Algorithm Based on Lightweight CenterNet
    Yue, Yongheng
    Ning, Ruihou
    [J]. Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2024, 52 (08): : 45 - 55
  • [36] FEATURE ENHANCED CENTERNET FOR OBJECT DETECTION IN REMOTE SENSING IMAGES
    Zhang, Tong
    Wang, Guanqun
    Zhuang, Yin
    Chen, He
    Shi, Hao
    Chen, Liang
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1639 - 1642
  • [37] RADAR plus RGB FUSION FOR ROBUST OBJECT DETECTION IN AUTONOMOUS VEHICLE
    Yadav, Ritu
    Vierling, Axel
    Berns, Karsten
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1986 - 1990
  • [38] R-FCN plus plus : Towards Accurate Region-Based Fully Convolutional Networks for Object Detection
    Li, Zeming
    Chen, Yilun
    Yu, Gang
    Deng, Yangdong
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7073 - 7080
  • [39] Formal Verification of Object Layout for C plus plus Multiple Inheritance
    Ramananandro, Tahina
    Dos Reis, Gabriel
    Leroy, Xavier
    [J]. ACM SIGPLAN NOTICES, 2011, 46 (01) : 67 - 79
  • [40] Pin plus plus : An Object-Oriented Framework for Writing Pintools
    Hill, James H.
    Feiock, Dennis C.
    [J]. ACM SIGPLAN NOTICES, 2015, 50 (03) : 133 - 141