Bounding box regression with balance for harmonious object detection

Cited by: 1
Authors
Wang, Chenzhong [1]
Gong, Xun [1, 2]
Affiliations
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Sichuan, Peoples R China
[2] Mfg Ind Chains Collaborat & Informat Support Techn, Chengdu 610031, Sichuan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Object detection; Reinforcement learning; Bounding box regression
DOI
10.1016/j.jvcir.2022.103665
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Localization is an essential part of object detection. It is usually accomplished by bounding box regression guided by ℓn-norm-based or IoU-based loss functions, where IoU is known for its scale invariance. However, introducing scale invariance into the regression loss in traditional IoU-based methods may bias training in favor of smaller boxes and cause redundancy and unstable oscillations. To address these shortcomings of IoU-based losses, we propose a Scale-Balanced Factor (SF) that stabilizes the regression process via a simple adaptive factor. Furthermore, to compensate for the imbalance between different types of losses caused by SF and other IoU-based loss functions, regression losses are always multiplied by a hyperparameter, which is purely empirical and hard to tune to an optimum. To address this issue, a Multi-Task Reinforced Equilibrium (MRE) is proposed to dynamically adjust the learning rate of each task based on reinforcement learning. The MRE can guarantee more balanced parameters and maximize the benefit of SF or other improvements to IoU. By incorporating the proposed SF and MRE into classic detectors (e.g., RetinaNet, YOLO, and Faster R-CNN), we achieve significant performance gains on MS COCO (0.8 AP to 1.9 AP) and PASCAL VOC (0.6 AP to 2.2 AP).
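The scale invariance that the abstract attributes to IoU can be illustrated with a minimal sketch. It compares a plain IoU loss (1 - IoU) with an ℓ1 loss on the same pair of boxes at two scales. This shows only the standard losses the abstract refers to, not the proposed SF or MRE, which the abstract does not define. The function names (`iou`, `iou_loss`, `l1_loss`, `scale`) and the example boxes are illustrative assumptions.

```python
from typing import Tuple

# A box is (x1, y1, x2, y2) with x1 < x2 and y1 < y2 (assumed convention).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def iou_loss(pred: Box, target: Box) -> float:
    """Plain IoU loss: 1 - IoU."""
    return 1.0 - iou(pred, target)


def l1_loss(pred: Box, target: Box) -> float:
    """Sum of absolute coordinate differences (l1-norm loss)."""
    return sum(abs(p - t) for p, t in zip(pred, target))


def scale(box: Box, s: float) -> Box:
    """Scale all coordinates of a box by the factor s."""
    return (s * box[0], s * box[1], s * box[2], s * box[3])


# Illustrative boxes: the prediction is offset by 2 units in x and y.
pred = (0.0, 0.0, 10.0, 10.0)
target = (2.0, 2.0, 12.0, 12.0)

for s in (1.0, 10.0):
    p, t = scale(pred, s), scale(target, s)
    print(f"scale={s}: iou_loss={iou_loss(p, t):.4f}, l1_loss={l1_loss(p, t):.1f}")
```

In this sketch the IoU loss is the same at both scales, while the ℓ1 loss grows in proportion to the scale factor. The same relative misalignment is therefore penalized equally by IoU regardless of box size, which is the property the abstract links to a bias toward smaller boxes.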
Pages: 10