A dual-balanced network for long-tail distribution object detection

Authors
Gong, Huiyun [1]
Li, Yeguang [2]
Dong, Jian [1, 3]
Affiliations
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Changchun Univ Technol, Sch Econ & Management, Jilin, Peoples R China
[3] China Elect Standardizat Inst, Beijing, Peoples R China
Keywords
computer vision; learning (artificial intelligence); object detection; SMOTE
DOI
10.1049/cvi2.12182
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Object detection on datasets with imbalanced (i.e. long-tail) distributions is a challenging task. Existing re-balancing solutions, such as re-weighting and re-sampling, have two main disadvantages. First, they rely on a coarse-grained global threshold to suppress some of the most influential categories while overlooking locally influential categories. Second, very few studies have designed algorithms specifically for object detection under long-tail distributions. To address these two issues, a dual-balanced network for fine-grained re-balancing in object detection is proposed. The re-balancing strategies act on both the proposals and the classification logic, corresponding to two sub-networks: the Balance Region Proposal Network (BRPN) and the Balance Classification Network (BCN). The BRPN sub-network equalises the number of background and foreground proposals by reducing the sampling probability of simple backgrounds, and the BCN sub-network equalises the logic between head and tail categories by globally suppressing negative gradients and locally fixing over-suppressed negative gradients. In addition, the authors devise a balance binary cross entropy loss to jointly re-balance the entire network. This design can be generalised to different two-stage object detection frameworks. The experimental mAP of 26.40% on the LVIS-v0.5 dataset outperforms most state-of-the-art (SOTA) methods.
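For orientation, the coarse-grained, global-threshold re-weighting that the abstract criticises can be sketched as follows. This is a minimal illustration and not the paper's balance binary cross entropy loss: the function names, the hard frequency threshold, and the 0/1 weighting rule are all assumptions made for the example.

```python
import math


def sigmoid(x):
    """Logistic function mapping a raw score to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def negative_weights(class_counts, threshold):
    """Hypothetical global rule: a class with fewer than `threshold`
    training instances gets negative-term weight 0, all others get 1."""
    return [0.0 if count < threshold else 1.0 for count in class_counts]


def weighted_bce(scores, targets, neg_weights):
    """Binary cross-entropy summed over classes, with a per-class weight
    applied only to the negative (target == 0) terms."""
    loss = 0.0
    for score, target, weight in zip(scores, targets, neg_weights):
        prob = sigmoid(score)
        if target == 1:
            loss += -math.log(prob)
        else:
            # A weight of 0 removes the negative gradient for this class.
            loss += -weight * math.log(1.0 - prob)
    return loss


# Toy example: class 0 is a head class, class 1 is a tail class.
counts = [1000, 5]
weights = negative_weights(counts, threshold=100)
suppressed = weighted_bce([0.0, 0.0], [1, 0], weights)
unsuppressed = weighted_bce([0.0, 0.0], [1, 0], [1.0, 1.0])
```

In this toy case the tail class contributes no negative term, so `suppressed` is smaller than `unsuppressed`. A single global threshold treats every rare class identically, which is the limitation the abstract attributes to existing re-balancing strategies.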
Pages: 565-575
Number of pages: 11