Gaussian Combined Distance: A Generic Metric for Object Detection

被引:0
|
作者
Guan, Ziqian [1 ]
Fu, Xieyi [1 ]
Huang, Pengjun [1 ]
Zhang, Hengyuan [1 ]
Du, Hubin [1 ]
Liu, Yongtao [1 ]
Wang, Yinglin [2 ]
Ma, Qang [2 ]
机构
[1] North China Inst Sci & Technol, Key Lab Special Robots Safety Prod & Emergency Dis, Langfang 065201, Peoples R China
[2] Hegang Ind Technol Serv Co Ltd, Langfang 065008, Peoples R China
关键词
Measurement; Object detection; Feature extraction; Optimization; Detectors; Geoscience and remote sensing; Accuracy; Training; Sensitivity; Convergence; Generic metric; tiny object detection;
D O I
10.1109/LGRS.2025.3531970
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In object detection, a well-defined similarity metric can significantly enhance the model performance. Currently, the intersection over union (IoU)-based similarity metric is the most commonly preferred choice for detectors. However, detectors using IoU as a similarity metric often perform poorly when detecting small objects because of their sensitivity to minor positional deviations. To address this issue, recent studies have proposed the Wasserstein distance (WD) as an alternative to IoU for measuring the similarity of Gaussian-distributed bounding boxes. However, we have observed that the WD lacks scale invariance, which negatively impacts the model's generalization capability. In addition, when used as a loss function, its independent optimization of the center attributes leads to slow model convergence and unsatisfactory detection precision. To address these challenges, we introduce the Gaussian Combined Distance (GCD). Through analytical examination of GCD and its gradient, we demonstrate that GCD not only possesses scale invariance but also facilitates joint optimization, which enhances model localization performance. Extensive experiments on the AI-TOD-v2 dataset for tiny object detection show that GCD, as a bounding box regression loss function and label assignment metric, achieves state-of-the-art (SOTA) performance across various detectors. We further validated the generalizability of GCD on the MS-COCO-2017 and Visdrone-2019 datasets, where it outperforms the WD across diverse scales of datasets. The code is available at: https://github.com/MArKkwanGuan/mmdet-GCD.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss
    Yang, Xue
    Yan, Junchi
    Ming, Qi
    Wang, Wentao
    Zhang, Xiaopeng
    Tian, Qi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [2] Gaussian guided IoU: A better metric for balanced learning on object detection
    Gou, Lijun
    Wu, Shengkai
    Yang, Jinrong
    Yu, Hangcheng
    Li, Xiaoping
    IET COMPUTER VISION, 2022, 16 (06) : 556 - 566
  • [3] Distance metric-based learning for long-tail object detection
    Shao, Mingwen
    Peng, Zilu
    IMAGE AND VISION COMPUTING, 2024, 142
  • [4] Regionlets for Generic Object Detection
    Wang, Xiaoyu
    Yang, Ming
    Zhu, Shenghuo
    Lin, Yuanqing
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 17 - 24
  • [5] Regionlets for Generic Object Detection
    Wang, Xiaoyu
    Yang, Ming
    Zhu, Shenghuo
    Lin, Yuanqing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (10) : 2071 - 2084
  • [6] Online Distance Metric Learning for Object Tracking
    Tsagkatakis, Grigorios
    Savakis, Andreas
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (12) : 1810 - 1821
  • [7] Learning distance metric for object contour tracking
    Wu, Yuwei
    Ma, Bo
    PATTERN ANALYSIS AND APPLICATIONS, 2014, 17 (02) : 265 - 277
  • [8] Learning distance metric for object contour tracking
    Yuwei Wu
    Bo Ma
    Pattern Analysis and Applications, 2014, 17 : 265 - 277
  • [9] Enhancing rotated object detection via anisotropic Gaussian bounding box and Bhattacharyya distance
    Thai, Chien
    Trang, Mai Xuan
    Ninh, Huong
    Ly, Hoang Hiep
    Le, Anh Son
    NEUROCOMPUTING, 2025, 623
  • [10] 3D object detection algorithm fusing dense connectivity and Gaussian distance
    Cheng, Xin
    Liu, Sheng-Xian
    Zhou, Jing-Mei
    Zhou, Zhou
    Zhao, Xiang-Mo
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2024, 54 (12): : 3589 - 3600