Gaussian Combined Distance: A Generic Metric for Object Detection

Cited by: 0
Authors
Guan, Ziqian [1]
Fu, Xieyi [1]
Huang, Pengjun [1]
Zhang, Hengyuan [1]
Du, Hubin [1]
Liu, Yongtao [1]
Wang, Yinglin [2]
Ma, Qang [2]
Affiliations
[1] North China Inst Sci & Technol, Key Lab Special Robots Safety Prod & Emergency Dis, Langfang 065201, Peoples R China
[2] Hegang Ind Technol Serv Co Ltd, Langfang 065008, Peoples R China
Keywords
Measurement; Object detection; Feature extraction; Optimization; Detectors; Geoscience and remote sensing; Accuracy; Training; Sensitivity; Convergence; Generic metric; Tiny object detection
DOI
10.1109/LGRS.2025.3531970
Chinese Library Classification
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification codes
0708; 070902
Abstract
In object detection, a well-defined similarity metric can significantly enhance model performance. Currently, the intersection over union (IoU)-based similarity metric is the most commonly preferred choice for detectors. However, detectors using IoU as a similarity metric often perform poorly when detecting small objects, because IoU for small objects is sensitive to minor positional deviations. To address this issue, recent studies have proposed the Wasserstein distance (WD) as an alternative to IoU for measuring the similarity of Gaussian-distributed bounding boxes. However, we have observed that the WD lacks scale invariance, which negatively impacts the model's generalization capability. In addition, when used as a loss function, its independent optimization of the center attributes leads to slow model convergence and unsatisfactory detection precision. To address these challenges, we introduce the Gaussian Combined Distance (GCD). Through analytical examination of GCD and its gradient, we demonstrate that GCD not only possesses scale invariance but also facilitates joint optimization, which enhances model localization performance. Extensive experiments on the AI-TOD-v2 dataset for tiny object detection show that GCD, as a bounding box regression loss function and label assignment metric, achieves state-of-the-art (SOTA) performance across various detectors. We further validated the generalizability of GCD on the MS-COCO-2017 and VisDrone-2019 datasets, where it outperforms the WD across datasets of diverse scales. The code is available at: https://github.com/MArKkwanGuan/mmdet-GCD.
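The scale-invariance point in the abstract can be illustrated with a small sketch. It assumes the Gaussian box model commonly used in the Wasserstein-distance literature, where a box (cx, cy, w, h) becomes a Gaussian with mean (cx, cy) and covariance diag(w^2/4, h^2/4); that convention is an assumption here and is not stated in this record. The function names are hypothetical, and the GCD formula itself is not given in the abstract, so it is not reproduced.

```python
def wasserstein2_sq(box_a, box_b):
    """Squared 2-Wasserstein distance between two axis-aligned boxes.

    Assumption: each box (cx, cy, w, h) is modelled as a Gaussian
    N((cx, cy), diag(w^2 / 4, h^2 / 4)). For such diagonal Gaussians the
    distance reduces to a squared Euclidean distance on (cx, cy, w/2, h/2).
    """
    cxa, cya, wa, ha = box_a
    cxb, cyb, wb, hb = box_b
    return ((cxa - cxb) ** 2 + (cya - cyb) ** 2
            + ((wa - wb) / 2) ** 2 + ((ha - hb) / 2) ** 2)


def iou(box_a, box_b):
    """Intersection over union of two (cx, cy, w, h) boxes."""
    cxa, cya, wa, ha = box_a
    cxb, cyb, wb, hb = box_b
    # Overlap extent along each axis, clamped at zero for disjoint boxes.
    iw = max(0.0, min(cxa + wa / 2, cxb + wb / 2) - max(cxa - wa / 2, cxb - wb / 2))
    ih = max(0.0, min(cya + ha / 2, cyb + hb / 2) - max(cya - ha / 2, cyb - hb / 2))
    inter = iw * ih
    union = wa * ha + wb * hb - inter
    return inter / union


def scale(box, s):
    """Scale a box uniformly, as if the whole image were resized by s."""
    return tuple(s * v for v in box)


# A 4x4 box and a copy shifted by one pixel, then the same pair at 8x scale.
a, b = (10.0, 10.0, 4.0, 4.0), (11.0, 10.0, 4.0, 4.0)
a8, b8 = scale(a, 8), scale(b, 8)

print(wasserstein2_sq(a, b), wasserstein2_sq(a8, b8))
print(iou(a, b), iou(a8, b8))
```

Under this model, resizing both boxes by a factor of 8 multiplies the squared distance by 64 (from 1 to 64), while IoU stays at 0.6 for both pairs. The same geometric configuration therefore receives a different WD score at each object scale, which is the behaviour the abstract describes as a lack of scale invariance.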
Pages: 5
Related papers
50 in total
  • [21] YOLO-G Abandoned Object Detection Method Combined with Gaussian Mixture Model and GhostNet
    Lin D.
    Zhou Z.
    Guo B.
    Min W.
    Han Q.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (01): 99 - 107
  • [22] Object Detection and Distance Measurement in Teleoperation
    Zhang, Ailing
    Chu, Meng
    Chen, Zixin
    Zhou, Fuqiang
    Gao, Shuo
    MACHINES, 2022, 10 (05)
  • [23] Neighborhood sampling confidence metric for object detection
    Christophe Gouguenheim
    Ahmad Berjaoui
    AI and Ethics, 2024, 4 (1): 57 - 64
  • [24] VmAP: A Fair Metric for Video Object Detection
    Sobti, Anupam
    Mavi, Vaibhav
    Balakrishnan, M.
    Arora, Chetan
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021: 2224 - 2232
  • [25] The optimal distance measure for object detection
    Mahamud, S
    Hebert, M
    2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2003: 248 - 255
  • [26] An analytic distance metric for Gaussian mixture models with application in image retrieval
    Sfikas, G
    Constantinopoulos, C
    Likas, A
    Galatsanos, NP
    ARTIFICIAL NEURAL NETWORKS: FORMAL MODELS AND THEIR APPLICATIONS - ICANN 2005, PT 2, PROCEEDINGS, 2005, 3697: 835 - 840
  • [27] Gaussian Mixture Background for Salient Object Detection
    Su, Zhuo
    Zheng, Hong
    Song, Guorui
    PROCEEDINGS OF THE 10TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2017: 165 - 170
  • [28] Rotated Object Detection with Circular Gaussian Distribution
    Xu, Hang
    Liu, Xinyuan
    Ma, Yike
    Zhu, Zunjie
    Wang, Shuai
    Yan, Chenggang
    Dai, Feng
    ELECTRONICS, 2023, 12 (15)
  • [29] Edge Detection Method of Gaussian Block Distance
    Jia, Di
    Xia, Cheng-long
    Sun, Jin-guang
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015: 3049 - 3053
  • [30] Moving object detection based on Gaussian pyramid
    Tu, Lifen
    Zhong, Sidong
    Peng, Qi
    Mei, Tiancan
    Zhongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Central South University (Science and Technology), 2013, 44 (07): 2778 - 2786