Gaussian Combined Distance: A Generic Metric for Object Detection

Cited by: 0
Authors
Guan, Ziqian [1]
Fu, Xieyi [1]
Huang, Pengjun [1]
Zhang, Hengyuan [1]
Du, Hubin [1]
Liu, Yongtao [1]
Wang, Yinglin [2]
Ma, Qang [2]
Affiliations
[1] North China Inst Sci & Technol, Key Lab Special Robots Safety Prod & Emergency Dis, Langfang 065201, Peoples R China
[2] Hegang Ind Technol Serv Co Ltd, Langfang 065008, Peoples R China
Keywords
Measurement; Object detection; Feature extraction; Optimization; Detectors; Geoscience and remote sensing; Accuracy; Training; Sensitivity; Convergence; Generic metric; Tiny object detection
DOI
10.1109/LGRS.2025.3531970
Chinese Library Classification (CLC) number
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification codes
0708; 070902
Abstract
In object detection, a well-defined similarity metric can significantly enhance model performance. Currently, the intersection over union (IoU)-based similarity metric is the most commonly preferred choice for detectors. However, detectors that use IoU as a similarity metric often perform poorly on small objects, because IoU is highly sensitive to minor positional deviations. To address this issue, recent studies have proposed the Wasserstein distance (WD) as an alternative to IoU for measuring the similarity of bounding boxes modeled as Gaussian distributions. However, we have observed that the WD lacks scale invariance, which negatively impacts the model's generalization capability. In addition, when the WD is used as a loss function, its independent optimization of the center attributes leads to slow model convergence and unsatisfactory detection precision. To address these challenges, we introduce the Gaussian Combined Distance (GCD). Through analytical examination of GCD and its gradient, we demonstrate that GCD not only possesses scale invariance but also facilitates joint optimization, which enhances model localization performance. Extensive experiments on the AI-TOD-v2 dataset for tiny object detection show that GCD, as a bounding box regression loss function and label assignment metric, achieves state-of-the-art (SOTA) performance across various detectors. We further validated the generalizability of GCD on the MS-COCO-2017 and VisDrone-2019 datasets, where it outperforms the WD across diverse scales of datasets. The code is available at: https://github.com/MArKkwanGuan/mmdet-GCD.
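The two observations the abstract builds on (IoU is sensitive to small shifts of tiny boxes, and the WD is not scale invariant) can be checked numerically. The sketch below is illustrative only and is not taken from the authors' repository. It assumes the convention common in the WD literature of modeling a box (cx, cy, w, h) as a Gaussian with mean (cx, cy) and covariance diag(w^2/4, h^2/4). The helper names `iou`, `wasserstein2_sq`, and `scale` are hypothetical. The GCD formula is not stated in the abstract and is not reproduced here.

```python
import math


def iou(a, b):
    """IoU of two axis-aligned boxes given as (cx, cy, w, h)."""
    ax1, ay1 = a[0] - a[2] / 2, a[1] - a[3] / 2
    ax2, ay2 = a[0] + a[2] / 2, a[1] + a[3] / 2
    bx1, by1 = b[0] - b[2] / 2, b[1] - b[3] / 2
    bx2, by2 = b[0] + b[2] / 2, b[1] + b[3] / 2
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union


def wasserstein2_sq(a, b):
    """Squared 2-Wasserstein distance between the Gaussians of two boxes.

    Assumption: box (cx, cy, w, h) -> N((cx, cy), diag(w^2/4, h^2/4)).
    For such axis-aligned Gaussians the distance has this closed form.
    """
    return (
        (a[0] - b[0]) ** 2
        + (a[1] - b[1]) ** 2
        + ((a[2] - b[2]) / 2) ** 2
        + ((a[3] - b[3]) / 2) ** 2
    )


def scale(box, s):
    """Scale the whole scene (box center and size) by a factor s."""
    return tuple(s * v for v in box)


# 1) The same 2-pixel shift hurts a tiny box far more than a large one.
tiny_iou = iou((32, 32, 8, 8), (34, 32, 8, 8))
large_iou = iou((32, 32, 64, 64), (34, 32, 64, 64))
print(f"IoU after 2 px shift: 8x8 box {tiny_iou:.3f}, 64x64 box {large_iou:.3f}")

# 2) Scaling both boxes by 4 leaves IoU unchanged but multiplies W2^2 by 16.
a, b = (10, 10, 8, 8), (12, 10, 8, 8)
a4, b4 = scale(a, 4), scale(b, 4)
print(f"IoU: original {iou(a, b):.3f}, scaled {iou(a4, b4):.3f}")
print(f"W2^2: original {wasserstein2_sq(a, b):.1f}, scaled {wasserstein2_sq(a4, b4):.1f}")
```

Under these assumptions, the squared WD grows with the square of the scale factor while IoU stays fixed, which is the lack of scale invariance the abstract refers to.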
Pages: 5