MSO-DETR: Metric space optimization for few-shot object detection

被引:0
|
作者
Sima, Haifeng [1 ,2 ]
Wang, Manyang [1 ]
Liu, Lanlan [3 ]
Zhang, Yudong [1 ,4 ]
Sun, Junding [1 ]
机构
[1] Henan Polytech Univ, Sch Comp Sci & Technol, Jiaozuo, Peoples R China
[2] Henan Polytech Univ, Inst Quantitat Remote Sensing & Smart Agr, Jiaozuo, Peoples R China
[3] Henan Polytech Univ, Fac Arts & Law, Jiaozuo, Peoples R China
[4] Univ Leicester, Sch Comp & Math Sci, Leicester, England
基金
中国国家自然科学基金;
关键词
computer vision; deep learning; machine learning;
D O I
10.1049/cit2.12342
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the metric-based meta-learning detection model, the distribution of training samples in the metric space has great influence on the detection performance, and this influence is usually ignored by traditional meta-detectors. In addition, the design of metric space might be interfered with by the background noise of training samples. To tackle these issues, we propose a metric space optimisation method based on hyperbolic geometry attention and class-agnostic activation maps. First, the geometric properties of hyperbolic spaces to establish a structured metric space are used. A variety of feature samples of different classes are embedded into the hyperbolic space with extremely low distortion. This metric space is more suitable for representing tree-like structures between categories for image scene analysis. Meanwhile, a novel similarity measure function based on Poincar & eacute; distance is proposed to evaluate the distance of various types of objects in the feature space. In addition, the class-agnostic activation maps (CCAMs) are employed to re-calibrate the weight of foreground feature information and suppress background information. Finally, the decoder processes the high-level feature information as the decoding of the query object and detects objects by predicting their locations and corresponding task encodings. Experimental evaluation is conducted on Pascal VOC and MS COCO datasets. The experiment results show that the effectiveness of the authors' method surpasses the performance baseline of the excellent few-shot detection models.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Weakly Supervised Few-Shot Object Detection With DETR
    Zhang, Chenbo
    Zhang, Yinglu
    Zhang, Lu
    Zhao, Jiajia
    Guan, Jihong
    Zhou, Shuigeng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7015 - 7023
  • [2] Few-Shot Object Detection via Metric Learning
    Zhu Min
    Zhang Chongyang
    FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021), 2022, 12084
  • [3] Feature reconstruction and metric based network for few-shot object detection
    Li, Yuewen
    Feng, Wenquan
    Lyu, Shuchang
    Zhao, Qi
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 227
  • [4] Few-Shot Object Detection: A Survey
    Antonelli, Simone
    Avola, Danilo
    Cinque, Luigi
    Crisostomi, Donato
    Foresti, Gian Luca
    Galasso, Fabio
    Marini, Marco Raoul
    Mecca, Alessio
    Pannone, Daniele
    ACM COMPUTING SURVEYS, 2022, 54 (11S)
  • [5] Few-Shot Video Object Detection
    Fan, Qi
    Tang, Chi-Keung
    Tai, Yu-Wing
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 76 - 98
  • [6] Few-Shot Object Counting and Detection
    Thanh Nguyen
    Chau Pham
    Khoi Nguyen
    Minh Hoai
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 348 - 365
  • [7] Few-Shot Object Detection of drones
    Zou Weibao
    Liu Xindi
    Yang Jitao
    Qu Wei
    INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 1030 - 1034
  • [8] Decoupled Metric Network for Single-Stage Few-Shot Object Detection
    Lu, Yue
    Chen, Xingyu
    Wu, Zhengxing
    Yu, Junzhi
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (01) : 514 - 525
  • [9] Sampling-invariant fully metric learning for few-shot object detection
    Leng, Jiaxu
    Chen, Taiyue
    Gao, Xinbo
    Mo, Mengjingcheng
    Yu, Yongtao
    Zhang, Yan
    NEUROCOMPUTING, 2022, 511 : 54 - 66
  • [10] Incremental-DETR: Incremental Few-Shot Object Detection via Self-Supervised Learning
    Dong, Na
    Zhang, Yongqiang
    Ding, Mingli
    Lee, Gim Hee
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 543 - 551