Long-Tailed Metrics and Object Detection in Camera Trap Datasets

被引:2
|
作者
He, Wentong [1 ,2 ]
Luo, Ze [1 ]
Tong, Xinyu [1 ,2 ]
Hu, Xiaoyi [1 ,2 ]
Chen, Can [1 ]
Shu, Zufei [3 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Guangdong Chebaling Natl Nat Reserve, Shaoguan 512528, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 10期
关键词
camera trap; long-tailed metrics; class imbalance; object/box-level scale imbalance; deep learning; object detection; sample relationship; IMAGES;
D O I
10.3390/app13106029
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
With their advantages in wildlife surveys and biodiversity monitoring, camera traps are widely used, and have been used to gather massive amounts of animal images and videos. The application of deep learning techniques has greatly promoted the analysis and utilization of camera trap data in biodiversity management and conservation. However, the long-tailed distribution of the camera trap dataset can degrade the deep learning performance. In this study, for the first time, we quantified the long-tailedness of class and object/box-level scale imbalance of camera trap datasets. In the camera trap dataset, the imbalance problem is prevalent and severe, in terms of class and object/box-level scale. The camera trap dataset has worse object/box-level scale imbalance, and too few samples of small objects, making deep learning more challenging. Furthermore, we used the BatchFormer module to exploit sample relationships, and improved the performance of the general object detection model, DINO, by up to 2.9% and up to 3.3% in terms of class imbalance and object/box-level scale imbalance. The experimental results showed that the sample relationship was simple and effective, improving detection performance in terms of class and object/box-level scale imbalance, but that it could not make up for the low number of small objects in the camera trap dataset.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Exploring Classification Equilibrium in Long-Tailed Object Detection
    Feng, Chengjian
    Zhong, Yujie
    Huang, Weilin
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3397 - 3406
  • [2] Long-Tailed Object Detection for Multimodal Remote Sensing Images
    Yang, Jiaxin
    Yu, Miaomiao
    Li, Shuohao
    Zhang, Jun
    Hu, Shengze
    REMOTE SENSING, 2023, 15 (18)
  • [3] On Model Calibration for Long-Tailed Object Detection and Instance Segmentation
    Pan, Tai-Yu
    Zhang, Cheng
    Li, Yandong
    Hu, Hexiang
    Xuan, Dong
    Changpinyo, Soravit
    Gong, Boqing
    Chao, Wei-Lun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] Semi-Supervised and Long-Tailed Object Detection with CascadeMatch
    Zang, Yuhang
    Zhou, Kaiyang
    Huang, Chen
    Loy, Chen Change
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (04) : 987 - 1001
  • [5] Balanced Classification: A Unified Framework for Long-Tailed Object Detection
    Qi, Tianhao
    Xie, Hongtao
    Li, Pandeng
    Ge, Jiannan
    Zhang, Yongdong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3088 - 3101
  • [6] Equalized Focal Loss for Dense Long-Tailed Object Detection
    Li, Bo
    Yao, Yongqiang
    Tan, Jingru
    Zhang, Gang
    Yu, Fengwei
    Lu, Jianwei
    Luo, Ye
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6980 - 6989
  • [7] Adaptive Hierarchical Representation Learning for Long-Tailed Object Detection
    Li, Banghuai
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2303 - 2312
  • [8] Semi-Supervised and Long-Tailed Object Detection with CascadeMatch
    Yuhang Zang
    Kaiyang Zhou
    Chen Huang
    Chen Change Loy
    International Journal of Computer Vision, 2023, 131 : 987 - 1001
  • [9] Equalization Loss for Long-Tailed Object Recognition
    Tan, Jingru
    Wang, Changbao
    Li, Buyu
    Li, Quanquan
    Ouyang, Wanli
    Yin, Changqing
    Yan, Junjie
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 11659 - 11668
  • [10] MOSAICOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection
    Zhang, Cheng
    Pan, Tai-Yu
    Li, Yandong
    Hu, Hexiang
    Xuan, Dong
    Changpinyo, Soravit
    Gong, Boqing
    Chao, Wei-Lun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 407 - 417