Long-Tailed Metrics and Object Detection in Camera Trap Datasets

被引:2
|
作者
He, Wentong [1 ,2 ]
Luo, Ze [1 ]
Tong, Xinyu [1 ,2 ]
Hu, Xiaoyi [1 ,2 ]
Chen, Can [1 ]
Shu, Zufei [3 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Guangdong Chebaling Natl Nat Reserve, Shaoguan 512528, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 10期
关键词
camera trap; long-tailed metrics; class imbalance; object/box-level scale imbalance; deep learning; object detection; sample relationship; IMAGES;
D O I
10.3390/app13106029
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
With their advantages in wildlife surveys and biodiversity monitoring, camera traps are widely used, and have been used to gather massive amounts of animal images and videos. The application of deep learning techniques has greatly promoted the analysis and utilization of camera trap data in biodiversity management and conservation. However, the long-tailed distribution of the camera trap dataset can degrade the deep learning performance. In this study, for the first time, we quantified the long-tailedness of class and object/box-level scale imbalance of camera trap datasets. In the camera trap dataset, the imbalance problem is prevalent and severe, in terms of class and object/box-level scale. The camera trap dataset has worse object/box-level scale imbalance, and too few samples of small objects, making deep learning more challenging. Furthermore, we used the BatchFormer module to exploit sample relationships, and improved the performance of the general object detection model, DINO, by up to 2.9% and up to 3.3% in terms of class imbalance and object/box-level scale imbalance. The experimental results showed that the sample relationship was simple and effective, improving detection performance in terms of class and object/box-level scale imbalance, but that it could not make up for the low number of small objects in the camera trap dataset.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Long-tailed object detection of kitchen waste with class-instance balanced detector
    LeYuan Fang
    Qi Tang
    LiHan Ouyang
    JunWu Yu
    JiaXing Lin
    ShuaiYu Ding
    Lin Tang
    Science China Technological Sciences, 2023, 66 : 2361 - 2372
  • [22] Learning with Free Object Segments for Long-Tailed Instance Segmentation
    Zhang, Cheng
    Pan, Tai-Yu
    Chen, Tianle
    Zhong, Jike
    Fu, Wenjin
    Chao, Wei-Lun
    COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 655 - 672
  • [23] Understanding of and reasoning about object-object relationships in long-tailed macaques?
    Schloegl, Christian
    Waldmann, Michael R.
    Fischer, Julia
    ANIMAL COGNITION, 2013, 16 (03) : 493 - 507
  • [24] The long-tailed rat
    Gold, AG
    ASIAN FOLKLORE STUDIES, 2004, 63 (02): : 243 - 265
  • [25] LONG-TAILED PAIR
    SCROGGIE, MG
    WIRELESS WORLD, 1968, 74 (1396): : 369 - &
  • [26] Towards Long-Tailed 3D Detection
    Peri, Neehar
    Dave, Achal
    Ramanan, Deva
    Kong, Shu
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1904 - 1915
  • [27] Long-Tailed Anomaly Detection with Learnable Class Names
    Ho, Chih-Hui
    Peng, Kuan-Chuan
    Vasconcelos, Nuno
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 12435 - 12446
  • [28] A Review of Video Object Detection: Datasets, Metrics and Methods
    Zhu, Haidi
    Wei, Haoran
    Li, Baoqing
    Yuan, Xiaobing
    Kehtarnavaz, Nasser
    APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 24
  • [29] TEMPORAL FLOW MASK ATTENTION FOR OPEN-SET LONG-TAILED RECOGNITION OF WILD ANIMALS IN CAMERA-TRAP IMAGES
    Kim, Jeongsoo
    Woo, Sangmin
    Park, Byeongjun
    Kim, Changick
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2152 - 2156
  • [30] Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation
    Wu, Jialian
    Song, Liangchen
    Wang, Tiancai
    Zhang, Qian
    Yuan, Junsong
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1570 - 1578