Long-Tailed Metrics and Object Detection in Camera Trap Datasets

Cited by: 2
Authors
He, Wentong [1 ,2 ]
Luo, Ze [1 ]
Tong, Xinyu [1 ,2 ]
Hu, Xiaoyi [1 ,2 ]
Chen, Can [1 ]
Shu, Zufei [3 ]
Affiliations
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Guangdong Chebaling Natl Nat Reserve, Shaoguan 512528, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 10
Keywords
camera trap; long-tailed metrics; class imbalance; object/box-level scale imbalance; deep learning; object detection; sample relationship; IMAGES;
DOI
10.3390/app13106029
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Camera traps are widely used in wildlife surveys and biodiversity monitoring, and have gathered massive amounts of animal images and videos. Deep learning techniques have greatly promoted the analysis and use of camera trap data in biodiversity management and conservation. However, the long-tailed distribution of camera trap datasets can degrade deep learning performance. In this study, for the first time, we quantified the long-tailedness of class imbalance and object/box-level scale imbalance in camera trap datasets. In these datasets, imbalance is prevalent and severe in terms of both class and object/box-level scale. Camera trap datasets have worse object/box-level scale imbalance and too few samples of small objects, which makes deep learning more challenging. Furthermore, we used the BatchFormer module to exploit sample relationships, and improved the performance of the general object detection model, DINO, by up to 2.9% and up to 3.3% in terms of class imbalance and object/box-level scale imbalance, respectively. The experimental results showed that exploiting sample relationships was simple and effective, improving detection performance under both class and object/box-level scale imbalance, but that it could not make up for the low number of small objects in camera trap datasets.
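The abstract does not say which long-tailedness metrics the authors used. As a rough illustration only, the sketch below computes two common measures of class imbalance (the imbalance factor and the Gini coefficient) and bins boxes by scale. The choice of these two metrics, the COCO-style area thresholds (32² and 96² pixels), the function names, and the sample annotations are all assumptions made for this example; none of them come from the paper.

```python
from collections import Counter


def imbalance_factor(counts):
    """Ratio of the largest to the smallest count (1.0 = balanced)."""
    return max(counts) / min(counts)


def gini(counts):
    """Gini coefficient of non-negative counts (0.0 = perfectly balanced)."""
    xs = sorted(counts)
    n = len(xs)
    total = sum(xs)
    # Rank-weighted sum over the ascending-sorted counts.
    weighted = sum((i + 1) * x for i, x in enumerate(xs))
    return (2 * weighted) / (n * total) - (n + 1) / n


def scale_bin(width, height):
    """Assumed COCO-style size bin based on box area in pixels."""
    area = width * height
    if area < 32 ** 2:
        return "small"
    if area < 96 ** 2:
        return "medium"
    return "large"


# Hypothetical annotations: (class label, box width, box height).
boxes = [
    ("deer", 200, 150),
    ("deer", 180, 120),
    ("deer", 90, 60),
    ("boar", 40, 30),
    ("pheasant", 20, 15),
]

class_counts = Counter(label for label, _, _ in boxes)
scale_counts = Counter(scale_bin(w, h) for _, w, h in boxes)

print("class imbalance factor:", imbalance_factor(class_counts.values()))
print("class Gini:", round(gini(class_counts.values()), 3))
print("boxes per scale bin:", dict(scale_counts))
```

The same two functions can be applied to `scale_counts.values()` to get a comparable number for object/box-level scale imbalance, which makes the two kinds of imbalance directly comparable on one dataset.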
Pages: 14