Animal Pose Tracking: 3D Multimodal Dataset and Token-based Pose Optimization

被引:5
|
作者
Patel, Mahir [1 ]
Gu, Yiwen [1 ]
Carstensen, Lucas C. [1 ]
Hasselmo, Michael E. [2 ]
Betke, Margrit [1 ,2 ]
机构
[1] Boston Univ, Dept Comp Sci, 111 Cummington St, Boston, MA 02215 USA
[2] Boston Univ, Ctr Syst Neurosci, Boston, MA 02215 USA
关键词
Animal video dataset; Pose estimation; Tracking; Optimization; Thermal infrared; Multimodal;
D O I
10.1007/s11263-022-01714-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate tracking of the 3D pose of animals from video recordings is critical for many behavioral studies, yet there is a dearth of publicly available datasets that the computer vision community could use for model development. We here introduce the Rodent3D dataset that records animals exploring their environment and/or interacting with each other with multiple cameras and modalities (RGB, depth, thermal infrared). Rodent3D consists of 200 min of multimodal video recordings from up to three thermal and three RGB-D synchronized cameras (approximately 4 million frames). For the task of optimizing estimates of pose sequences provided by existing pose estimation methods, we provide a baseline model called OptiPose. While deep-learned attention mechanisms have been used for pose estimation in the past, with OptiPose, we propose a different way by representing 3D poses as tokens for which deep-learned context models pay attention to both spatial and temporal keypoint patterns. Our experiments show how OptiPose is highly robust to noise and occlusion and can be used to optimize pose sequences provided by state-of-the-art models for animal pose estimation.
引用
收藏
页码:514 / 530
页数:17
相关论文
共 50 条
  • [1] Animal Pose Tracking: 3D Multimodal Dataset and Token-based Pose Optimization
    Mahir Patel
    Yiwen Gu
    Lucas C. Carstensen
    Michael E. Hasselmo
    Margrit Betke
    [J]. International Journal of Computer Vision, 2023, 131 : 514 - 530
  • [2] Animal3D: A Comprehensive Dataset of 3D Animal Pose and Shape
    Xu, Jiacong
    Zhang, Yi
    Peng, Jiawei
    Ma, Wufei
    Jesslen, Artur
    Ji, Pengliang
    Hu, Qixin
    Zhang, Jiehua
    Liu, Qihao
    Wang, Jiahao
    Ji, Wei
    Wang, Chen
    Yuan, Xiaoding
    Kaushik, Prakhar
    Zhang, Guofeng
    Liu, Jie
    Xie, Yushan
    Cui, Yawen
    Yuille, Alan
    Kortylewski, Adam
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9065 - 9075
  • [3] Multimodal human motion dataset of 3D anatomical landmarks and pose keypoints
    Ruescas-Nicolau, Ana Virginia
    Medina-Ripoll, Enrique Jose
    Bernabe, Eduardo Parrilla
    Martinez, Helios de Rosario
    [J]. DATA IN BRIEF, 2024, 53
  • [4] Animal Pose Estimation Based on 3D Priors
    Dai, Xiaowei
    Li, Shuiwang
    Zhao, Qijun
    Yang, Hongyu
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [5] 3D Head Pose and Gaze Tracking and Their Application to Diverse Multimodal Tasks
    Mora, Kenneth Alberto Funes
    [J]. ICMI'13: PROCEEDINGS OF THE 2013 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2013, : 345 - 348
  • [6] 3D Human Pose Tracking Based on Depth Camera and Dynamic Programming Optimization
    Lie, Wen-Nung
    Shiu, Hung-Wei
    Huang, Chieh
    [J]. 2012 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 2012), 2012, : 1863 - 1866
  • [7] Depth-based 3D Hand Pose Tracking
    Quach, Kha Gia
    Chi Nhan Duong
    Luu, Khoa
    Bui, Tien D.
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2746 - 2751
  • [8] The Monocular Model-based 3D Pose Tracking
    Tong, Guofeng
    Liu, Ran
    Li, Hairong
    [J]. PROCEEDINGS OF THE 2012 24TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2012, : 980 - 985
  • [9] U3PT: A New Dataset for Unconstrained 3D Pose Tracking Evaluation
    Tran, Ngoc-Trung
    Ababsa, Fakhreddine
    Charbit, Maurice
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2015, PT I, 2015, 9256 : 642 - 653
  • [10] Camera Pose Optimization for 3D Mapping
    Lluvia, Iker
    Lazkano, Elena
    Ansuategi, Ander
    [J]. IEEE ACCESS, 2023, 11 : 9122 - 9135