Transformer-based rapid human pose estimation network

被引:6
|
作者
Wang, Dong [1 ]
Xie, Wenjun [2 ,3 ]
Cai, Youcheng [1 ]
Li, Xinjie [1 ]
Liu, Xiaoping [1 ]
机构
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
[2] Hefei Univ Technol, Sch Software, Hefei 230009, Peoples R China
[3] Hefei Univ Technol, Anhui Prov Key Lab Ind Safety & Emergency Technol, Hefei 230601, Peoples R China
来源
COMPUTERS & GRAPHICS-UK | 2023年 / 116卷
关键词
Transformer architecture; Human pose estimation; Inference speed; Computational cost; ACTION RECOGNITION; SKELETON;
D O I
10.1016/j.cag.2023.09.001
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Most current human pose estimation methods pursue excellent performance via large models and intensive computational requirements, resulting in slower models. These methods cannot be effectively adopted for human pose estimation in real applications due to their high memory and computational costs. To achieve a trade-off between accuracy and efficiency, we propose TRPose, a Transformer-based network for human pose estimation rapidly. TRPose consists of an early convolutional stage and a later Transformer stage seamlessly. Concretely, the convolutional stage forms a Rapid Fusion Module (RFM), which efficiently acquires multi-scale features via three parallel convolution branches. The Transformer stage utilizes multi-resolution Transformers to construct a Dual scale Encoder Module (DEM), aiming at learning long-range dependencies from different scale features of the whole human skeletal keypoints. The experiments show that TRPose acquires 74.3 AP and 73.8 AP on COCO validation and testdev datasets with 170+ FPS on a GTX2080Ti, which achieves the better efficiency and effectiveness tradeoffs than most state-of-the-art methods. Our model also outperforms mainstream Transformer-based architectures on MPII dataset, yielding 89.9 PCK@0.5 score on val set without extra data. (c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页码:317 / 326
页数:10
相关论文
共 50 条
  • [31] Human Pose Estimation Based on Deep Neural Network
    Zhu, Lingfei
    Wan, Wanggen
    2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 90 - 96
  • [32] Multilingual Transformer-Based Personality Traits Estimation
    Leonardi, Simone
    Monti, Diego
    Rizzo, Giuseppe
    Morisio, Maurizio
    INFORMATION, 2020, 11 (04)
  • [33] Gated Region-Refine pose transformer for human pose estimation
    Wang, Tianfeng
    Zhang, Xiaoxu
    NEUROCOMPUTING, 2023, 530 : 37 - 47
  • [34] Combination of Deep Learner Network and Transformer for 3D Human Pose Estimation
    Tien-Dat Tran
    Xuan-Thuy Vo
    Duy-Linh Nguyen
    Jo, Kang-Hyun
    2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 174 - 178
  • [35] CTHPose: An Efficient and Effective CNN-Transformer Hybrid Network for Human Pose Estimation
    Chen, Danya
    Wu, Lijun
    Chen, Zhicong
    Lin, Xufeng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT V, 2024, 14429 : 327 - 339
  • [36] A transformer-based adversarial network framework for steganography
    Xiao, Chaoen
    Peng, Sirui
    Zhang, Lei
    Wang, Jianxin
    Ding, Ding
    Zhang, Jianyi
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 269
  • [37] A TRANSFORMER-BASED SIAMESE NETWORK FOR CHANGE DETECTION
    Bandara, Wele Gedara Chaminda
    Patel, Vishal M.
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 207 - 210
  • [38] Transformer-based Point Cloud Generation Network
    Xu, Rui
    Hui, Le
    Han, Yuehui
    Qian, Jianjun
    Xie, Jin
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4169 - 4177
  • [39] A Transformer-Based Network for Hyperspectral Object Tracking
    Gao, Long
    Chen, Langkun
    Liu, Pan
    Jiang, Yan
    Xie, Weiying
    Li, Yunsong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [40] Privacy Protection in Transformer-based Neural Network
    Lang, Jiaqi
    Li, Linjing
    Chen, Weiyun
    Zeng, Daniel
    2019 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2019, : 182 - 184