MAPoseNet: Animal pose estimation network via multi-scale convolutional attention

被引:0
|
作者
Liu, Sicong [1 ]
Fan, Qingcheng [1 ]
Li, Shuqin [1 ]
Zhao, Chunjiang [1 ,2 ]
机构
[1] Northwest A&F Univ, Coll Informat Engn, 3 Taicheng Rd, Yangling 712100, Peoples R China
[2] Beijing Acad Agr & Forestry Sci, Res Ctr Informat Technol, Beijing 100097, Peoples R China
关键词
Animal pose estimation; Attention mechanism; Asymmetric convolution; Feature pyramid; IDENTIFICATION;
D O I
10.1016/j.jvcir.2023.103989
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Animal pose estimation serves as an upstream task for recognizing and understanding animal behavior. Over the last year, the accuracy of the deep learning-based method has steadily improved, but at the expense of the model's inference speed. This paper uses an efficient and powerful model to improve inference speed and accuracy. The classic encoder-decoder architecture is chosen. For estimating animal pose, our model based on a feature pyramid and a multi-scale asymmetric convolution attention mechanism is developed and named MAPoseNet (Animal Pose Estimation Network Via Multi-scale Convolutional Attention). MAPoseNet consists of an encoder and a decoder. Rather than typical self-attention, the encoder's attention mechanism comprises multi-scale, asymmetric convolutions that are lightweight and instrumental in improving inference speed. A feature pyramid and a feature balance module make up the decoder. The public dataset AP-10K is used to train and test MAPoseNet. A series of experimental results demonstrate that the MAPoseNet model provides cutting-edge performance. MAPoseNet outperforms HRFormer by 1.3 AP and 0.8 AR, with 33.7% fewer FLOPs and 66% faster inference speed. And our model surpasses HRNet and HRFormer on the Animal Pose dataset as well. Our model has achieved a win-win situation regarding inference speed and accuracy.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Head Pose Estimation Based on Multi-Scale Convolutional Neural Network
    Liang Lingyu
    Zhang Tiantian
    He Wei
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (13)
  • [2] Multi-scale Attention Aided Multi-Resolution Network for Human Pose Estimation
    Selvam, Srinika
    Mishra, Deepak
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 461 - 472
  • [3] Hand pose estimation with multi-scale network
    Zhongxu Hu
    Youmin Hu
    Bo Wu
    Jie Liu
    Dongmin Han
    Thomas Kurfess
    [J]. Applied Intelligence, 2018, 48 : 2501 - 2515
  • [4] Hand pose estimation with multi-scale network
    Hu, Zhongxu
    Hu, Youmin
    Wu, Bo
    Liu, Jie
    Han, Dongmin
    Kurfess, Thomas
    [J]. APPLIED INTELLIGENCE, 2018, 48 (08) : 2501 - 2515
  • [5] Age Estimation by Multi-scale Convolutional Network
    Yi, Dong
    Lei, Zhen
    Li, Stan Z.
    [J]. COMPUTER VISION - ACCV 2014, PT III, 2015, 9005 : 144 - 158
  • [6] Multi-Scale Collaborative Network for Human Pose Estimation
    Guo, Chunsheng
    Zhou, Jialuo
    Du, Wenlong
    Zhang, Xuguang
    [J]. INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2019, 16 (04)
  • [7] MULTI-SCALE SUPERVISED NETWORK FOR HUMAN POSE ESTIMATION
    Ke, Lipeng
    Chang, Ming-Ching
    Qi, Honggang
    Lyu, Siwei
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 564 - 568
  • [8] Human Pose Estimation Based on Lightweight Multi-Scale Coordinate Attention
    Li, Xin
    Guo, Yuxin
    Pan, Weiguo
    Liu, Hongzhe
    Xu, Bingxin
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (06):
  • [9] A lightweight pose estimation network with multi-scale receptive field
    Li, Shuo
    Dai, Ju
    Chen, Zhangmeng
    Pan, Junjun
    [J]. VISUAL COMPUTER, 2023, 39 (08): : 3429 - 3440
  • [10] A lightweight pose estimation network with multi-scale receptive field
    Shuo Li
    Ju Dai
    Zhangmeng Chen
    Junjun Pan
    [J]. The Visual Computer, 2023, 39 : 3429 - 3440