Vision Transformer-based pilot pose estimation

被引:0
|
作者
Wu, Honglan [1 ]
Liu, Hao [1 ]
Sun, Youchao [1 ]
机构
[1] College of Civil Aviation, Nanjing University of Aeronautics and Astronautics, Nanjing,211106, China
关键词
Convolutional neural networks;
D O I
10.13700/j.bh.1001-5965.2022.0811
中图分类号
学科分类号
摘要
Human pose estimation is an important aspect in the field of behavioral perception and a key technology in the way of intelligent interaction in the cockpit of civil aircraft. To establish an explainable link between the complex lighting environment in the cockpit of civil aircraft and the performance of the pilot pose estimation model, the visual Transformer-based pilot pose (ViTPPose) estimation model is proposed. In order to capture the global correlation of subsequent higher-order features while expanding the perceptual field, this model employs a two-branch Transformer module with several coding layers at the end of the convolutional neural networks (CNN)backbone network. The coding layers combine the Transformer and the dilated convolution. Based on the flight crew’s standard operating procedures, a pilot maneuvering behavior keypoint detection dataset is established for flight simulation scenarios. ViTPPose estimation model completes the pilot seating estimation on this dataset and verifies its validity by comparing it with the benchmark model. The seating estimation heatmap is created in the context of the cockpit’s complicated lighting to examine the model’s preferred lighting intensity, evaluate the ViTPPose estimation model’s performance under various lighting conditions, and highlight the model’s reliance on various lighting intensities. © 2024 Beijing University of Aeronautics and Astronautics (BUAA). All rights reserved.
引用
收藏
页码:3100 / 3110
相关论文
共 50 条
  • [11] Transformer-Based Parameter Estimation in Statistics
    Yin, Xiaoxin
    Yin, David S.
    [J]. MATHEMATICS, 2024, 12 (07)
  • [12] A Transformer-based multi-modal fusion network for 6D pose estimation
    Hong, Jia-Xin
    Zhang, Hong-Bo
    Liu, Jing-Hua
    Lei, Qing
    Yang, Li-Jie
    Du, Ji-Xiang
    [J]. INFORMATION FUSION, 2024, 105
  • [13] ViT-rPPG: a vision transformer-based network for remote heart rate estimation
    Sun, Wei
    Sun, Qing
    Sun, Hong-Mei
    Sun, Qi
    Jia, Rui-Sheng
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (02)
  • [14] ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
    Xu, Yufei
    Zhang, Jing
    Zhang, Qiming
    Tao, Dacheng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [15] YOLOPose V2: Understanding and improving transformer-based 6D pose estimation
    Periyasamy, Arul Selvam
    Amini, Arash
    Tsaturyan, Vladimir
    Behnke, Sven
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 168
  • [16] Multi-hypothesis representation learning for transformer-based 3D human pose estimation
    Li, Wenhao
    Liu, Hong
    Tang, Hao
    Wang, Pichao
    [J]. PATTERN RECOGNITION, 2023, 141
  • [17] Vision Transformer-based overlay processor for Edge Computing
    Liu, Fang
    Fan, Zimeng
    Hu, Wei
    Xu, Dian
    Peng, Min
    He, Jing
    He, Yanxiang
    [J]. APPLIED SOFT COMPUTING, 2024, 156
  • [18] Vision Transformer-based recognition of diabetic retinopathy grade
    Wu, Jianfang
    Hu, Ruo
    Xiao, Zhenghong
    Chen, Jiaxu
    Liu, Jingwei
    [J]. MEDICAL PHYSICS, 2021, 48 (12) : 7850 - 7863
  • [19] Strawberry disease identification with vision transformer-based models
    Nguyen, Hai Thanh
    Tran, Tri Dac
    Nguyen, Thanh Tuong
    Pham, Nhi Minh
    Nguyen Ly, Phuc Hoang
    Luong, Huong Hoang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (29) : 73101 - 73126
  • [20] Multilingual Transformer-Based Personality Traits Estimation
    Leonardi, Simone
    Monti, Diego
    Rizzo, Giuseppe
    Morisio, Maurizio
    [J]. INFORMATION, 2020, 11 (04)