Vision Transformer-based pilot pose estimation

被引：0

作者：

Wu, Honglan ^{[1
]}

Liu, Hao ^{[1
]}

Sun, Youchao ^{[1
]}

机构：

[1] College of Civil Aviation, Nanjing University of Aeronautics and Astronautics, Nanjing,211106, China

来源：

Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics | 2024年 / 50卷 / 10期

关键词：

Convolutional neural networks;

D O I：

10.13700/j.bh.1001-5965.2022.0811

中图分类号：

学科分类号：

摘要：

Human pose estimation is an important aspect in the field of behavioral perception and a key technology in the way of intelligent interaction in the cockpit of civil aircraft. To establish an explainable link between the complex lighting environment in the cockpit of civil aircraft and the performance of the pilot pose estimation model, the visual Transformer-based pilot pose (ViTPPose) estimation model is proposed. In order to capture the global correlation of subsequent higher-order features while expanding the perceptual field, this model employs a two-branch Transformer module with several coding layers at the end of the convolutional neural networks （CNN）backbone network. The coding layers combine the Transformer and the dilated convolution. Based on the flight crew’s standard operating procedures, a pilot maneuvering behavior keypoint detection dataset is established for flight simulation scenarios. ViTPPose estimation model completes the pilot seating estimation on this dataset and verifies its validity by comparing it with the benchmark model. The seating estimation heatmap is created in the context of the cockpit’s complicated lighting to examine the model’s preferred lighting intensity, evaluate the ViTPPose estimation model’s performance under various lighting conditions, and highlight the model’s reliance on various lighting intensities. © 2024 Beijing University of Aeronautics and Astronautics (BUAA). All rights reserved.

引用

页码：3100 / 3110

共 50 条

[11] Transformer-Based Parameter Estimation in Statistics
Yin, Xiaoxin
Yin, David S.
[J]. MATHEMATICS, 2024, 12 (07)
[12] A Transformer-based multi-modal fusion network for 6D pose estimation
Hong, Jia-Xin
Zhang, Hong-Bo
Liu, Jing-Hua
Lei, Qing
Yang, Li-Jie
Du, Ji-Xiang
[J]. INFORMATION FUSION, 2024, 105
[13] ViT-rPPG: a vision transformer-based network for remote heart rate estimation
Sun, Wei
Sun, Qing
Sun, Hong-Mei
Sun, Qi
Jia, Rui-Sheng
[J]. JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (02)
[14] ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Xu, Yufei
Zhang, Jing
Zhang, Qiming
Tao, Dacheng
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[15] YOLOPose V2: Understanding and improving transformer-based 6D pose estimation
Periyasamy, Arul Selvam
Amini, Arash
Tsaturyan, Vladimir
Behnke, Sven
[J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 168
[16] Multi-hypothesis representation learning for transformer-based 3D human pose estimation
Li, Wenhao
Liu, Hong
Tang, Hao
Wang, Pichao
[J]. PATTERN RECOGNITION, 2023, 141
[17] Vision Transformer-based overlay processor for Edge Computing
Liu, Fang
Fan, Zimeng
Hu, Wei
Xu, Dian
Peng, Min
He, Jing
He, Yanxiang
[J]. APPLIED SOFT COMPUTING, 2024, 156
[18] Vision Transformer-based recognition of diabetic retinopathy grade
Wu, Jianfang
Hu, Ruo
Xiao, Zhenghong
Chen, Jiaxu
Liu, Jingwei
[J]. MEDICAL PHYSICS, 2021, 48 (12) : 7850 - 7863
[19] Strawberry disease identification with vision transformer-based models
Nguyen, Hai Thanh
Tran, Tri Dac
Nguyen, Thanh Tuong
Pham, Nhi Minh
Nguyen Ly, Phuc Hoang
Luong, Huong Hoang
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (29) : 73101 - 73126
[20] Multilingual Transformer-Based Personality Traits Estimation
Leonardi, Simone
Monti, Diego
Rizzo, Giuseppe
Morisio, Maurizio
[J]. INFORMATION, 2020, 11 (04)

← 1 2 3 4 5 →