Attentive Fusion for Efficient Wrist-Worn Gesture Recognition Based on Dual-View Cameras

被引:0
|
作者
Quan, Pengyuan [1 ]
Mao, Zihao [1 ]
Chen, Nenglun [1 ]
Zhang, Yang [2 ]
Zhang, Kao [1 ]
Pan, Zhigeng [1 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Nanjing 210044, Peoples R China
[2] Hubei Univ Technol, Sch Mech Engn, Wuhan 430068, Peoples R China
基金
中国国家自然科学基金;
关键词
Cameras; Gesture recognition; Feature extraction; Sensors; Lighting; Attention mechanisms; Intelligent sensors; Wearable devices; Visualization; Wearable sensors; Attention mechanism; gesture recognition; lightweight; wrist-worn cameras;
D O I
10.1109/JSEN.2024.3500576
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Wearable hand gesture recognition has attracted considerable attention in the field of smart sensors, human-computer interaction, etc. In this article, we propose a portable device for gesture recognition based on dual-view wrist-worn cameras. Our device is easy to wear and will not impede hand movement. The main challenges we face are the sever occlusion and diverse lighting conditions, which significantly hinder the accuracy of gesture recognition. To mitigate these issues, we propose a novel framework based on attentive feature fusion. The overall framework mainly consists of three components, including an adaptive feature amplification module, an attentive feature enhancing module, and a cross-view attention module. Our intuition is that the detailed features from palm and dorsal hand regions can complement each other and can be regarded as strong cues for inferring hand gestures. Moreover, by adaptively adjusting the brightness of the images, the proposed method can be adaptable to diverse lighting conditions. Our method is efficient and quite effective. By comparing our work with other state-of-the-art methods, we achieve superior performance under various experiment configurations. Extensive experimental results validate that our proposed framework surpasses existing state-of-the-art works by a significant margin.
引用
收藏
页码:2008 / 2018
页数:11
相关论文
共 45 条
  • [41] UWB Radar Traffic Gesture Recognition Based on Range-Doppler Dual-Channel Fusion Visual Transformer Network
    Xiong, Ziqin
    Zhang, Jiaxuan
    Yin, Jiacheng
    Xiong, Gang
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, ICDSP 2024, 2024, : 1 - 7
  • [42] An efficient speech emotion recognition based on a dual-stream CNN-transformer fusion network
    Tellai M.
    Gao L.
    Mao Q.
    International Journal of Speech Technology, 2023, 26 (02) : 541 - 557
  • [43] Robust Hand Gesture Recognition Using a Deformable Dual-Stream Fusion Network Based on CNN-TCN for FMCW Radar
    Zhu, Meiyi
    Zhang, Chaoyi
    Wang, Jianquan
    Sun, Lei
    Fu, Meixia
    SENSORS, 2023, 23 (20)
  • [44] Gesture recognition using dual-stream CNN based on fusion of sEMG energy kernel phase portrait and IMU amplitude image
    Xu, Liukai
    Zhang, Keqin
    Yang, Genke
    Chu, Jian
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 73
  • [45] Dual-Polarization SAR Ship Target Recognition Based on Mini Hourglass Region Extraction and Dual-Channel Efficient Fusion Network
    Xiong, Gang
    Xi, Yunlong
    Chen, Di
    Yu, Wenxian
    IEEE ACCESS, 2021, 9 : 29078 - 29089