UniFi: A Unified Framework for Generalizable Gesture Recognition with Wi-Fi Signals Using Consistency-guided Multi-View Networks

Cited by: 3
Authors
Liu, Yan [1]
Yu, Anlan [1]
Wang, Leye [1]
Guo, Bin [2]
Li, Yang [1]
Yi, Enze [1]
Zhang, Daqing [1, 3, 4]
Affiliations
[1] Peking Univ, Sch Comp Sci, Key Lab High Confidence Software Technol, Minist Educ, Beijing, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
[3] Telecom SudParis, Evry, France
[4] Inst Polytech Paris, Evry, France
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
Wireless Sensing; Channel State Information (CSI); Gesture Recognition; Deep Learning
DOI
10.1145/3631429
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In recent years, considerable effort has been devoted to exploring Wi-Fi-based sensing technologies by modeling the intricate mapping between received signals and corresponding human activities. However, the inherent complexity of Wi-Fi signals poses significant challenges for practical applications due to their pronounced susceptibility to deployment environments. To address this challenge, we delve into the distinctive characteristics of Wi-Fi signals and distill three pivotal factors that can be leveraged to enhance the generalization capabilities of deep learning-based Wi-Fi sensing models: 1) effectively capturing valuable input to mitigate the adverse impact of noisy measurements; 2) adaptively fusing complementary information from multiple Wi-Fi devices to boost the distinguishability of signal patterns associated with different activities; 3) extracting generalizable features that can overcome the inconsistent representations of activities under different environmental conditions (e.g., locations, orientations). Leveraging these insights, we design a novel and unified sensing framework based on Wi-Fi signals, dubbed UniFi, and use gesture recognition as an application to demonstrate its effectiveness. UniFi achieves robust and generalizable gesture recognition in real-world scenarios by extracting discriminative and consistent features unrelated to environmental factors from pre-denoised signals collected by multiple transceivers. To achieve this, we first introduce an effective signal preprocessing approach that captures the applicable input data from noisy received signals for the deep learning model. Second, we propose a multi-view deep network based on spatio-temporal cross-view attention that integrates multi-carrier and multi-device signals to extract distinguishable information. Finally, we use mutual information maximization as a regularizer to learn environment-invariant representations via a contrastive loss, without requiring access to any signals from unseen environments for practical adaptation. Extensive experiments on the Widar 3.0 dataset demonstrate that our proposed framework significantly outperforms state-of-the-art approaches in different settings (99% and 90%-98% accuracy for in-domain and cross-domain recognition, respectively, without additional data collection and model training).
Pages: 29
Related papers
4 in total
  • [1] Robust gesture recognition method toward intelligent environment using Wi-Fi signals
    Ding, Xue
    Yu, Xiao
    Zhong, Yi
    Xie, Weiliang
    Cai, Bowen
    You, Minglei
    Jiang, Ting
    MEASUREMENT, 2024, 231
  • [2] A New Method of Human Gesture Recognition Using Wi-Fi Signals Based on XGBoost
    Ding, Xue
    Jiang, Ting
    Xue, Wenling
    Li, Zhiwei
    Zhong, Yi
    2020 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC WORKSHOPS), 2020: 237-241
  • [3] A New Method of Dynamic Gesture Recognition Using Wi-Fi Signals Based on Adaboost
    Ding, Xue
    Jiang, Ting
    Zou, WeiXia
    2017 17TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2017
  • [4] Sign Language Recognition Using Two-Stream Convolutional Neural Networks with Wi-Fi Signals
    Lee, Chien-Cheng
    Gao, Zhongjian
    APPLIED SCIENCES-BASEL, 2020, 10(24): 1-13