RNN-Based Room Scale Hand Motion Tracking

Cited by: 49
Authors
Mao, Wenguang [1]
Wang, Mei [1]
Sun, Wei [1]
Qiu, Lili [1]
Pradhan, Swadhin [1]
Chen, Yi-Chao [1]
Affiliations
[1] Univ Texas Austin, Austin, TX 78712 USA
Keywords
Acoustic motion tracking; recurrent neural network; MUSIC
DOI
10.1145/3300061.3345439
Chinese Library Classification (CLC) number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
Smart speakers allow users to interact with home appliances using voice commands and are becoming increasingly popular. While a voice-based interface is intuitive, it is insufficient in many scenarios, such as in noisy or quiet environments, for users with language barriers, or in applications that require continuous motion tracking. Motion-based control is attractive and complementary to existing voice-based control. However, accurate and reliable room-scale motion tracking poses a significant challenge due to low SNR, interference, and varying mobility. To this end, we develop a novel recurrent neural network (RNN) based system that uses speakers and microphones to realize accurate room-scale tracking. Our system jointly estimates the propagation distance and angle-of-arrival (AoA) of signals reflected by the hand, based on AoA-distance profiles generated by 2D MUSIC. We design a series of techniques to significantly enhance the profile quality under low SNR. We feed the recent history of profiles to our RNN to estimate the distance and AoA. In this way, we can exploit the temporal structure among consecutive profiles to remove the impact of noise, interference, and mobility. Through extensive evaluation, we show that our system achieves 1.2-3.7 cm error within a 4.5 m range, supports tracking multiple users, and is robust against ambient sound. To our knowledge, this is the first acoustic device-free room-scale tracking system.
Pages: 16