CubeLearn: End-to-End Learning for Human Motion Recognition From Raw mmWave Radar Signals

被引:20
|
作者
Zhao, Peijun [1 ,2 ]
Lu, Chris Xiaoxuan [3 ]
Wang, Bing [1 ,4 ]
Trigoni, Niki [1 ]
Markham, Andrew [1 ]
机构
[1] Univ Oxford, Dept Comp Sci, Oxford OX1 3BW, England
[2] MIT, Dept Mech Engn, Cambridge, MA 02139 USA
[3] Univ Edinburgh, Dept Informat, Edinburgh EH8 9AB, Scotland
[4] Hong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hong Kong, Peoples R China
关键词
Doppler radar; Millimeter wave communication; Discrete Fourier transforms; Radar applications; Chirp; Neural networks; Convolutional neural networks; End-to-end neural network; mmWave radar; motion recognition;
D O I
10.1109/JIOT.2023.3237494
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
mmWave FMCW radar has attracted a huge amount of research interest for human-centered applications in recent years, such as human gesture and activity recognition. Most existing pipelines are built upon conventional discrete Fourier transform (DFT) preprocessing and deep neural network classifier hybrid methods, with a majority of previous works focusing on designing the downstream classifier to improve overall accuracy. In this work, we take a step back and look at the preprocessing module. To avoid the drawbacks of conventional DFT preprocessing, we propose a complex-weighted learnable preprocessing module, named CubeLearn, to directly extract features from raw radar signal and build an end-to-end deep neural network for mmWave FMCW radar motion recognition applications. Extensive experiments show that our CubeLearn module consistently improves the classification accuracies of different pipelines, especially, benefiting those simpler models, which are more likely to be used on edge devices due to their computational efficiency. We provide ablation studies on initialization methods and structure of the proposed module, as well as an evaluation of the running time on PC and edge devices. This work also serves as a comparison of different approaches toward data cube slicing. Through our task-agnostic design, we propose a first step toward a generic end-to-end solution for radar recognition problems.
引用
收藏
页码:10236 / 10249
页数:14
相关论文
共 50 条
  • [21] Deep End-to-End Representation Learning for Food Type Recognition from Speech
    Sertolli, Benjamin
    Cummins, Nicholas
    Sengur, Abdulkadir
    Schuller, Bjorn W.
    ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018, : 574 - 578
  • [22] Traffic Signal Recognition Using End-to-End Deep Learning
    Sarker, Tonmoy
    Meng, Xiangyu
    TRAN-SET 2022, 2022, : 182 - 191
  • [23] Continual Learning for Monolingual End-to-End Automatic Speech Recognition
    Vander Eeckt, Steven
    Van Hamme, Hugo
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 459 - 463
  • [24] End-to-End Audiovisual Speech Recognition System With Multitask Learning
    Tao, Fei
    Busso, Carlos
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1 - 11
  • [25] End-to-End Speech Recognition Sequence Training With Reinforcement Learning
    Tjandra, Andros
    Sakti, Sakriani
    Nakamura, Satoshi
    IEEE ACCESS, 2019, 7 : 79758 - 79769
  • [26] End-to-End Automatic Speech Recognition with Deep Mutual Learning
    Masumura, Ryo
    Ihori, Mana
    Takashima, Akihiko
    Tanaka, Tomohiro
    Ashihara, Takanori
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 632 - 637
  • [27] An End-to-End Deep Learning Framework for Wideband Signal Recognition
    Vagollari, Adela
    Hirschbeck, Martin
    Gerstacker, Wolfgang
    IEEE ACCESS, 2023, 11 : 52899 - 52922
  • [28] End-to-end Convolutional Sequence Learning for ASL Fingerspelling Recognition
    Papadimitriou, Katerina
    Potamianos, Gerasimos
    INTERSPEECH 2019, 2019, : 2315 - 2319
  • [29] Arabic speech recognition using end-to-end deep learning
    Alsayadi, Hamzah A.
    Abdelhamid, Abdelaziz A.
    Hegazy, Islam
    Fayed, Zaki T.
    IET SIGNAL PROCESSING, 2021, 15 (08) : 521 - 534
  • [30] Investigation of Transfer Learning for End-to-End Russian Speech Recognition
    Kipyatkova, Irina
    SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 349 - 357