A user-friendly headset for radar-based silent speech recognition

Cited by: 1
Authors
Digehsara, Pouriya Amini [1]
de Menezes, Joao Vitor Possamai [1]
Wagner, Christoph [1]
Baerhold, Michael [2]
Schaffer, Petr [2]
Plettemeier, Dirk [2]
Birkholz, Peter [1]
Affiliations
[1] Tech Univ Dresden, Inst Acoust & Speech Commun, Dresden, Germany
[2] Tech Univ Dresden, Inst Commun Technol, Dresden, Germany
Keywords
silent speech interfaces; wearable headset; BiLSTM; radar imaging; speech-related biosignals;
DOI
10.21437/Interspeech.2022-10090
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Silent speech interfaces allow speech communication to take place in the absence of the acoustic speech signal. Radar-based sensing with radio antennas on the speaker's face can be used as a non-invasive modality to measure speech articulation in such applications. One of the major challenges with this approach is the variability between sessions, which arises mainly from repositioning the antennas on the speaker's face. To reduce the impact of this factor, we developed a wearable headset that can be 3D-printed with flexible materials and weighs only about 69 g. For evaluation, a radar-based word recognition experiment was performed in which five speakers recorded a speech corpus in multiple sessions, alternately with the headset and with double-sided tape to place the antennas on the face. Using a bidirectional long short-term memory network for classification, average intersession word accuracies of 76.50% and 68.18% were obtained with the headset and the tape, respectively. This indicates that the antenna (re)positioning accuracy with the headset is not worse than that with the double-sided tape, while the headset provides other benefits.
Pages: 4835 - 4839
Number of pages: 5