A user-friendly headset for radar-based silent speech recognition

被引:1
|
作者
Digehsara, Pouriya Amini [1 ]
de Menezes, Joao Vitor Possamai [1 ]
Wagner, Christoph [1 ]
Baerhold, Michael [2 ]
Schaffer, Petr [2 ]
Plettemeier, Dirk [2 ]
Birkholz, Peter [1 ]
机构
[1] Tech Univ Dresden, Inst Acoust & Speech Commun, Dresden, Germany
[2] Tech Univ Dresden, Inst Commun Technol, Dresden, Germany
来源
关键词
silent speech interfaces; wearable headset; BiLSTM; radar imaging; speech-related biosignals;
D O I
10.21437/Interspeech.2022-10090
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Silent speech interfaces allow speech communication to take place in the absence of the acoustic speech signal. Radar-based sensing with radio antennas on the speakers' face can be used as a non-invasive modality to measure speech articulation in such applications. One of the major challenges with this approach is the variability between different sessions, mainly due to the repositioning of the antennas on the face of the speaker. In order to reduce the impact of this influencing factor, we developed a wearable headset that can be 3D-printed with flexible materials and weighs only about 69 g. For evaluation, a radar-based word recognition experiment was performed, where five speakers recorded a speech corpus in multiple sessions, alternatively with the headset and with double-sided tape to place the antennas on the face. By using a bidirectional long short-term memory network for classification, an average intersession word accuracy of 76.50% and 68.18% was obtained using the headset and the tape, respectively. This indicates that the antenna (re-) positioning accuracy with the headset is not worse than that with the double-sided tape while providing other benefits.
引用
收藏
页码:4835 / 4839
页数:5
相关论文
共 50 条
  • [1] RaSSpeR: Radar-based Silent Speech Recognition
    Ferreira, David
    Silva, Samuel
    Curado, Francisco
    Teixeira, Antonio
    INTERSPEECH 2021, 2021, : 646 - 650
  • [2] IR-UWB Radar-Based Contactless Silent Speech Recognition of Vowels, Consonants, Words, and Phrases
    Lee, Sunghwa
    Shin, Younghoon
    Kim, Myungjong
    Seo, Jiwon
    IEEE ACCESS, 2023, 11 : 144844 - 144859
  • [3] RADAR SIMULATION AND USER-FRIENDLY - ARE THEY MUTUALLY EXCLUSIVE
    ALLEN, J
    BOREK, S
    PECHEWLYS, DA
    PROCEEDINGS OF THE 1989 SUMMER COMPUTER SIMULATION CONFERENCE, 1989, : 222 - 226
  • [4] Radar-based Feature Design and Multiclass Classification for Road User Recognition
    Scheiner, Nicolas
    Appenrodt, Nils
    Dickmann, Juergen
    Sick, Bernhard
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 779 - 786
  • [5] IS IT POSSIBLE TO MAKE PIXEL-BASED RADAR IMAGE CLASSIFICATION USER-FRIENDLY?
    Pisani, R.
    Riedel, P.
    Gomes, A.
    Mizobe, R.
    Papa, J.
    2011 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2011, : 4304 - 4307
  • [6] User-Friendly LabVIEW GUI for Prosthetic Hand Control using Emotiv EEG Headset
    Abu Kasim, Mohamad Amlie
    Low, Cheng Yee
    Ayub, Muhammad Azmi
    Zakaria, Noor Ayuni Che
    Salleh, Muhammad Haszerul Mohd
    Johar, Khairunnisa
    Hamli, Hizzul
    2016 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTICS AND INTELLIGENT SENSORS (IRIS 2016), 2017, 105 : 276 - 281
  • [7] Doppler Radar-Based Human Speech Recognition Using Mobile Vision Transformer
    Li, Wei
    Geng, Yongfu
    Gao, Yang
    Ding, Qining
    Li, Dandan
    Liu, Nanqi
    Chen, Jinheng
    ELECTRONICS, 2023, 12 (13)
  • [8] A User-Friendly Interface for Fingerprint Recognition Systems based on Natural Language Processing
    Conti, V.
    Militello, C.
    Sorbello, F.
    Vitabile, S.
    CISIS: 2009 INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, VOLS 1 AND 2, 2009, : 736 - +
  • [9] A USER-FRIENDLY ARCHITECTURE BASED ON CORPORATE CONFIDENCE
    Suh, Eulho
    Chanjoong, Kim
    SPACE, 2014, (554): : 34 - 41
  • [10] USER-FRIENDLY BIOMETRIC CAMERA FOR SPEEDING IRIS RECOGNITION SYSTEMS
    Lorenz, Michael G.
    Mengibar-Pozo, Luis
    Liu-Jimenez, Judith
    Fernandez-Saavedra, Belen
    42ND ANNUAL 2008 IEEE INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY, PROCEEDINGS, 2008, : 241 - 246