Silent Speech Interface Using Ultrasonic Doppler Sonar

被引：4

作者：

Lee, Ki-Seung ^{[1
]}

机构：

[1] Konkuk Univ, Dept Elect Engn, Seoul 143701, South Korea

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2020年 / E103D卷 / 08期

关键词：

silent speech interface; ultrasonic Doppler; deep neural networks; RECOGNITION; SENSOR;

D O I：

10.1587/transinf.2019EDP7211

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Some non-acoustic modalities have the ability to reveal certain speech attributes that can be used for synthesizing speech signals without acoustic signals. This study validated the use of ultrasonic Doppler frequency shifts caused by facial movements to implement a silent speech interface system. A 40kHz ultrasonic beam is incident to a speaker's mouth region. The features derived from the demodulated received signals were used to estimate the speech parameters. A nonlinear regression approach was employed in this estimation where the relationship between ultrasonic features and corresponding speech is represented by deep neural networks (DNN). In this study, we investigated the discrepancies between the ultrasonic signals of audible and silent speech to validate the possibility for totally silent communication. Since reference speech signals are not available in silently mouthed ultrasonic signals, a nearest-neighbor search and alignment method was proposed, wherein alignment was achieved by determining the optimal pair of ultrasonic and audible features in the sense of a minimum mean square error criterion. The experimental results showed that the performance of the ultrasonic Doppler-based method was superior to that of EMG-based speech estimation, and was comparable to an image-based method.

引用

页码：1875 / 1887

页数：13

共 50 条

[21] Development of a Silent Speech Interface for Augmented Reality Applications
Walck, Christine
Rivas, Tania
Flanagan, Riley
Fornito, Michael
[J]. COMPUTER METHODS, IMAGING AND VISUALIZATION IN BIOMECHANICS AND BIOMEDICAL ENGINEERING II, 2023, 38 : 208 - 214
[22] Representation Learning of Tongue Dynamics for a Silent Speech Interface
Wang, Hongcui
Roussel, Pierre
Denby, Bruce
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (12): : 2209 - 2217
[23] Silent Speech Interface Design Methodology and Case Study
LI Wenshi
[J]. Chinese Journal of Electronics, 2016, 25 (01) : 88 - 92
[24] Ultrasound-Based Silent Speech Interface Using Convolutional and Recurrent Neural Networks
Moliner Juanpere, Eloi
Csapo, Tamas Gabor
[J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2019, 105 (04) : 587 - 590
[25] DOPPLER SONAR
不详
[J]. NAVAL ENGINEERS JOURNAL, 1969, 81 (06) : 111 - &
[26] Measurement of fish velocity using Doppler sonar
Zedel, L
Knutsen, T
[J]. OCEANS 2000 MTS/IEEE - WHERE MARINE SCIENCE AND TECHNOLOGY MEET, VOLS 1-3, CONFERENCE PROCEEDINGS, 2000, : 1951 - 1956
[27] DNN-based Ultrasound-to-Speech Conversion for a Silent Speech Interface
Csapo, Temas Gabor
Grosz, Tamas
Gosztolya, Gabor
Toth, Laszlo
Marko, Alexandra
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3672 - 3676
[28] Estimating speech parameters for ultrasonic Doppler signal using LSTM recurrent neural networks
Joo, Hyeong-Kil
Lee, Ki-Seung
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (04): : 433 - 441
[29] Silent vs Vocalized Articulation for a Portable Ultrasound-Based Silent Speech Interface
Florescu, Victoria-M
Crevier-Buchman, Lise
Denby, Bruce
Hueber, Thomas
Colazo-Simon, Antonia
Pillot-Loiseau, Claire
Roussel, Pierre
Gendrot, Cedric
Quattrocchi, Sophie
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 450 - +
[30] Ultrasound-Based Silent Speech Interface using Sequential Convolutional Auto-encoder
Xu, Kele
Wu, Yuxiang
Gao, Zhifeng
[J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2194 - 2195

← 1 2 3 4 5 →