Statistical Mapping between Articulatory and Acoustic Data for an Ultrasound-based Silent Speech Interface

被引：0

作者：

Hueber, Thomas ^{[1
]}

Benaroya, Elie-Laurent ^{[2
]}

Denby, Bruce ^{[2
,3
]}

Chollet, Gerard ^{[4
]}

机构：

[1] UMR 5216 CNRS INP UJF U Stendhal, GIPSA Lab, Grenoble, France

[2] ESPCI Paristech, Sigma Lab, Paris, France

[3] Univ Paris 06, Paris, France

[4] Telecom ParisTech, LTCY CNRS, Paris, France

来源：

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年

关键词：

silent speech interface; GMM; HMM; ultrasound; video; multimodal; statistical mapping;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents recent developments on our "silent speech interface" that converts tongue and lip motions, captured by ultrasound and video imaging, into audible speech. In our previous studies, the mapping between the observed articulatory movements and the resulting speech sound was achieved using a unit selection approach. We investigate here the use of statistical mapping techniques, based on the joint modeling of visual and spectral features, using respectively Gaussian Mixture Models (GMM) and Hidden Markov Models (HMM). The prediction of the voiced/unvoiced parameter from visual articulatory data is also investigated using an artificial neural network (ANN). A continuous speech database consisting of one-hour of high-speed ultrasound and video sequences was specifically recorded to evaluate the proposed mapping techniques.

引用

页码：600 / +

页数：2

共 50 条

[1] Ultrasound-based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis
Csapo, Tamas Gabor
Zainko, Csaba
Toth, Laszlo
Gosztolya, Gabor
Marko, Alexandra
[J]. INTERSPEECH 2020, 2020, : 2727 - 2731
[2] Autoencoder-Based Articulatory-to-Acoustic Mapping for Ultrasound Silent Speech Interfaces
Gosztolya, Gabor
Pinter, Adam
Toth, Laszlo
Grosz, Tamas
Marko, Alexandra
Csapo, Tamas Gabor
[J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[3] Ultrasound-based Silent Speech Interface Built on a Continuous Vocoder
Csapo, Tamas Gabor
Al-Radhi, Mohammed Salah
Nemeth, Geza
Gosztolya, Gabor
Grosz, Tamas
Toth, Laszlo
Marko, Alexandra
[J]. INTERSPEECH 2019, 2019, : 894 - 898
[4] Eigentongue feature extraction for an ultrasound-based silent speech interface
Hueber, T.
Aversano, G.
Chollet, G.
Denby, B.
Dreyfus, G.
Oussar, Y.
Roussel, P.
Stone, M.
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 1245 - +
[5] Silent vs Vocalized Articulation for a Portable Ultrasound-Based Silent Speech Interface
Florescu, Victoria-M
Crevier-Buchman, Lise
Denby, Bruce
Hueber, Thomas
Colazo-Simon, Antonia
Pillot-Loiseau, Claire
Roussel, Pierre
Gendrot, Cedric
Quattrocchi, Sophie
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 450 - +
[6] Continuous Articulatory-to-Acoustic Mapping using Phone-based Trajectory HMM for a Silent Speech Interface
Hueber, Thomas
Bailly, Gerard
Denby, Bruce
[J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 722 - 725
[7] Ultrasound-Based Silent Speech Interface Using Convolutional and Recurrent Neural Networks
Moliner Juanpere, Eloi
Csapo, Tamas Gabor
[J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2019, 105 (04) : 587 - 590
[8] Ultrasound-Based Silent Speech Interface using Sequential Convolutional Auto-encoder
Xu, Kele
Wu, Yuxiang
Gao, Zhifeng
[J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2194 - 2195
[9] Speech modelling based on acoustic-to-articulatory mapping
Schoentgen, J
[J]. NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 114 - 135
[10] Neural Speaker Embeddings for Ultrasound-based Silent Speech Interfaces
Shandiz, Amin Honarmandi
Toth, Laszlo
Gosztolya, Gabor
Marko, Alexandra
Csapo, Tamas Gabor
[J]. INTERSPEECH 2021, 2021, : 1932 - 1936

← 1 2 3 4 5 →