Statistical Mapping between Articulatory and Acoustic Data for an Ultrasound-based Silent Speech Interface

被引:0
|
作者
Hueber, Thomas [1 ]
Benaroya, Elie-Laurent [2 ]
Denby, Bruce [2 ,3 ]
Chollet, Gerard [4 ]
机构
[1] UMR 5216 CNRS INP UJF U Stendhal, GIPSA Lab, Grenoble, France
[2] ESPCI Paristech, Sigma Lab, Paris, France
[3] Univ Paris 06, Paris, France
[4] Telecom ParisTech, LTCY CNRS, Paris, France
关键词
silent speech interface; GMM; HMM; ultrasound; video; multimodal; statistical mapping;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents recent developments on our "silent speech interface" that converts tongue and lip motions, captured by ultrasound and video imaging, into audible speech. In our previous studies, the mapping between the observed articulatory movements and the resulting speech sound was achieved using a unit selection approach. We investigate here the use of statistical mapping techniques, based on the joint modeling of visual and spectral features, using respectively Gaussian Mixture Models (GMM) and Hidden Markov Models (HMM). The prediction of the voiced/unvoiced parameter from visual articulatory data is also investigated using an artificial neural network (ANN). A continuous speech database consisting of one-hour of high-speed ultrasound and video sequences was specifically recorded to evaluate the proposed mapping techniques.
引用
收藏
页码:600 / +
页数:2
相关论文
共 50 条
  • [1] Ultrasound-based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis
    Csapo, Tamas Gabor
    Zainko, Csaba
    Toth, Laszlo
    Gosztolya, Gabor
    Marko, Alexandra
    [J]. INTERSPEECH 2020, 2020, : 2727 - 2731
  • [2] Autoencoder-Based Articulatory-to-Acoustic Mapping for Ultrasound Silent Speech Interfaces
    Gosztolya, Gabor
    Pinter, Adam
    Toth, Laszlo
    Grosz, Tamas
    Marko, Alexandra
    Csapo, Tamas Gabor
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [3] Ultrasound-based Silent Speech Interface Built on a Continuous Vocoder
    Csapo, Tamas Gabor
    Al-Radhi, Mohammed Salah
    Nemeth, Geza
    Gosztolya, Gabor
    Grosz, Tamas
    Toth, Laszlo
    Marko, Alexandra
    [J]. INTERSPEECH 2019, 2019, : 894 - 898
  • [4] Eigentongue feature extraction for an ultrasound-based silent speech interface
    Hueber, T.
    Aversano, G.
    Chollet, G.
    Denby, B.
    Dreyfus, G.
    Oussar, Y.
    Roussel, P.
    Stone, M.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 1245 - +
  • [5] Silent vs Vocalized Articulation for a Portable Ultrasound-Based Silent Speech Interface
    Florescu, Victoria-M
    Crevier-Buchman, Lise
    Denby, Bruce
    Hueber, Thomas
    Colazo-Simon, Antonia
    Pillot-Loiseau, Claire
    Roussel, Pierre
    Gendrot, Cedric
    Quattrocchi, Sophie
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 450 - +
  • [6] Continuous Articulatory-to-Acoustic Mapping using Phone-based Trajectory HMM for a Silent Speech Interface
    Hueber, Thomas
    Bailly, Gerard
    Denby, Bruce
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 722 - 725
  • [7] Ultrasound-Based Silent Speech Interface Using Convolutional and Recurrent Neural Networks
    Moliner Juanpere, Eloi
    Csapo, Tamas Gabor
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2019, 105 (04) : 587 - 590
  • [8] Ultrasound-Based Silent Speech Interface using Sequential Convolutional Auto-encoder
    Xu, Kele
    Wu, Yuxiang
    Gao, Zhifeng
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2194 - 2195
  • [9] Speech modelling based on acoustic-to-articulatory mapping
    Schoentgen, J
    [J]. NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 114 - 135
  • [10] Neural Speaker Embeddings for Ultrasound-based Silent Speech Interfaces
    Shandiz, Amin Honarmandi
    Toth, Laszlo
    Gosztolya, Gabor
    Marko, Alexandra
    Csapo, Tamas Gabor
    [J]. INTERSPEECH 2021, 2021, : 1932 - 1936