A SPARSE SMOOTHING APPROACH FOR GAUSSIAN MIXTURE MODEL BASED ACOUSTIC-TO-ARTICULATORY INVERSION

被引:0
|
作者
Sudhakar, Prasad [1 ]
Jacques, Laurent [1 ]
Ghosh, Prasanta Kumar
机构
[1] Catholic Univ Louvain, ICTEAM ELEN, Louvain, Belgium
关键词
acoustic-to-articulatory inversion; smoothing; Gaussian mixture model; sparsity; chambolle-pock; l(1) minimization;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
It is well-known that the performance of the Gaussian Mixture Model (GMM) based Acoustic-to-Articulatory Inversion (AAI) improves by either incorporating smoothness constraint directly in the inversion criterion or smoothing (low-pass filtering) estimated articulator trajectories in a post-processing step, where smoothing is performed independently of the inversion. As the low-pass filtering is independent of inversion, the smoothed articulator trajectory samples no longer remain optimal as per the inversion criterion. In this work, we propose a sparse smoothing technique which constrains the smoothed articulator trajectory to be different from the estimated trajectory only at a sparse subset of samples while simultaneously achieving the required degree of smoothness. Inversion experiments on the articulatory database show that the sparse smoothing achieves an AAI performance similar to that using low-pass filtering but in sparse smoothing similar to 15% (on average) of the samples in the smoothed articulator trajectory remain identical to those in the estimated articulator trajectory thereby preserve their AAI optimality as opposed to 0% in low-pass filtering.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion
    Xie, Xurong
    Liu, Xunying
    Lee, Tan
    Wang, Lan
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 36 - 40
  • [22] Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion
    Ouni, S
    Laprie, Y
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 118 (01): : 444 - 460
  • [23] Temporal Convolution Network Based Joint Optimization of Acoustic-to-Articulatory Inversion
    Sun, Guolun
    Huang, Zhihua
    Wang, Li
    Zhang, Pengyuan
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (19):
  • [24] Is average RMSE appropriate for evaluating acoustic-to-articulatory inversion?
    Fang, Qiang
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 997 - 1003
  • [25] Improved subject-independent acoustic-to-articulatory inversion
    National Institute of Technology, Karnataka , Mangalore
    575025, India
    不详
    560012, India
    [J]. Speech Commun, (1-16):
  • [26] Acoustic-to-Articulatory Inversion Using Particle Swarm Optimization
    Fairee, Suthida
    Sirinaovakul, Booncharoen
    Prom-on, Santitham
    [J]. 2015 12TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2015,
  • [27] Acoustic-to-articulatory inversion from infants' vowel vocalizations
    Oohashi, Hiroki
    Watanabe, Hama
    Taga, Gentaro
    [J]. NEUROSCIENCE RESEARCH, 2011, 71 : E286 - E286
  • [28] Acoustic-to-articulatory mapping based on mixture of probabilistic canonical correlation analysis
    Uchida, Hidetsugu
    Saito, Daisuke
    Minematsu, Nobuaki
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 989 - 993
  • [29] Improved subject-independent acoustic-to-articulatory inversion
    Afshan, Amber
    Ghosh, Prasanta Kumar
    [J]. SPEECH COMMUNICATION, 2015, 66 : 1 - 16
  • [30] Multi-corpus Acoustic-to-articulatory Speech Inversion
    Seneviratne, Nadee
    Sivaraman, Ganesh
    Espy-Wilson, Carol
    [J]. INTERSPEECH 2019, 2019, : 859 - 863