MLLR-PRSW for Kinematic-Independent Acoustic-to-Articulatory Inversion

Citations: 0
Authors
Bozorg, Narjes [1]
Johnson, Michael T. [1]
Affiliations
[1] Univ Kentucky, Dept Elect & Comp Engn, Lexington, KY 40506 USA
Keywords
Speaker-independent acoustic-to-articulatory inversion; electromagnetic articulography; maximum likelihood linear regression; parallel reference speaker weighting
DOI
10.1109/isspit47144.2019.9001752
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents an improved method for kinematic-independent acoustic-to-articulatory inversion, using acoustic adaptation to estimate weights for articulatory model creation from reference speakers. Paired acoustic and articulatory data from the Marquette Electromagnetic Articulography corpus of Mandarin Accented English (EMAMAE) are used for experimental evaluation. The new method is a modification of the Parallel Reference Speaker Weighting (PRSW) inversion algorithm, in which two separate methods are used for acoustic and articulatory adaptation. A Maximum Likelihood Linear Regression (MLLR) approach is used for the acoustic adaptation model, and the PRSW weighted reference speaker approach is used for articulatory model adaptation. The new MLLR-PRSW adaptation method outperforms the baseline PRSW method on inversion of new test subjects where no kinematic data is used for training, providing estimated trajectories very close to the results from speaker-dependent models that do use kinematic data.
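The reference-speaker weighting idea in the abstract can be illustrated with a toy sketch. It is not the paper's implementation. It assumes each speaker is summarized by a single acoustic mean vector and a single articulatory mean vector, and it fits the weights by ordinary least squares, whereas the paper works with adapted statistical models. The function names `estimate_weights` and `weighted_articulatory_mean` and all numbers are hypothetical.

```python
import numpy as np

def estimate_weights(ref_acoustic, target_acoustic):
    """Fit weights w so that the weighted sum of reference acoustic
    means approximates the target speaker's acoustic mean.

    ref_acoustic:    (n_speakers, d_acoustic) reference means
    target_acoustic: (d_acoustic,) target mean (assumed to come from
                     some acoustic adaptation step, e.g. MLLR)
    Least squares is an illustrative stand-in, not the paper's estimator.
    """
    w, *_ = np.linalg.lstsq(ref_acoustic.T, target_acoustic, rcond=None)
    return w

def weighted_articulatory_mean(ref_articulatory, weights):
    """Apply the acoustically estimated weights to the reference
    speakers' articulatory means; no target kinematic data is needed."""
    return weights @ ref_articulatory

# Toy data: two reference speakers, 3-dim acoustic, 2-dim articulatory.
ref_acoustic = np.array([[1.0, 0.0, 2.0],
                         [0.0, 1.0, 1.0]])
ref_articulatory = np.array([[10.0, 20.0],
                             [30.0, 40.0]])

# Synthetic target built from known weights so recovery can be checked.
true_w = np.array([0.25, 0.75])
target_acoustic = true_w @ ref_acoustic

w = estimate_weights(ref_acoustic, target_acoustic)
estimate = weighted_articulatory_mean(ref_articulatory, w)
print(w, estimate)
```

The point of the sketch is the data flow: weights are fitted using acoustic information only and are then reused on the articulatory side, which is why no kinematic data from the new speaker is required.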
Pages: 5
Related papers
50 in total
  • [1] Parallel Reference Speaker Weighting for Kinematic-Independent Acoustic-to-Articulatory Inversion
    Ji, An
    Johnson, Michael T.
    Berry, Jeffrey J.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (10) : 1865 - 1875
  • [2] A SUBJECT-INDEPENDENT ACOUSTIC-TO-ARTICULATORY INVERSION
    Ghosh, Prasanta Kumar
    Narayanan, Shrikanth S.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4624 - 4627
  • [3] Improved subject-independent acoustic-to-articulatory inversion
    Afshan, Amber
    Ghosh, Prasanta Kumar
    [J]. SPEECH COMMUNICATION, 2015, 66 : 1 - 16
  • [4] Autoregressive Articulatory WaveNet Flow for Speaker-Independent Acoustic-to-Articulatory Inversion
    Bozorg, Narjes
    Johnson, Michael T.
    Soleymanpour, Mohammad
    [J]. 2021 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2021, : 156 - 161
  • [5] Better acoustic normalization in subject independent acoustic-to-articulatory inversion: benefit to recognition
    Afshan, Amber
    Ghosh, Prasanta Kumar
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5395 - 5399
  • [6] Formant Trajectories for Acoustic-to-Articulatory Inversion
    Ozbek, I. Yuecel
    Hasegawa-Johnson, Mark
    Demirekler, Muebeccel
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2783 - +
  • [7] Incorporation of phonetic constraints in acoustic-to-articulatory inversion
    Potard, Blaise
    Laprie, Yves
    Ouni, Slim
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (04): : 2310 - 2323
  • [8] A DEEP RECURRENT APPROACH FOR ACOUSTIC-TO-ARTICULATORY INVERSION
    Liu, Peng
    Yu, Quanjie
    Wu, Zhiyong
    Kang, Shiyin
    Meng, Helen
    Cai, Lainhong
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4450 - 4454
  • [9] A generalized smoothness criterion for acoustic-to-articulatory inversion
    Ghosh, Prasanta Kumar
    Narayanan, Shrikanth
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 128 (04): : 2162 - 2172
  • [10] Acoustic-to-Articulatory Inversion based on Local Regression
    Al Moubayed, Samer
    Ananthakrishnan, G.
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 937 - 940