Multi-corpus Acoustic-to-articulatory Speech Inversion

Cited by: 8
Authors
Seneviratne, Nadee [1]
Sivaraman, Ganesh [2]
Espy-Wilson, Carol [1]
Affiliations
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Pindrop, Atlanta, GA USA
Source
Keywords
Acoustic-to-articulatory speech inversion; multi-task learning; articulatory phonology; tract variables;
DOI
10.21437/Interspeech.2019-3168
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification code
100104 ; 100213 ;
Abstract
Several technologies are used to measure speech articulatory movements, including electromagnetic articulometry (EMA), ultrasound, real-time magnetic resonance imaging (MRI), and X-ray microbeam. Each of these techniques provides a different view of the vocal tract. Even measurements made with similar techniques differ greatly because of differences in sensor placement and speaker anatomy. This limits most articulatory studies to single datasets. However, to yield better results in their applications, speech inversion systems should generalize better, which requires combining data from multiple sources. This paper proposes a multi-task learning based deep neural network architecture for acoustic-to-articulatory speech inversion, trained on three different articulatory datasets: two measured using EMA and one using X-ray microbeam. Experiments show that the proposed acoustic-to-articulatory mapping is more accurate than systems trained on single datasets.
Pages: 859-863
Page count: 5
Related papers
50 in total
  • [1] ACOUSTIC-TO-ARTICULATORY INVERSION FOR DYSARTHRIC SPEECH BY USING CROSS-CORPUS ACOUSTIC-ARTICULATORY DATA
    Maharana, Sarthak Kumar
    Illa, Aravind
    Mannem, Renuka
    Belur, Yamini
    Shetty, Preetie
    Kumar, Veeramani Preethish
    Vengalil, Seena
    Polavarapu, Kiran
    Atchayaram, Nalini
    Ghosh, Prasanta Kumar
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6458 - 6462
  • [2] Acoustic-to-articulatory Speech Inversion with Multi-task Learning
    Siriwardena, Yashish M.
    Sivaraman, Ganesh
    Espy-Wilson, Carol
    [J]. INTERSPEECH 2022, 2022, : 5020 - 5024
  • [3] A COMPARATIVE STUDY OF ACOUSTIC-TO-ARTICULATORY INVERSION FOR NEUTRAL AND WHISPERED SPEECH
    Illa, Aravind
    Meenakshi, Nisha G.
    Ghosh, Prasanta Kumar
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5075 - 5079
  • [4] ACOUSTIC-TO-ARTICULATORY INVERSION BASED ON SPEECH DECOMPOSITION AND AUXILIARY FEATURE
    Wang, Jianrong
    Liu, Jinyu
    Zhao, Longxuan
    Wang, Shanyu
    Yu, Ruiguo
    Liu, Li
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4808 - 4812
  • [5] Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models
    Shahrebabaki, Abdolreza Sabzi
    Salvi, Giampiero
    Svendsen, Torbjorn
    Siniscalchi, Sabato Marco
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 135 - 147
  • [6] The impact of cross language on acoustic-to-articulatory inversion and its influence on articulatory speech synthesis
    Illa, Aravind
    Nair, Aanish
    Ghosh, Prasanta Kumar
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8267 - 8271
  • [7] Formant Trajectories for Acoustic-to-Articulatory Inversion
    Ozbek, I. Yuecel
    Hasegawa-Johnson, Mark
    Demirekler, Muebeccel
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2783 - +
  • [8] Analysis of acoustic-to-articulatory speech inversion across different accents and languages
    Sivaraman, Ganesh
    Espy-Wilson, Carol
    Wieling, Martijn
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 974 - 978
  • [9] Incorporation of phonetic constraints in acoustic-to-articulatory inversion
    Potard, Blaise
    Laprie, Yves
    Ouni, Slim
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (04): : 2310 - 2323
  • [10] A SUBJECT-INDEPENDENT ACOUSTIC-TO-ARTICULATORY INVERSION
    Ghosh, Prasanta Kumar
    Narayanan, Shrikanth S.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4624 - 4627