Multi-corpus Acoustic-to-articulatory Speech Inversion

被引：8

作者：

Seneviratne, Nadee ^{[1
]}

Sivaraman, Ganesh ^{[2
]}

Espy-Wilson, Carol ^{[1
]}

机构：

[1] Univ Maryland, College Pk, MD 20742 USA

[2] Pindrop, Atlanta, GA USA

来源：

INTERSPEECH 2019 | 2019年

关键词：

Acoustic-to-articulatory speech inversion; multi-task learning; articulatory phonology; tract variables;

D O I：

10.21437/Interspeech.2019-3168

中图分类号：

R36 [病理学]; R76 [耳鼻咽喉科学];

学科分类号：

100104 ; 100213 ;

摘要：

There are several technologies like Electromagnetic articulometry (EMA), ultrasound, real-time Magnetic Resonance Imaging (MRI), and X-ray microbeam that are used to measure speech articulatory movements. Each of these techniques provides a different view of the vocal tract. The measurements performed using the similar techniques also differ greatly due to differences in the placement of sensors, and the anatomy of speakers. This limits most articulatory studies to single datasets. However to yield better results in its applications, the speech inversion systems should be more generalized, which requires the combination of data from multiple sources. This paper proposes a multi-task learning based deep neural network architecture for acoustic-to-articulatory speech inversion trained using three different articulatory datasets - two of them were measured using EMA, and one using X-ray microbeam. Experiments show improved accuracy of the proposed acoustic-to-articulatory mapping compared to the systems trained using single datasets.

引用

页码：859 / 863

页数：5

共 50 条

[1] ACOUSTIC-TO-ARTICULATORY INVERSION FOR DYSARTHRIC SPEECH BY USING CROSS-CORPUS ACOUSTIC-ARTICULATORY DATA
Maharana, Sarthak Kumar
Illa, Aravind
Mannem, Renuka
Belur, Yamini
Shetty, Preetie
Kumar, Veeramani Preethish
Vengalil, Seena
Polavarapu, Kiran
Atchayaram, Nalini
Ghosh, Prasanta Kumar
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6458 - 6462
[2] Acoustic-to-articulatory Speech Inversion with Multi-task Learning
Siriwardena, Yashish M.
Sivaraman, Ganesh
Espy-Wilson, Carol
[J]. INTERSPEECH 2022, 2022, : 5020 - 5024
[3] A COMPARATIVE STUDY OF ACOUSTIC-TO-ARTICULATORY INVERSION FOR NEUTRAL AND WHISPERED SPEECH
Illa, Aravind
Meenakshi, Nisha G.
Ghosh, Prasanta Kumar
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5075 - 5079
[4] ACOUSTIC-TO-ARTICULATORY INVERSION BASED ON SPEECH DECOMPOSITION AND AUXILIARY FEATURE
Wang, Jianrong
Liu, Jinyu
Zhao, Longxuan
Wang, Shanyu
Yu, Ruiguo
Liu, Li
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4808 - 4812
[5] Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models
Shahrebabaki, Abdolreza Sabzi
Salvi, Giampiero
Svendsen, Torbjorn
Siniscalchi, Sabato Marco
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 135 - 147
[6] The impact of cross language on acoustic-to-articulatory inversion and its influence on articulatory speech synthesis
Illa, Aravind
Nair, Aanish
Ghosh, Prasanta Kumar
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8267 - 8271
[7] Formant Trajectories for Acoustic-to-Articulatory Inversion
Ozbek, I. Yuecel
Hasegawa-Johnson, Mark
Demirekler, Muebeccel
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2783 - +
[8] Analysis of acoustic-to-articulatory speech inversion across different accents and languages
Sivaraman, Ganesh
Espy-Wilson, Carol
Wieling, Martijn
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 974 - 978
[9] Incorporation of phonetic constraints in acoustic-to-articulatory inversion
Potard, Blaise
Laprie, Yves
Ouni, Slim
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (04): : 2310 - 2323
[10] A SUBJECT-INDEPENDENT ACOUSTIC-TO-ARTICULATORY INVERSION
Ghosh, Prasanta Kumar
Narayanan, Shrikanth S.
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4624 - 4627

← 1 2 3 4 5 →