Robust Word Recognition Using Articulatory Trajectories and Gestures

Authors
Mitra, Vikramjit [1]
Nam, Hosung [2]
Espy-Wilson, Carol [1]
Saltzman, Elliot [2, 3]
Goldstein, Louis [2, 4]
Affiliations
[1] Univ Maryland, Dept Elect & Comp Eng, Syst Res Inst, College Pk, MD 20742 USA
[2] Haskins Labs Inc, New Haven, CT USA
[3] Boston Univ, Dept Phys Therapy & Athlet Training, Boston, MA USA
[4] Univ Southern Calif, Dept Linguist, Los Angeles, CA USA
Keywords
Noise Robust Speech Recognition; Articulatory Phonology; Speech Gestures; Tract Variables; TADA Model; Neural Networks; Speech Inversion
DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract
Articulatory Phonology views speech as an ensemble of constricting events (e.g., narrowing the lips, raising the tongue tip), called gestures, at distinct organs (lips, tongue tip, tongue body, velum, and glottis) along the vocal tract. This study shows that articulatory information in the form of gestures and their output trajectories (tract variable time functions, or TVs) can help improve the performance of automatic speech recognition systems. The lack of any natural speech database containing such articulatory information prompted us to use a synthetic speech dataset (obtained from the Haskins Laboratories TAsk Dynamic model of speech production) that contains the acoustic waveform for a given utterance and its corresponding gestures and TVs. First, we propose neural-network-based models to recognize the gestures and estimate the TVs from acoustic information. Second, the "synthetic-data trained" articulatory models were applied to the natural speech utterances in the Aurora-2 corpus to estimate their gestures and TVs. Finally, we show that the estimated articulatory information helps improve the noise robustness of a word recognition system when used along with the cepstral features.
Pages: 2038 / +
Page count: 2