Robust Word Recognition using articulatory trajectories and Gestures

被引:0
|
作者
Mitra, Vikramjit [1 ]
Nam, Hosung [2 ]
Espy-Wilson, Carol [1 ]
Saltzman, Elliot [2 ,3 ]
Goldstein, Louis [2 ,4 ]
机构
[1] Univ Maryland, Dept Elect & Comp Eng, Syst Res Inst, College Pk, MD 20742 USA
[2] Haskins Labs Inc, New Haven, CT USA
[3] Boston Univ, Dept Phys Therapy & Athlet Training, Boston, MA USA
[4] Univ Southern Calif, Dept Linguist, Los Angeles, CA USA
关键词
Noise Robust Speech Recognition; Articulatory Phonology; Speech gestures; Tract Variables; TADA Model Neural Networks; Speech Inversion;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Articulatory Phonology views speech as an ensemble of constricting events (e.g. narrowing lips, raising tongue tip), gestures, at distinct organs (lips, tongue tip, tongue body, velum, and glottis) along the vocal tract. This study shows that articulatory information in the form of gestures and their output trajectories (tract variable time functions or TVs) can help to improve the performance of automatic speech recognition systems. The lack of any natural speech database containing such articulatory information prompted us to use a synthetic speech dataset (obtained from Haskins Laboratories TAsk Dynamic model of speech production) that contains acoustic waveform for a given utterance and its corresponding gestures and TVs. First, we propose neural network based models to recognize the gestures and estimate the TVs from acoustic information. Second, the "synthetic-data trained" articulatory models were applied to the natural speech utterances in Aurora-2 corpus to estimate their gestures and TVs. Finally, we show that the estimated articulatory information helps to improve the noise robustness of a word recognition system when used along with the cepstral features.
引用
收藏
页码:2038 / +
页数:2
相关论文
共 50 条
  • [41] Robust Recognition of Tactile Gestures for Intuitive Robot Programming and Control
    Kubus, Daniel
    Muxfeldt, Arne
    Kissener, Konrad
    Haus, Jan
    Steil, Jochen
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 1643 - 1650
  • [42] Robust 3D Hand Detection for Gestures Recognition
    Cerlinca, Tudor Loan
    Pentiuc, Stefan Gheorghe
    INTELLIGENT DISTRIBUTED COMPUTING V, 2011, 382 : 259 - 264
  • [43] SPEECH PRODUCTION - OBSERVATION AND MODELING OF ARTICULATORY GESTURES
    FUJIMURA, O
    BIOTELEMETRY, 1975, 2 (1-2) : 118 - 119
  • [44] Combating Adversarial Misspellings with Robust Word Recognition
    Pruthi, Danish
    Dhingra, Bhuwan
    Lipton, Zachary C.
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5582 - 5591
  • [45] ROBUST WORD RECOGNITION IN ADVERSE CAR ENVIRONMENTS
    NAKAMURA, S
    AKABANE, T
    HAMAGUCHI, S
    KITOH, A
    SHARP TECHNICAL JOURNAL, 1993, (57): : 9 - 14
  • [46] Robust word recognition in adverse car environments
    Nakamura, Satoshi
    Akabane, Toshio
    Hamaguchi, Seiji
    Kitoh, Atsunori
    Shapu Giho/Sharp Technical Journal, 1993, (57): : 9 - 14
  • [47] Noise robust isolated word recognition using speech feature enhancement techniques
    Ecole Nationale d'Ingénieurs de Sfax ENIS, Department of Génie Electrique, BP W, 3038 Sfax, Tunisia
    不详
    J. Appl. Sci., 2007, 24 (3935-3942):
  • [48] AWARENESS OF CERTAIN ARTICULATORY GESTURES IN NORMAL-SPEAKING AND ARTICULATORY-DEFECTIVE CHILDREN
    RUSCELLO, DM
    MOREAU, VK
    SHOLTIS, D
    JOURNAL OF COMMUNICATION DISORDERS, 1980, 13 (01) : 59 - 64
  • [50] Improvement of Phone Recognition Accuracy Using Articulatory Features
    K. E. Manjunath
    K. Sreenivasa Rao
    Circuits, Systems, and Signal Processing, 2018, 37 : 704 - 728