Speech recognition using cepstral articulatory features

被引:4
|
作者
Najnin, Shamima [1 ]
Banerjee, Bonny [2 ,3 ]
机构
[1] Intel Corp, Hillsboro, OR 97124 USA
[2] Univ Memphis, Inst Intelligent Syst, Memphis, TN 38152 USA
[3] Univ Memphis, Dept Elect & Comp Engn, Memphis, TN 38152 USA
基金
美国国家科学基金会;
关键词
Phoneme recognition; Acoustic feature; Cepstral articulatory feature; Inversion mapping; General regression neural network; Deep neural network; RETINAL PROJECTIONS; DEEP;
D O I
10.1016/j.specom.2019.01.002
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Though speech recognition has been widely investigated in the past decades, the role of articulation in recognition has received scant attention. Recognition accuracy increases when recognizers are trained with acoustic features in conjunction with articulatory ones. Traditionally, acoustic features are represented by mel-frequency cepstral coefficients (MFCCs) while articulatory features are represented by the locations or trajectories of the articulators. We propose the articulatory cepstral coefficients (ACCs) as features which are the cepstral coefficients of the time-location articulatory signal. We show that ACCs yield state-of-the-art results in phoneme classification and recognition on benchmark datasets over a wide range of experiments. The similarity of MFCCs and ACCs and their superior performance in isolation and conjunction indicate that common algorithms can be effectively used for acoustic and articulatory signals.
引用
收藏
页码:26 / 37
页数:12
相关论文
共 50 条
  • [41] Articulatory and excitation source features for speech recognition in read, extempore and conversation modes
    Manjunath, K. E.
    Rao, K. Sreenivasa
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (01) : 121 - 134
  • [42] Emotion Recognition using Ensemble of Cepstral, Perceptual and Temporal Features
    Vasuki, P.
    Arvind
    Vaideesh
    Shamsudeen, Mohamed
    Abubacker
    [J]. 2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 2, 2016, : 43 - 48
  • [43] Articulatory Knowledge in the Recognition of Dysarthric Speech
    Rudzicz, Frank
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 947 - 960
  • [44] SPEECH RECOGNITION USING REGULARIZED MINIMUM VARIANCE DISTORTIONLESS RESPONSE SPECTRUM ESTIMATION-BASED CEPSTRAL FEATURES
    Alam, Md Jahangir
    Kenny, Patrick
    O'Shaughnessy, Douglas
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8071 - 8075
  • [45] ARTICULATORY FEATURES FOR EXPRESSIVE SPEECH SYNTHESIS
    Black, Alan W.
    Bunnell, H. Timothy
    Dou, Ying
    Muthukumar, Prasanna Kumar
    Metze, Florian
    Perry, Daniel
    Polzehl, Tim
    Prahallad, Kishore
    Steidl, Stefan
    Vaughn, Callie
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4005 - 4008
  • [46] Articulatory Features for ASR of Pathological Speech
    Yilmaz, Emre
    Mitra, Vikramjit
    Bartels, Chris
    Franco, Horacio
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2958 - 2962
  • [47] Musical instrument recognition using cepstral coefficients and temporal features
    Eronen, A
    Klapuri, A
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 753 - 756
  • [48] Time-Varying LP Cepstral Features for Improved Isolated Word Speech Recognition
    Ang, Federico
    Tsutsui, Hiroshi
    Miyanaga, Yoshikazu
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2015, : 302 - 306
  • [49] On the Contribution of Articulatory Features to Speech Synthesis
    Matura, Martin
    Juzova, Marketa
    Matousek, Jindrich
    [J]. SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 398 - 407
  • [50] Discriminating Parkinson and Healthy People Using Phonation and Cepstral Features of Speech
    Upadhya, Savitha S.
    Cheeran, A. N.
    [J]. 8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 197 - 202