A STATISTICAL APPROACH TO AUTOMATIC SPEECH RECOGNITION USING THE ATOMIC SPEECH UNITS CONSTRUCTED FROM OVERLAPPING ARTICULATORY FEATURES

被引:70
|
作者
DENG, L
SUN, DX
机构
[1] UNIV WATERLOO, DEPT ELECT & COMP ENGN, WATERLOO N2L 3G1, ON, CANADA
[2] SUNY STONY BROOK, DEPT APPL MATH & STAT, STONY BROOK, NY 11794 USA
来源
关键词
D O I
10.1121/1.409839
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In recent years, the development of a feature-based general statistical framework has been pursued for automatic speech recognition via novel designs of minimal or atomic units of speech, aiming at a parsimonious scheme to share the interword and interphone speech data and at a unified way to account for the context-dependent behaviors in Speech. The basic design philosophy has been motivated by the theory of distinctive features and by a new form of phonology which argues for use of multidimensional articulatory structures. In this paper, the most recently developed feature-based recognizer is presented, which is capable of operating on all classes of English sounds. Detailed descriptions of the design considerations for the recognizer and of key aspects of the design process are provided. This process, which is called lexicon ''compilation,'' consists of three elements (1) establishing a feature-specification system; (2) constructing a probabilistic and fractional temporal overlapping pattern across the features; and (3) mapping from the feature-overlap pattern to a state-transition graph. A standard phonetic classification task from the TIMIT database is used as a test bed to evaluate the performance of the recognizer; The experimental results provide preliminary evidence for the effectiveness of the feature-based approach to speech recognition.
引用
收藏
页码:2702 / 2719
页数:18
相关论文
共 50 条
  • [1] A STUDY ON ROBUSTNESS OF ARTICULATORY FEATURES FOR AUTOMATIC SPEECH RECOGNITION OF NEUTRAL AND WHISPERED SPEECH
    Srinivasan, Gokul
    Illa, Aravind
    Ghosh, Prasanta Kumar
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5936 - 5940
  • [2] Speech recognition using cepstral articulatory features
    Najnin, Shamima
    Banerjee, Bonny
    [J]. SPEECH COMMUNICATION, 2019, 107 : 26 - 37
  • [3] Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion
    Ghosh, Prasanta Kumar
    Narayanan, Shrikanth
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 130 (04): : EL251 - EL257
  • [4] A STATISTICAL APPROACH TO THE AUTOMATIC RECOGNITION OF SPEECH
    SMITH, JEK
    KLEM, L
    [J]. AMERICAN PSYCHOLOGIST, 1961, 16 (07) : 445 - 445
  • [5] Whispery speech recognition using adapted articulatory features
    Jou, SC
    Schultz, T
    Waibel, A
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1009 - 1012
  • [6] Articulatory Features for "Meeting" Speech Recognition
    Metze, Florian
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 581 - 584
  • [7] Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features
    Zhan, Qingran
    Motlicek, Petr
    Du, Shixuan
    Shan, Yahui
    Ma, Sifan
    Xie, Xiang
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1912 - 1916
  • [8] An HMM-based speech recognizer using overlapping articulatory features
    Erler, K
    Freeman, GH
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (04): : 2500 - 2513
  • [9] Automatic Speech Recognition Experiments with Articulatory Data
    Uraga, Esmeralda
    Hain, Thomas
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 353 - 356
  • [10] Applying articulatory features to speech emotion recognition
    Zhou, Yu
    Sun, Yanqing
    Yang, Lin
    Yan, Yonghong
    [J]. 2009 INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN COMPUTER SCIENCE, ICRCCS 2009, 2009, : 73 - 76