Acoustic-Phonetic Analysis for Speech Recognition: A Review

被引:7
|
作者
Sarma, Biswajit Dev [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol Guwahati, EMST Lab, Elect & Elect Engn, Gauhati, India
关键词
Acoustic-phonetic knowledge; Speech recognition approaches; LANDMARK DETECTION; SPEAKER VERIFICATION; FORMANT TRANSITIONS; FEATURES; CLASSIFICATION; VOWEL; SPECTRUM; MODEL; ALGORITHM; TUTORIAL;
D O I
10.1080/02564602.2017.1293570
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper reviews the literature related to the acoustic-phonetic analysis of speech and the speech recognition approaches that use these types of knowledge. At first, acoustic-phonetic cues that are important for recognition of different sound units are presented. This include description of the acoustic-phonetic events, literature related to analysis and automatic detection of the events, and significance of the events in automatic speech recognition. Next, different speech recognition approaches are discussed and the literature related to the use of acoustic-phonetic knowledge by these approaches are reviewed. Finally, different approaches are compared and a framework suitable for recognition of phones present in syllable-like units is proposed.
引用
收藏
页码:305 / 327
页数:23
相关论文
共 50 条
  • [41] Acoustic-phonetic properties of Siri- and human-directed speech
    Cohn, Michelle
    Segedin, Bruno Ferenc
    Zellou, Georgia
    [J]. JOURNAL OF PHONETICS, 2022, 90
  • [42] Individual Differences in the Use of Acoustic-Phonetic Versus Lexical Cues for Speech Perception
    Giovannone, Nikole
    Theodore, Rachel M.
    [J]. FRONTIERS IN COMMUNICATION, 2021, 6
  • [43] Classification of Fricatives Using Feature Extrapolation of Acoustic-Phonetic Features in Telephone Speech
    Lee, Jung-Won
    Choi, Jeung-Yoon
    Kang, Hong-Goo
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1268 - 1271
  • [44] Acoustic-phonetic processing in semantic dementia
    Kwok, Shaleigh
    Reilly, Jamie
    Grossman, Murray
    Work, Melissa
    [J]. BRAIN AND LANGUAGE, 2006, 99 (1-2) : 145 - 146
  • [45] STATISTICAL SEGMENTATION OF A SPEECH SIGNAL IN KNOWLEDGE-BASED ACOUSTIC-PHONETIC DECODING
    ANDREOBRECHT, R
    PARLANGEAU, N
    [J]. JOURNAL DE PHYSIQUE IV, 1994, 4 (C5): : 477 - 480
  • [46] Acoustic-Phonetic Modeling in the SPICOS System
    Ney, Hermann
    Noll, Andreas
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02): : 312 - 319
  • [47] Multi-lingual phoneme recognition exploiting acoustic-phonetic similarities of sounds
    Kohler, J
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2195 - 2198
  • [49] An acoustic-phonetic feature-based system for the automatic recognition of fricative consonants
    Ali, AMA
    Van der Speigel, J
    Mueller, P
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 961 - 964
  • [50] ACOUSTIC-PHONETIC PRIMING IN SPOKEN WORD RECOGNITION - A TEST OF THE NEIGHBORHOOD ACTIVATION MODEL
    PISONI, DB
    GOLDINGER, SD
    LUCE, PA
    [J]. BULLETIN OF THE PSYCHONOMIC SOCIETY, 1988, 26 (06) : 505 - 506