Acoustic-Phonetic Analysis for Speech Recognition: A Review

Cited by: 7
Authors
Sarma, Biswajit Dev [1]
Prasanna, S. R. Mahadeva [1]
Affiliations
[1] Indian Inst Technol Guwahati, EMST Lab, Elect & Elect Engn, Gauhati, India
Keywords
Acoustic-phonetic knowledge; Speech recognition approaches; LANDMARK DETECTION; SPEAKER VERIFICATION; FORMANT TRANSITIONS; FEATURES; CLASSIFICATION; VOWEL; SPECTRUM; MODEL; ALGORITHM; TUTORIAL;
DOI
10.1080/02564602.2017.1293570
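Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808; 0809;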
Abstract
This paper reviews the literature on the acoustic-phonetic analysis of speech and on the speech recognition approaches that use this kind of knowledge. First, the acoustic-phonetic cues that are important for recognizing different sound units are presented. This includes a description of the acoustic-phonetic events, the literature on their analysis and automatic detection, and the significance of the events in automatic speech recognition. Next, different speech recognition approaches are discussed, and the literature on the use of acoustic-phonetic knowledge by these approaches is reviewed. Finally, the approaches are compared, and a framework suitable for recognizing the phones present in syllable-like units is proposed.
Pages: 305-327
Page count: 23
Related papers
50 in total
  • [31] ACOUSTIC-PHONETIC FEATURE BASED DIALECT IDENTIFICATION IN HINDI SPEECH
    Sinha, Shweta
    Jain, Aruna
    Agrawal, S. S.
    [J]. INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2015, 8 (01) : 235 - 254
  • [32] Acoustic-phonetic characteristics of hyperarticulated speech for different speaking styles
    Köster, S
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 873 - 876
  • [33] An Acoustic-Phonetic Approach to Effects of Face Masks on Speech Intelligibility
    Kim, Yunjung
    Thompson, Austin
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2022, 65 (12) : 4679 - 4689
  • [34] Automatic assessments of dysarthric speech: the usability of acoustic-phonetic features
    van Bemmel, Loes
    Pesenti, Chiara
    Wei, Xue
    Strik, Helmer
    [J]. INTERSPEECH 2023, 2023, : 141 - 145
  • [35] A MODULE FOR ACOUSTIC-PHONETIC TRANSCRIPTION OF FLUENTLY SPOKEN GERMAN SPEECH
    REGEL, P
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1982, 30 (03) : 440 - 450
  • [36] STAR-PAK - A SIGNAL-PROCESSING PACKAGE FOR ACOUSTIC-PHONETIC ANALYSIS OF SPEECH
    DUNCAN, G
    DALBY, J
    JACK, MA
    [J]. PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 77 - 83
  • [37] APPLICATION OF ADAPTIVE THRESHOLD ELEMENTS TO RECOGNITION OF ACOUSTIC-PHONETIC STATES
    DAMMANN, JE
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1965, 38 (02): : 213 - &
  • [38] ACOUSTIC-PHONETIC FEATURES OF STRESSED SYLLABLES IN SPEECH OF 3 YEAR OLDS
    HAWKINS, S
    ALLEN, G
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 63 : S56 - S56
  • [39] Physiological and Cognitive Status Monitoring on the Base of Acoustic-Phonetic Speech Parameters
    Kiss, Gabor
    Vicsi, Klara
    [J]. STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2014, 2014, 8791 : 120 - 131
  • [40] Acoustic-phonetic properties of Siri- and human-directed speech
    Cohn, Michelle
    Segedin, Bruno Ferenc
    Zellou, Georgia
    [J]. JOURNAL OF PHONETICS, 2022, 90