Acoustic-Phonetic Analysis for Speech Recognition: A Review

Cited by: 7

Authors:

Sarma, Biswajit Dev [1]

Prasanna, S. R. Mahadeva [1]

Affiliations:

[1] Indian Inst Technol Guwahati, EMST Lab, Elect & Elect Engn, Gauhati, India

Source:

IETE TECHNICAL REVIEW | 2018 / Vol. 35 / No. 03

Keywords:

Acoustic-phonetic knowledge; Speech recognition approaches; LANDMARK DETECTION; SPEAKER VERIFICATION; FORMANT TRANSITIONS; FEATURES; CLASSIFICATION; VOWEL; SPECTRUM; MODEL; ALGORITHM; TUTORIAL;

DOI:

10.1080/02564602.2017.1293570

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

This paper reviews the literature on the acoustic-phonetic analysis of speech and on speech recognition approaches that use this type of knowledge. First, the acoustic-phonetic cues that are important for recognizing different sound units are presented. This includes a description of the acoustic-phonetic events, the literature on their analysis and automatic detection, and their significance in automatic speech recognition. Next, different speech recognition approaches are discussed, and the literature on their use of acoustic-phonetic knowledge is reviewed. Finally, the approaches are compared, and a framework suitable for recognizing phones present in syllable-like units is proposed.

Pages: 305 - 327

Page count: 23

50 items in total

[41] Acoustic-phonetic properties of Siri- and human-directed speech
Cohn, Michelle
Segedin, Bruno Ferenc
Zellou, Georgia
[J]. JOURNAL OF PHONETICS, 2022, 90
[42] Individual Differences in the Use of Acoustic-Phonetic Versus Lexical Cues for Speech Perception
Giovannone, Nikole
Theodore, Rachel M.
[J]. FRONTIERS IN COMMUNICATION, 2021, 6
[43] Classification of Fricatives Using Feature Extrapolation of Acoustic-Phonetic Features in Telephone Speech
Lee, Jung-Won
Choi, Jeung-Yoon
Kang, Hong-Goo
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011 : 1268 - 1271
[44] Acoustic-phonetic processing in semantic dementia
Kwok, Shaleigh
Reilly, Jamie
Grossman, Murray
Work, Melissa
[J]. BRAIN AND LANGUAGE, 2006, 99 (1-2) : 145 - 146
[45] STATISTICAL SEGMENTATION OF A SPEECH SIGNAL IN KNOWLEDGE-BASED ACOUSTIC-PHONETIC DECODING
ANDREOBRECHT, R
PARLANGEAU, N
[J]. JOURNAL DE PHYSIQUE IV, 1994, 4 (C5) : 477 - 480
[46] Acoustic-Phonetic Modeling in the SPICOS System
Ney, Hermann
Noll, Andreas
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02) : 312 - 319
[47] Multi-lingual phoneme recognition exploiting acoustic-phonetic similarities of sounds
Kohler, J
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996 : 2195 - 2198
[48] ACOUSTIC-PHONETIC DEMONOLOGY - AN EXEMPLARY FAILURE - A REVIEW OF COOPER,WILLIAM SPEECH-PERCEPTION AND PRODUCTION - COOPER,WE
REMEZ, RE
[J]. PHONETICA, 1983, 40 (04) : 330 - 332
[49] An acoustic-phonetic feature-based system for the automatic recognition of fricative consonants
Ali, AMA
Van der Spiegel, J
Mueller, P
[J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998 : 961 - 964
[50] ACOUSTIC-PHONETIC PRIMING IN SPOKEN WORD RECOGNITION - A TEST OF THE NEIGHBORHOOD ACTIVATION MODEL
PISONI, DB
GOLDINGER, SD
LUCE, PA
[J]. BULLETIN OF THE PSYCHONOMIC SOCIETY, 1988, 26 (06) : 505 - 506