A Comprehensive Examination of Phoneme Recognition in Automatic Speech Recognition Systems

被引:0
|
作者
Bhatt, Shobha [1 ]
Bansal, Shweta [2 ]
Kumar, Ankit [3 ]
Pandey, Saroj Kumar [3 ]
Ojha, Manoj Kumar [2 ]
Singh, Kamred Udham [4 ]
Chakraborty, Sanjay [5 ]
Singh, Teekam [6 ]
Swarup, Chetan [7 ]
机构
[1] Netaji Subhash Univ Technol, Dept Comp Sci & Engn, Delhi 110078, India
[2] KR Mangalam Univ, Dept Comp Sci Engn, Gurugram 122103, India
[3] GLA Univ, Dept Comp Engn & Applicat, Mathura 281406, India
[4] Graph Era Hill Univ, Sch Comp, Dehra Dun 248002, India
[5] Techno Int New Town, Dept Comp Sci & Engn, Kolkata 700156, India
[6] Graph Era Deemed Univ, Dept Comp Sci & Engn, Dehra Dun 248002, India
[7] Saudi Elect Univ, Coll Sci & Theoret Studies, Dept Basic Sci, Riyadh Male Campus, Riyadh 13316, Saudi Arabia
关键词
review phoneme speech consonants; vowels recognition; FEATURES;
D O I
10.18280/ts.400518
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This review offers an exhaustive examination of phoneme recognition, an essential subword acoustic unit in speech processing. Phoneme-based systems find widespread utility in diverse applications including speech recognition, speaker identification, and language recognition. The efficacy of these systems hinges upon the precise recognition of phonemes, thereby underscoring the criticality of enhancing our understanding of phoneme recognition to optimize system performance. Previous reviews have primarily focused on specific issues within the realm of phoneme recognition, with comprehensive studies on the subject being notably sparse in existing literature. Consequently, there is an urgent need for an extensive investigation into phoneme recognition to bolster recognition accuracy. This comprehensive review seeks to bridge this knowledge gap by examining pivotal aspects such as vowel recognition, consonant recognition, acoustic-phonetic cues, contextual effects, feature extraction methods, classification techniques, phoneme recognition enhancement strategies, and performance metrics. The review elucidates various technologies and trends in phoneme recognition, thereby providing valuable insights that can mitigate errors in phoneme-based systems through the application of appropriate techniques delineated in the study. The findings of this study hold substantial potential benefits for a wide spectrum of speech research communities, encompassing students, educators, specialists, developers, and scholars. The review encompasses both fundamental and advanced concepts pertinent to phoneme recognition, thereby offering a comprehensive resource for individuals engaged in this field.
引用
收藏
页码:1997 / 2008
页数:12
相关论文
共 50 条
  • [1] PHONEME SELECTION FOR STUDIES IN AUTOMATIC SPEECH RECOGNITION
    SHOUP, JE
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1962, 34 (04): : 397 - &
  • [2] Phoneme Confusions in Human and Automatic Speech Recognition
    Meyer, Bernd T.
    Waechter, Matthias
    Brand, Thomas
    Kollmeier, Birger
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2740 - 2743
  • [3] MLP BASED PHONEME DETECTORS FOR AUTOMATIC SPEECH RECOGNITION
    Thomas, Samuel
    Patrick Nguyen
    Zweig, Geoffrey
    Hermansky, Hynek
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5024 - 5027
  • [4] Automatic Phoneme Border Detection to Improve Speech Recognition
    Sergio, Suarez-Guerra
    Cristian-Remington, Juarez-Murillo
    Jose Luis, Oropeza-Rodriguez
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, MICAI 2015, PT I, 2015, 9413 : 127 - 135
  • [5] Contribution from the accuracy of phoneme recognition to the quality of automatic recognition of Russian speech
    Karpukhin I.A.
    [J]. Moscow University Computational Mathematics and Cybernetics, 2016, 40 (2) : 89 - 95
  • [6] Phoneme fuzzy characterization in speech recognition systems
    Beritelli, F
    Borrometi, L
    Cuce, A
    [J]. APPLICATIONS OF SOFT COMPUTING, 1997, 3165 : 305 - 306
  • [7] Automatic speech recognition systems
    Catariov, A
    [J]. Information Technologies 2004, 2004, 5822 : 83 - 93
  • [8] Automatic Fongbe Phoneme Recognition From Spoken Speech Signal
    Laleye, Frejus A. A.
    Ezin, Eugene C.
    Motamed, Cina
    [J]. ICINCO: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL 1, 2016, : 102 - 109
  • [9] Building Automatic Speech Recognition Systems for Moroccan Dialect: A Phoneme-Based Approach
    Abderrahim Ezzine
    Naouar Laaidi
    Ouissam Zealouk
    Hassan Satori
    [J]. SN Computer Science, 5 (6)
  • [10] PHONEME GROUPING FOR SPEECH RECOGNITION
    REDDY, DR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1967, 41 (05): : 1295 - &