Acoustic Modeling in Speech Recognition: A Systematic Review

Authors
Bhatt, Shobha [1 ]
Jain, Anurag [1 ]
Dev, Amita [2 ]
Affiliations
[1] Guru Gobind Singh Indraprastha Univ GGSIPU, Univ Sch Informat & Commun Technol, New Delhi, India
[2] Indira Gandhi Delhi Tech Univ Women, Dept Name Org, New Delhi, India
Keywords
Acoustic modeling; speech recognition; systematic review; acoustic unit; MFCC; classification
DOI
10.14569/IJACSA.2020.0110455
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
This paper presents a systematic review of acoustic modeling (AM) techniques in speech recognition (SR). Acoustic modeling establishes the relationship between acoustic information and language constructs in SR. Over the past decades, researchers have presented studies addressing specific concerns in AM. However, previous work lacks a systematic and comprehensive review of acoustic modeling issues. This systematic review is introduced to understand those issues in speech recognition. The paper provides an extensive inspection of research performed since 1984. The investigation and analysis of AM draws on data from 73 research works, published between 1984 and 2020 and chosen through a screening process. The review process was divided into parts to investigate acoustic modeling issues. The main issues investigated were feature extraction techniques, acoustic modeling units, speech corpora, classification methods, tools used, language issues, and evaluation parameters. The study helps the reader understand these acoustic modeling issues in detail. The research outcomes depict research trends and shed light on new research topics in AM. The results of this review can be used to build a better speech recognition system by choosing a suitable acoustic modeling construct.
Pages: 397-412
Page count: 16
Related papers
50 in total
  • [21] Acoustic Modeling in Mandarin Speech Recognition of Minority Accent in Yunnan
    Wu Peishan
    Yang Jian
    [J]. PROCEEDINGS OF THE 27TH CHINESE CONTROL CONFERENCE, VOL 4, 2008, : 526 - 530
  • [22] Prosody-dependent Acoustic Modeling for Mandarin Speech Recognition
    Chiu, Tzu-Hsuan
    Chiang, Chen-Yu
    Liao, Yuan-Fu
    Yang, Jyh-Her
    Wang, Yih-Ru
    Chen, Sin-Horng
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II, 2012, : 139 - 142
  • [23] GMM-BASED ACOUSTIC MODELING FOR EMBEDDED SPEECH RECOGNITION
    Levy, Christophe
    Linares, Georges
    Bonastre, Jean-Francois
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1726 - 1729
  • [24] TRANSFORMER-BASED ACOUSTIC MODELING FOR HYBRID SPEECH RECOGNITION
    Wang, Yongqiang
    Mohamed, Abdelrahman
    Le, Duc
    Liu, Chunxi
    Xiao, Alex
    Mahadeokar, Jay
    Huang, Hongzhao
    Tjandra, Andros
    Zhang, Xiaohui
    Zhang, Frank
    Fuegen, Christian
    Zweig, Geoffrey
    Seltzer, Michael L.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6874 - 6878
  • [25] A study on acoustic modeling for speech recognition of predominantly monosyllabic languages
    Maneenoi, Ekkarit
    Ahkuputra, Visarut
    Luksaneeyanawin, Sudaporn
    Jitapunkul, Somchai
    [J]. IEICE Transactions on Information and Systems, 2004, E87-D (05) : 1146 - 1163
  • [26] Modeling the Temporal Evolution of Acoustic Parameters for Speech Emotion Recognition
    Ntalampiras, Stavros
    Fakotakis, Nikos
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2012, 3 (01) : 116 - 125
  • [27] Automatic Speech Emotion Recognition: a Systematic Literature Review
    Mustafa H.H.
    Darwish N.R.
    Hefny H.A.
    [J]. International Journal of Speech Technology, 2024, 27 (1) : 267 - 285
  • [28] A systematic review of speech recognition technology in health care
    Maree Johnson
    Samuel Lapkin
    Vanessa Long
    Paula Sanchez
    Hanna Suominen
    Jim Basilakis
    Linda Dawson
    [J]. BMC Medical Informatics and Decision Making, 14
  • [29] A systematic literature review of speech emotion recognition approaches
    Singh, Youddha Beer
    Goel, Shivani
    [J]. NEUROCOMPUTING, 2022, 492 : 245 - 263
  • [30] Urdu Speech Emotion Recognition: A Systematic Literature Review
    Taj, Soonh
    Mujtaba, Ghulam
    Daudpota, Sher Muhammad
    Mughal, Muhammad Hussain
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (07)