Acoustic Modeling in Speech Recognition: A Systematic Review

被引：0

作者：

Bhatt, Shobha ^{[1
]}

Jain, Anurag ^{[1
]}

Dev, Amita ^{[2
]}

机构：

[1] Guru Gobind Singh Indraprastha Univ GGSIPU, Univ Sch Informat & Commun Technol, New Delhi, India

[2] Indira Gandhi Delhi Tech Univ Women, Dept Name Org, New Delhi, India

来源：

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS | 2020年 / 11卷 / 04期

关键词：

Acoustic modeling; speech recognition; systematic review; acoustic unit; MFCC; classification;

D O I：

10.14569/IJACSA.2020.0110455

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The paper presents a systematic review of acoustic modeling (AM) techniques in speech recognition(SR). Acoustic modeling establishes a relationship between acoustic information and language construct in SR. Over the past decades, researchers presented studies addressing specific concerns in AM. However, all previous research works lack a systematic and comprehensive review of acoustic modeling issues. A systematic review is introduced to understand the acoustic modeling issues in speech recognition. This paper provides an extensive and comprehensive inspection of various researches that have been performed since 1984. The extensive investigation and analysis into AM was performed by getting the relevant data from 73 research works chose after the screening process between the years from 1984 to 2020. The systematic review process was divided into different parts to investigate acoustic modeling issues. Main issues in acoustic modeling such as feature extraction techniques, acoustic modeling units, speech corpora, classification methods, different tools used, language issues applied, and evaluation parameters were investigated. This study helps the reader to understand various acoustic modeling issues with comprehensive details. The research outcomes presented in this study depict research trends and shed light on new research topics in AM. The result of this review can be used to build a better speech recognition system by choosing a suitable acoustic modeling construct in SR.

引用

页码：397 / 412

页数：16

共 50 条

[21] Acoustic Modeling in Mandarin Speech Recognition of Minority Accent in Yunnan
Wu Peishan
Yang Jian
[J]. PROCEEDINGS OF THE 27TH CHINESE CONTROL CONFERENCE, VOL 4, 2008, : 526 - 530
[22] Prosody-dependent Acoustic Modeling for Mandarin Speech Recognition
Chiu, Tzu-Hsuan
Chiang, Chen-Yu
Liao, Yuan-Fu
Yang, Jyh-Her
Wang, Yih-Ru
Chen, Sin-Horng
[J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II, 2012, : 139 - 142
[23] GMM-BASED ACOUSTIC MODELING FOR EMBEDDED SPEECH RECOGNITION
Levy, Christophe
Linares, Georges
Bonastre, Jean-Francois
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1726 - 1729
[24] TRANSFORMER-BASED ACOUSTIC MODELING FOR HYBRID SPEECH RECOGNITION
Wang, Yongqiang
Mohamed, Abdelrahman
Le, Duc
Liu, Chunxi
Xiao, Alex
Mahadeokar, Jay
Huang, Hongzhao
Tjandra, Andros
Zhang, Xiaohui
Zhang, Frank
Fuegen, Christian
Zweig, Geoffrey
Seltzer, Michael L.
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6874 - 6878
[25] A study on acoustic modeling for speech recognition of predominantly monosyllabic languages
Maneenoi, Ekkarit
Ahkuputra, Visarut
Luksaneeyanawin, Sudaporn
Jitapunkul, Somchai
[J]. IEICE Transactions on Information and Systems, 2004, E87-D (05) : 1146 - 1163
[26] Modeling the Temporal Evolution of Acoustic Parameters for Speech Emotion Recognition
Ntalampiras, Stavros
Fakotakis, Nikos
[J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2012, 3 (01) : 116 - 125
[27] Automatic Speech Emotion Recognition: a Systematic Literature Review
Mustafa H.H.
Darwish N.R.
Hefny H.A.
[J]. International Journal of Speech Technology, 2024, 27 (1) : 267 - 285
[28] A systematic review of speech recognition technology in health care
Maree Johnson
Samuel Lapkin
Vanessa Long
Paula Sanchez
Hanna Suominen
Jim Basilakis
Linda Dawson
[J]. BMC Medical Informatics and Decision Making, 14
[29] A systematic literature review of speech emotion recognition approaches
Singh, Youddha Beer
Goel, Shivani
[J]. NEUROCOMPUTING, 2022, 492 : 245 - 263
[30] Urdu Speech Emotion Recognition: A Systematic Literature Review
Taj, Soonh
Mujtaba, Ghulam
Daudpota, Sher Muhammad
Mughal, Muhammad Hussain
[J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (07)

← 1 2 3 4 5 →