Cooperative supervised and unsupervised learning algorithm for phoneme recognition in continuous speech and speaker-independent context

Cited by: 4

Authors:

Arous, N [1]

Ellouze, N [1]

Affiliations:

[1] Ecole Natl Ingn Tunis, Grp Reconnaissance Vocale, Unite Rech Signal Image Reconnaissance Formes, Tunis 1002, Tunisia

Source:

NEUROCOMPUTING | 2003 / Vol. 51

Keywords:

neural network; supervised learning; unsupervised learning; self-organizing map; continuous speech recognition;

DOI:

10.1016/S0925-2312(02)00618-5

Chinese Library Classification (CLC):

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Neural networks have traditionally been considered an alternative approach to pattern recognition in general, and speech recognition in particular. There has been much success in practical pattern recognition applications using neural networks, including multi-layer perceptrons, radial basis functions, and self-organizing maps (SOMs). In this paper, we propose a system of SOMs based on the association of some supervised and unsupervised learning algorithms inherited from the most popular neural network in the unsupervised learning category, the SOM. The case study of the proposed system of SOMs is phoneme recognition in continuous speech and a speaker-independent context. We also propose a way to save more information during the training phase of a Kohonen map, with the aim of improving speech recognition accuracy. The applied SOM variants serve as tools for developing intelligent systems and pursuing artificial intelligence applications. © 2002 Elsevier Science B.V. All rights reserved.


Pages: 225 - 235

Page count: 11

50 records in total

[31] Arabic Speaker-Independent Continuous Automatic Speech Recognition Based on a Phonetically Rich and Balanced Speech Corpus
Abushariah, Mohammad
Ainon, Raja Noor
Zainuddin, Roziati
Elshafei, Moustafa
Khalifa, Othman
[J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2012, 9 (01) : 84 - 93
[32] Acoustic-phonetic speech parameters for speaker-independent speech recognition
Deshmukh, O
Espy-Wilson, CY
Juneja, A
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002 : 593 - 596
[33] Speaker-independent Speech Emotion Recognition Based on Random Forest Feature Selection Algorithm
Cao, Wei-Hua
Xu, Jian-Ping
Liu, Zhen-Tao
[J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017 : 10995 - 10998
[34] Across-speaker Articulatory Normalization for Speaker-independent Silent Speech Recognition
Wang, Jun
Samal, Ashok
Green, Jordan R.
[J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014 : 1179 - 1183
[35] An automatic speech recognition system with speaker-independent identification support
Caranica, Alexandru
Burileanu, Corneliu
[J]. ADVANCED TOPICS IN OPTOELECTRONICS, MICROELECTRONICS, AND NANOTECHNOLOGIES VII, 2015, 9258
[36] LOW-LATENCY SPEAKER-INDEPENDENT CONTINUOUS SPEECH SEPARATION
Yoshioka, Takuya
Chen, Zhuo
Liu, Changliang
Xiao, Xiong
Erdogan, Hakan
Dimitriadis, Dimitrios
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019 : 6980 - 6984
[37] A SPEAKER-INDEPENDENT SPEECH RECOGNITION SYSTEM FOR TELEPHONE NETWORK APPLICATIONS
TRNKA, R
[J]. REVUE TECHNIQUE THOMSON-CSF, 1984, 16 (04) : 847 - 861
[38] Speaker-independent telephone speech recognition system: the VCS TeleRec
Hunt, Alan
[J]. Speech technology, 1988, 4 (02) : 80 - 82
[39] Speaker-independent speech recognition based on tree-structured speaker clustering
Kosaka, T
Matsunaga, S
Sagayama, S
[J]. COMPUTER SPEECH AND LANGUAGE, 1996, 10 (01) : 55 - 74
[40] Speaker Adversarial Neural Network (SANN) for Speaker-independent Speech Emotion Recognition
Md Shah Fahad
Ashish Ranjan
Akshay Deepak
Gayadhar Pradhan
[J]. Circuits, Systems, and Signal Processing, 2022, 41 : 6113 - 6135
