Cooperative supervised and unsupervised learning algorithm for phoneme recognition in continuous speech and speaker-independent context

被引:4
|
作者
Arous, N [1 ]
Ellouze, N [1 ]
机构
[1] Ecole Natl Ingn Tunis, Grp Reconnaissance Vocale, Unite Rech Signal Image Reconnaissance Formes, Tunis 1002, Tunisia
关键词
neural network; supervised learning; unsupervised learning; self-organizing map; continuous speech recognition;
D O I
10.1016/S0925-2312(02)00618-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks have been traditionally considered as an alternative approach to pattern recognition in general, and speech recognition in particular. There have been much success in practical pattern recognition applications using neural networks including multi-layer perceptions, radial basis functions, and self-organizing maps (SOMs). In this paper, we propose a system of SOMs based on the association of some supervised and unsupervised learning algorithms inherited from the most popular neural network in the unsupervised learning category, SOM. The case study of the proposed system of SOMs is phoneme recognition in continuous speech and speaker independent context. Also, we propose a way to save more information during training phase of a Kohonen map in the objective to ameliorate speech recognition accuracy. The applied SOM variants serve as tools for developing intelligent systems and pursuing artificial intelligence applications. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:225 / 235
页数:11
相关论文
共 50 条
  • [31] Arabic Speaker-Independent Continuous Automatic Speech Recognition Based on a Phonetically Rich and Balanced Speech Corpus
    Abushariah, Mohammad
    Ainon, Raja Noor
    Zainuddin, Roziati
    Elshafei, Moustafa
    Khalifa, Othman
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2012, 9 (01) : 84 - 93
  • [32] Acoustic-phonetic speech parameters for speaker-independent speech recognition
    Deshmukh, O
    Espy-Wilson, CY
    Juneja, A
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 593 - 596
  • [33] Speaker-independent Speech Emotion Recognition Based on Random Forest Feature Selection Algorithm
    Cao, Wei-Hua
    Xu, Jian-Ping
    Liu, Zhen-Tao
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 10995 - 10998
  • [34] Across-speaker Articulatory Normalization for Speaker-independent Silent Speech Recognition
    Wang, Jun
    Samal, Ashok
    Green, Jordan R.
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1179 - 1183
  • [35] An automatic speech recognition system with speaker-independent identification support
    Caranica, Alexandru
    Burileanu, Corneliu
    [J]. ADVANCED TOPICS IN OPTOELECTRONICS, MICROELECTRONICS, AND NANOTECHNOLOGIES VII, 2015, 9258
  • [36] LOW-LATENCY SPEAKER-INDEPENDENT CONTINUOUS SPEECH SEPARATION
    Yoshioka, Takuya
    Chen, Zhuo
    Liu, Changliang
    Xiao, Xiong
    Erdogan, Hakan
    Dimitriadis, Dimitrios
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6980 - 6984
  • [37] A SPEAKER-INDEPENDENT SPEECH RECOGNITION SYSTEM FOR TELEPHONE NETWORK APPLICATIONS
    TRNKA, R
    [J]. REVUE TECHNIQUE THOMSON-CSF, 1984, 16 (04): : 847 - 861
  • [38] Speaker-independent telephone speech recognition system: the VCS TeleRec
    Hunt, Alan
    [J]. Speech technology, 1988, 4 (02): : 80 - 82
  • [39] Speaker-independent speech recognition based on tree-structured speaker clustering
    Kosaka, T
    Matsunaga, S
    Sagayama, S
    [J]. COMPUTER SPEECH AND LANGUAGE, 1996, 10 (01): : 55 - 74
  • [40] Speaker Adversarial Neural Network (SANN) for Speaker-independent Speech Emotion Recognition
    Md Shah Fahad
    Ashish Ranjan
    Akshay Deepak
    Gayadhar Pradhan
    [J]. Circuits, Systems, and Signal Processing, 2022, 41 : 6113 - 6135