Word recognition with a hierarchical neural network

被引:0
|
作者
Domont, Xavier [1 ]
Heckmann, Martin [1 ]
Wersing, Heiko [1 ]
Joublin, Frank [1 ]
Menzel, Stefan [1 ]
Sendhoff, Bernhard [1 ]
Goerick, Christian [1 ]
机构
[1] Honda Res Inst Europe GmbH, D-63073 Offenbach, Germany
来源
ADVANCES IN NONLINEAR SPEECH PROCESSING | 2007年 / 4885卷
关键词
speech recognition; robust features; feed-forward architecture;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a feedforward neural network for syllable recognition. The core of the recognition system is based on a hierarchical architecture initially developed for visual object recognition. We show that, given the similarities between the primary auditory and visual cortexes, such a system can successfully be used for speech recognition. Syllables are used as basic units for the recognition. Their spectrograms, computed using a Gammatone filterbank, are interpreted as images and subsequently feed into the neural network after a preprocessing step that enhances the formant frequencies and normalizes the length of the syllables. The performance of our system has been analyzed on the recognition of 25 different monosyllabic words. The parameters of the architecture have been optimized using an evolutionary strategy. Compared to the Sphinx-4 speech recognition system, our system achieves better robustness and generalization capabilities in noisy conditions.
引用
收藏
页码:142 / 151
页数:10
相关论文
共 50 条
  • [1] A PCMN NEURAL NETWORK FOR WORD RECOGNITION
    WANG, SR
    YE, HY
    ROBERT, F
    NEURAL NETWORKS FROM MODELS TO APPLICATIONS, 1989, : 513 - 522
  • [2] Effect of local excitation-inhibition ratio on word recognition in hierarchical spiking neural network
    Ye, Ting
    Wang, Jiang
    Li, Kai
    Gao, Tianshi
    Yi, Guosheng
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8380 - 8385
  • [3] A PCMN NEURAL NETWORK FOR ISOLATED WORD RECOGNITION
    YE, HY
    WANG, SG
    ROBERT, F
    SPEECH COMMUNICATION, 1990, 9 (02) : 141 - 153
  • [4] A neural network model for spoken word recognition
    Tsai, HL
    Lee, SJ
    SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 4029 - 4034
  • [5] Isolated Word Recognition Using Neural Network
    Masood, Sarfaraz
    Mehta, Madhav
    Namrata
    Rizvi, Danish Raza
    2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [6] RECOGNITION LEARNING BY A HIERARCHICAL ART NEURAL NETWORK
    CARPENTER, GA
    PERCEPTION, 1989, 18 (04) : 507 - 507
  • [7] Dynamic Hierarchical Bayesian Network for Arabic Handwritten Word Recognition
    Jayech, Khaoula
    Trimech, Nesrine
    Mahjoub, Mohamed Ali
    Ben Amara, Najoua Essoukri
    2013 FOURTH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY AND ACCESSIBILITY (ICTA), 2013,
  • [8] A hybrid neural network model in handwritten word recognition
    Chiang, JH
    NEURAL NETWORKS, 1998, 11 (02) : 337 - 346
  • [9] Neural - Network based measures of confidence for word recognition
    Weintraub, M
    Beaufays, F
    Rivlin, Z
    Konig, Y
    Stolcke, A
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 887 - 890
  • [10] Hierarchical Expert Neural Network System for Speech Recognition
    Rocha, Priscila
    Silva, Washington
    Barros, Allan
    JOURNAL OF CONTROL AUTOMATION AND ELECTRICAL SYSTEMS, 2019, 30 (03) : 347 - 359