A neural network model for spoken word recognition

被引:0
|
作者
Tsai, HL
Lee, SJ
机构
来源
SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION | 1997年
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a neural approach to the speaker-independent word recognition, based on the algorithms of dynamic time warping(DTW) [8, 7] and fuzzy ARTMAP [5, 4]. DTW has some drawbacks: (1) It is space and time consuming for a large set of training patterns. (2) It gives an equal importance to each frame of a pattern. To obtain a better performance, the training patterns need to be prefiltered by human experts. Our approach attempts to address these shortcomings of DTW. We use a modified Fuzzy ARTMAP to be the framework of our approach. Our architecture is a four-layer sequential neural network. Our training algorithm and recalling algorithm are similar to fuzzy ARTMAP. However, our neural approach is a sequential algorithm. Experiments on the recognition of English alphabets have been performed. The recognition rates obtained by our approach and DTW are 87% and 80%, respectively, while memory space used in our approach is two or three times smaller than that used in DTW. Furthermore, prefiltering on training patterns is not required.
引用
收藏
页码:4029 / 4034
页数:6
相关论文
共 50 条
  • [21] A study on user defined spoken wake-up word recognition system using deep neural network-hidden Markov model hybrid model
    Yoon, Ki-mu
    Kim, Wooil
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (02): : 131 - 136
  • [22] ISOLATED WORD SPEECH RECOGNITION USING A NEURAL NETWORK BASED SOURCE MODEL
    LEE, GE
    TATTERSALL, GD
    SMYTH, SG
    BT TECHNOLOGY JOURNAL, 1992, 10 (03): : 38 - 47
  • [23] RECURRENT NEURAL NETWORK LANGUAGE MODEL WITH STRUCTURED WORD EMBEDDINGS FOR SPEECH RECOGNITION
    He, Tianxing
    Xiang, Xu
    Qian, Yanmin
    Yu, Kai
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5396 - 5400
  • [24] Amharic spoken digits recognition using convolutional neural network
    Ayall, Tewodros Alemu
    Zhou, Changjun
    Liu, Huawen
    Brhanemeskel, Getnet Mezgebu
    Abate, Solomon Teferra
    Adjeisah, Michael
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [25] Learnable axonal delay in spiking neural networks improves spoken word recognition
    Sun, Pengfei
    Chua, Yansong
    Devos, Paul
    Botteldooren, Dick
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [26] Spoken word recognition without a TRACE
    Hannagan, Thomas
    Magnuson, James S.
    Grainger, Jonathan
    FRONTIERS IN PSYCHOLOGY, 2013, 4
  • [27] Models of spoken-word recognition
    Weber, Andrea
    Scharenborg, Odette
    WILEY INTERDISCIPLINARY REVIEWS-COGNITIVE SCIENCE, 2012, 3 (03) : 387 - 401
  • [28] The dynamics of spoken word recognition in bilinguals
    Desroches, Amy S.
    Friesen, Deanna C.
    Teles, Matthew
    Korade, Chloe A.
    Forest, Evan W.
    BILINGUALISM-LANGUAGE AND COGNITION, 2022, 25 (04) : 705 - 710
  • [29] Orthographic involvement in spoken word recognition
    Taft, M
    AUSTRALIAN JOURNAL OF PSYCHOLOGY, 2004, 56 : 138 - 138
  • [30] Similarity mapping in spoken word recognition
    Connine, CM
    Titone, D
    Deelman, T
    Blasko, D
    JOURNAL OF MEMORY AND LANGUAGE, 1997, 37 (04) : 463 - 480