A neural network model for spoken word recognition

Cited by: 0

Authors:

Tsai, HL

Lee, SJ

Source:

SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION | 1997

DOI:

Not available

Chinese Library Classification number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

In this paper, we propose a neural approach to speaker-independent word recognition, based on the dynamic time warping (DTW) [8, 7] and fuzzy ARTMAP [5, 4] algorithms. DTW has some drawbacks: (1) it is space- and time-consuming for a large set of training patterns; (2) it gives equal importance to each frame of a pattern. To obtain better performance, the training patterns need to be prefiltered by human experts. Our approach attempts to address these shortcomings of DTW. We use a modified fuzzy ARTMAP as the framework of our approach. Our architecture is a four-layer sequential neural network. Our training and recall algorithms are similar to those of fuzzy ARTMAP. However, our neural approach is a sequential algorithm. Experiments on the recognition of the letters of the English alphabet have been performed. The recognition rates obtained by our approach and DTW are 87% and 80%, respectively, while the memory space used by our approach is two to three times smaller than that used by DTW. Furthermore, prefiltering of training patterns is not required.
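For readers unfamiliar with the DTW baseline that the abstract compares against, the following is a minimal sketch of template matching with classic DTW. It is not the authors' implementation: the function names (`frame_distance`, `dtw_distance`, `classify`), the Euclidean frame distance, and the unconstrained three-way step pattern are all assumptions chosen for illustration. The sketch makes the two drawbacks named in the abstract visible: every training template is stored and compared against each input, and every frame contributes to the cost with the same weight.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two feature frames (assumed frame metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distance(seq_a, seq_b):
    """Accumulated cost of the best warping path between two frame sequences."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    # cost[i][j] = best cost of aligning the first i frames of seq_a
    # with the first j frames of seq_b.
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(seq_a[i - 1], seq_b[j - 1])
            # Every frame pair is weighted equally (drawback 2 in the abstract).
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def classify(utterance, templates):
    """Return the label of the stored template nearest to the utterance.

    templates is a list of (label, frames) pairs. All templates are kept
    and scanned, so memory and time grow with the training set
    (drawback 1 in the abstract).
    """
    best_label, _ = min(templates, key=lambda t: dtw_distance(utterance, t[1]))
    return best_label
```

A usage example with toy one-dimensional frames: given `templates = [("a", [[0], [1], [2]]), ("b", [[5], [5], [5]])]`, calling `classify([[0], [0], [1], [2]], templates)` selects `"a"`, because the repeated first frame is absorbed by the warping path at no extra cost.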

Pages: 4029-4034

Page count: 6

50 records in total

[21] A study on user defined spoken wake-up word recognition system using deep neural network-hidden Markov model hybrid model
Yoon, Ki-mu
Kim, Wooil
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (02): 131-136
[22] ISOLATED WORD SPEECH RECOGNITION USING A NEURAL NETWORK BASED SOURCE MODEL
LEE, GE
TATTERSALL, GD
SMYTH, SG
BT TECHNOLOGY JOURNAL, 1992, 10 (03): 38-47
[23] RECURRENT NEURAL NETWORK LANGUAGE MODEL WITH STRUCTURED WORD EMBEDDINGS FOR SPEECH RECOGNITION
He, Tianxing
Xiang, Xu
Qian, Yanmin
Yu, Kai
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015: 5396-5400
[24] Amharic spoken digits recognition using convolutional neural network
Ayall, Tewodros Alemu
Zhou, Changjun
Liu, Huawen
Brhanemeskel, Getnet Mezgebu
Abate, Solomon Teferra
Adjeisah, Michael
JOURNAL OF BIG DATA, 2024, 11 (01)
[25] Learnable axonal delay in spiking neural networks improves spoken word recognition
Sun, Pengfei
Chua, Yansong
Devos, Paul
Botteldooren, Dick
FRONTIERS IN NEUROSCIENCE, 2023, 17
[26] Spoken word recognition without a TRACE
Hannagan, Thomas
Magnuson, James S.
Grainger, Jonathan
FRONTIERS IN PSYCHOLOGY, 2013, 4
[27] Models of spoken-word recognition
Weber, Andrea
Scharenborg, Odette
WILEY INTERDISCIPLINARY REVIEWS-COGNITIVE SCIENCE, 2012, 3 (03): 387-401
[28] The dynamics of spoken word recognition in bilinguals
Desroches, Amy S.
Friesen, Deanna C.
Teles, Matthew
Korade, Chloe A.
Forest, Evan W.
BILINGUALISM-LANGUAGE AND COGNITION, 2022, 25 (04): 705-710
[29] Orthographic involvement in spoken word recognition
Taft, M
AUSTRALIAN JOURNAL OF PSYCHOLOGY, 2004, 56: 138
[30] Similarity mapping in spoken word recognition
Connine, CM
Titone, D
Deelman, T
Blasko, D
JOURNAL OF MEMORY AND LANGUAGE, 1997, 37 (04): 463-480
