A Fast Learning Method for Multilayer Perceptrons in Automatic Speech Recognition Systems

被引:4
|
作者
Cai, Chenghao [1 ]
Xu, Yanyan [2 ]
Ke, Dengfeng [3 ]
Su, Kaile [4 ]
机构
[1] Beijing Forestry Univ, Sch Technol, Beijing 100083, Peoples R China
[2] Beijing Forestry Univ, Sch Informat Sci & Technol, 35 Qinghua Dong Rd, Beijing 100083, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[4] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Qld 4111, Australia
基金
中国国家自然科学基金;
关键词
D O I
10.1155/2015/797083
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We propose a fast learning method for multilayer perceptrons (MLPs) on large vocabulary continuous speech recognition (LVCSR) tasks. A preadjusting strategy based on separation of training data and dynamic learning-rate with a cosine function is used to increase the accuracy of a stochastic initial MLP. Weight matrices of the preadjusted MLP are restructured by a method based on singular value decomposition (SVD), reducing the dimensionality of the MLP. A back propagation (BP) algorithm that fits the unfolded weight matrices is used to train the restructured MLP, reducing the time complexity of the learning process. Experimental results indicate that on LVCSR tasks, in comparison with the conventional learning method, this fast learning method can achieve a speedup of around 2.0 times with improvement on both the cross entropy loss and the frame accuracy. Moreover, it can achieve a speedup of approximately 3.5 times with only a little loss of the cross entropy loss and the frame accuracy. Since this method consumes less time and space than the conventional method, it is more suitable for robots which have limitations on hardware.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Propagation of Uncertainty through Multilayer Perceptrons for Robust Automatic Speech Recognition
    Astudillo, Ramon Fernandez
    da Silva Neto, Joao Paulo
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 468 - 471
  • [2] An approach for automatic speech recognition using multilayer perceptrons and acoustic segmentation
    Giurgiu, M
    Meciu, E
    [J]. COMBIO'96 - SUMMER WORKSHOP ON COMPUTATIONAL MODELLING, IMAGING AND VISUALIZATION IN BIOSCIENCES, 1996, 1996 (06): : 104 - 108
  • [3] Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition
    Atmaja, Bagus Tris
    Akagi, Masato
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 325 - 331
  • [4] A Hybrid Learning Method for Multilayer Perceptrons
    Zhon Meide Huang Wenhu Hong Jiarong (School of Astronautics)
    [J]. 哈尔滨工业大学学报, 1990, (03) : 52 - 61
  • [5] PERCEPTRONS AND MULTILAYER PERCEPTRONS IN SPEECH RECOGNITION - IMPROVEMENTS FROM TEMPORAL WARPING OF THE TRAINING MATERIAL
    KAMMERER, B
    KUPPER, W
    [J]. NEURAL NETWORKS FROM MODELS TO APPLICATIONS, 1989, : 531 - 540
  • [6] Transfer Learning for Automatic Speech Recognition Systems
    Asefisaray, Behnam
    Haznedaroglu, Ali
    Erden, Mustafa
    Arslan, Levent M.
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [7] Understanding the Representation and Computation of Multilayer Perceptrons: A Case Study in Speech Recognition
    Nagamine, Tasha
    Mesgarani, Nima
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [8] Learning name pronunciations in automatic speech recognition systems
    Beaufays, F
    Sankar, A
    Williams, S
    Weintraub, M
    [J]. 15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, : 233 - 240
  • [9] Fast training of multilayer perceptrons
    Verma, B
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (06): : 1314 - 1320
  • [10] Automatic target recognition using modularly cascaded vector quantizers and multilayer perceptrons
    Chan, LCA
    Nasrabadi, NM
    Mirelli, V
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 3386 - 3389