Efficient Generation and Use of MLP Features for Arabic Speech Recognition

Cited by: 0
Authors
Park, J. [1]
Diehl, F. [1]
Gales, M. J. F. [1]
Tomalin, M. [1]
Woodland, P. C. [1]
Affiliations
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
Keywords
Arabic Speech Recognition; Multi-Layer Perceptron; Acoustic Modelling
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Front-end features computed using Multi-Layer Perceptrons (MLPs) have recently attracted much interest, but are a challenge to scale to large networks and very large training data sets. This paper discusses methods to reduce the training time for the generation of MLP features and their use in an ASR system, using a variety of techniques: parallel training of a set of MLPs on different data subsets; methods for computing features from a combination of these networks; and rapid discriminative training of HMMs using MLP-based features. The impact of different training strategies on MLP frame-based accuracy is discussed, along with the effect on word error rates of incorporating the MLP features in various configurations into an Arabic broadcast audio transcription system.
Pages: 240-243
Page count: 4