Efficient Generation and Use of MLP Features for Arabic Speech Recognition

被引:0
|
作者
Park, J. [1 ]
Diehl, F. [1 ]
Gales, M. J. F. [1 ]
Tomalin, M. [1 ]
Woodland, P. C. [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
关键词
Arabic Speech Recognition; Multi-Layer Perceptron; Acoustic Modelling;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Front-end features computed using Multi-Layer Perceptrons (MLPs) have recently attracted much interest, but are a challenge to scale to large networks and very large training data sets. This paper discusses methods to reduce the training time for the generation of MLP features and their use in an ASR system using a variety of techniques: parallel training of a set of MLPs on different data sub-sets; methods for computing features from by a combination of these networks; and rapid discriminative training of HMMs using MLP-based features. The impact on MLP frame-based accuracy using different training strategies is discussed along with the effect on word rates from incorporating the MLP features in various configurations into an Arabic broadcast audio transcription system.
引用
收藏
页码:240 / 243
页数:4
相关论文
共 50 条
  • [31] The symmetric technique of formant transition generation for use in speech synthesis in Arabic
    Lamari Chegrani
    Guerti Mhania
    Boudraa Bachir
    International Journal of Information Technology, 2025, 17 (2) : 1235 - 1245
  • [32] Phonotactic Language Recognition Using MLP Features
    BenZeghiba, Mohamed Faouzi
    Gauvain, Jean-Luc
    Lamel, Lori
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2039 - 2042
  • [33] THE USE OF VOICE SOURCE FEATURES FOR SUNG SPEECH RECOGNITION
    Dabike, Gerardo Roa
    Barker, Jon
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6513 - 6517
  • [34] Automatic recognition of Arabic dysarthric speech
    Tolba, Hesham M.
    El-Torgoman, Ahmed S.
    AEJ - Alexandria Engineering Journal, 2010, 49 (02): : 131 - 138
  • [35] Arabic Phonetic Dictionaries for Speech Recognition
    Ali, Mohamed
    Elshafei, Moustafa
    Al-Ghamdi, Mansour
    Al-Muhtaseb, Husni
    Al-Najjar, Atef
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2009, 2 (04) : 67 - 80
  • [36] Literature Survey of Arabic Speech Recognition
    Al-Anzi, Fawaz S.
    AbuZeina, Dia
    PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES AND ENGINEERING (ICCSE), 2018,
  • [37] Survey on Arabic speech emotion recognition
    Iben Nasr L.
    Masmoudi A.
    Hadrich Belguith L.
    International Journal of Speech Technology, 2024, 27 (01) : 53 - 68
  • [38] Arabic Speech Recognition: Advancement and Challenges
    Rahman, Ashifur
    Kabir, Md. Mohsin
    Mridha, M. F.
    Alatiyyah, Mohammed
    Alhasson, Haifa F.
    Alharbi, Shuaa S.
    IEEE ACCESS, 2024, 12 : 39689 - 39716
  • [39] Diacritics Effect on Arabic Speech Recognition
    Sa’ed Abed
    Mohammad Alshayeji
    Sari Sultan
    Arabian Journal for Science and Engineering, 2019, 44 : 9043 - 9056
  • [40] A Comparative Study of Arabic Speech Recognition
    Ali, Onsy Abdel Alim
    Moselhy, Mohamed M.
    Bzeih, Aya
    2012 16TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (MELECON), 2012, : 884 - 887