Efficient Generation and Use of MLP Features for Arabic Speech Recognition

Authors
Park, J. [1]
Diehl, F. [1]
Gales, M. J. F. [1]
Tomalin, M. [1]
Woodland, P. C. [1]
Affiliation
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
Keywords
Arabic Speech Recognition; Multi-Layer Perceptron; Acoustic Modelling
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Front-end features computed using Multi-Layer Perceptrons (MLPs) have recently attracted much interest, but they are difficult to scale to large networks and very large training data sets. This paper discusses methods to reduce the training time needed to generate MLP features and to use them in an ASR system. Three techniques are covered: parallel training of a set of MLPs on different data sub-sets; methods for computing features from a combination of these networks; and rapid discriminative training of HMMs using MLP-based features. The impact of the different training strategies on MLP frame-based accuracy is discussed, along with the effect on word error rates of incorporating the MLP features in various configurations into an Arabic broadcast audio transcription system.
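The abstract does not say how the outputs of the separately trained networks are combined. As an illustration only, the sketch below assumes the simplest option: each network emits per-frame phone posteriors, which are averaged across networks and then passed through a floored logarithm, a step commonly used when MLP outputs feed a Gaussian-based HMM system. The function names, the floor value, and the example numbers are all hypothetical and are not taken from the paper.

```python
import math


def combine_posteriors(per_network_posteriors):
    """Arithmetic mean of one frame's posteriors across several networks.

    Assumption: every network outputs a distribution over the same classes.
    """
    n_nets = len(per_network_posteriors)
    n_classes = len(per_network_posteriors[0])
    return [
        sum(net[c] for net in per_network_posteriors) / n_nets
        for c in range(n_classes)
    ]


def log_features(posteriors, floor=1e-10):
    """Floored log of the posteriors; the floor avoids log(0)."""
    return [math.log(max(p, floor)) for p in posteriors]


# Hypothetical outputs of three MLPs, each trained on a different data
# sub-set, for a single frame over three phone classes.
frame_outputs = [
    [0.7, 0.2, 0.1],
    [0.6, 0.3, 0.1],
    [0.8, 0.1, 0.1],
]

combined = combine_posteriors(frame_outputs)
features = log_features(combined)
```

Averaging keeps the result a valid probability distribution, which is one reason it is a convenient baseline; other combination rules, such as a weighted or geometric mean, would slot into `combine_posteriors` without changing the rest of the sketch.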
Pages: 240-243
Number of pages: 4
Related papers
  • [1] TRAINING AND ADAPTING MLP FEATURES FOR ARABIC SPEECH RECOGNITION
    Park, J.
    Diehl, F.
    Gales, M. J. F.
    Tomalin, M.
    Woodland, P. C.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 2009, : 4461 - 4464
  • [2] The efficient incorporation of MLP features into automatic speech recognition systems
    Park, J.
    Diehl, F.
    Gales, M. J. F.
    Tomalin, M.
    Woodland, P. C.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2011, 25 (03) : 519 - 534
  • [3] Syntactic Features for Arabic Speech Recognition
    Kuo, Hong-Kwang Jeff
    Mangu, Lidia
    Emami, Ahmad
    Zitouni, Imed
    Lee, Young-Suk
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 327 - 332
  • [4] Region Dependent Transform on MLP Features for Speech Recognition
    Ng, Tim
    Zhang, Bing
    Matsoukas, Spyros
    Long Nguyen
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 228 - 231
  • [5] Augmented Context Features for Arabic Speech Recognition
    Emami, Ahmad
    Kuo, Hong-Kwang J.
    Zitouni, Imed
    Mangu, Lidia
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1832 - 1835
  • [6] MORPHOLOGICAL AND SYNTACTIC FEATURES FOR ARABIC SPEECH RECOGNITION
    Kuo, Hong-Kwang Jeff
    Mangu, Lidia
    Emami, Ahmad
    Zitouni, Imed
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5190 - 5193
  • [7] Speech Emotion Recognition Based on Arabic Features
    Meddeb, Mohamed
    Karray, Hichem
    Alimi, Adel M.
    [J]. 2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 46 - 51
  • [8] Generation of Arabic Phonetic Dictionaries for Speech Recognition
    Ali, Mohamed
    Elshafei, Moustafa
    Al-Ghamdi, Mansour
    Al-Muhtaseb, Husni
    Al-Najjar, Atef
    [J]. IIT: 2008 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION TECHNOLOGY, 2008, : 434 - +
  • [9] A Canonicalization of Distinctive Phonetic Features to Improve Arabic Speech Recognition
    Alotaibi, Yousef A.
    Selouani, Sidh-Amed
    Yakoub, Mohammed Sidi
    Seddiq, Yasser Mohammed
    Meftah, Ali
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2019, 105 (06) : 1269 - 1277
  • [10] Use of Different Features for Emotion Recognition Using MLP Network
    Palo, H. K.
    Mohanty, Mihir Narayana
    Chandra, Mahesh
    [J]. COMPUTATIONAL VISION AND ROBOTICS, 2015, 332 : 7 - 15