Augmented Context Features for Arabic Speech Recognition

Cited by: 0
Authors
Emami, Ahmad [1]
Kuo, Hong-Kwang J. [1]
Zitouni, Imed [1]
Mangu, Lidia [1]
Affiliations
[1] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
Keywords
language modeling; speech recognition; clustering; syntactic features; LANGUAGE;
DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808; 0809;
Abstract
We investigate different types of features for language modeling in Arabic automatic speech recognition. While much effort in language modeling research has been directed at designing better models or smoothing techniques for n-gram language models, in this paper we take the approach of augmenting the context in the n-gram model with different sources of information. We start by adding word class labels to the context. The word classes are automatically derived from unannotated training data. As a contrast, we also experiment with part-of-speech (POS) tags, which require a tagger trained on annotated data. An amalgam of these two methods uses class labels defined on word and POS tag combinations. Other context features include super-tags derived from the syntactic tree structure as well as semantic features derived from PropBank. Experiments on the DARPA GALE Arabic speech recognition task show that augmented context features often improve both perplexity and word error rate.
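The abstract does not specify the model itself, so the following is only a minimal sketch of the general idea of adding a class label to the n-gram context. It assumes a bigram model whose word-context estimate is interpolated with an estimate conditioned on the class of the previous word. The toy corpus, the hand-assigned `word_class` mapping, the `prob` function, and the weight `lam` are all hypothetical and are not taken from the paper, where the classes are derived automatically.

```python
from collections import Counter

# Hypothetical toy data: the corpus and the hand-assigned word classes are
# illustrative assumptions, not taken from the paper.
corpus = [
    "the cat sat".split(),
    "the dog sat".split(),
    "a cat ran".split(),
]
word_class = {"the": "DET", "a": "DET", "cat": "NOUN", "dog": "NOUN",
              "sat": "VERB", "ran": "VERB"}

word_ctx = Counter()   # counts of (previous word, current word)
class_ctx = Counter()  # counts of (class of previous word, current word)
word_tot = Counter()   # counts of each previous word
class_tot = Counter()  # counts of each previous-word class

for sent in corpus:
    for prev, cur in zip(sent, sent[1:]):
        c = word_class[prev]
        word_ctx[(prev, cur)] += 1
        class_ctx[(c, cur)] += 1
        word_tot[prev] += 1
        class_tot[c] += 1

def prob(cur, prev, lam=0.5):
    """Interpolate a word-context estimate with a class-context estimate."""
    c = word_class[prev]
    p_word = word_ctx[(prev, cur)] / word_tot[prev] if word_tot[prev] else 0.0
    p_class = class_ctx[(c, cur)] / class_tot[c] if class_tot[c] else 0.0
    return lam * p_word + (1 - lam) * p_class

# "dog ran" never occurs in the corpus, but "cat ran" does, and both
# previous words share the class NOUN, so the class context gives it mass.
print(prob("ran", "dog"))
```

In this sketch the unseen bigram "dog ran" receives a nonzero probability only through the class-conditioned term, which is one way a class label in the context can help with sparse word histories.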
Pages: 1832-1835
Number of pages: 4
Related papers
50 in total
  • [1] Syntactic Features for Arabic Speech Recognition
    Kuo, Hong-Kwang Jeff
    Mangu, Lidia
    Emami, Ahmad
    Zitouni, Imed
    Lee, Young-Suk
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 327 - 332
  • [2] MORPHOLOGICAL AND SYNTACTIC FEATURES FOR ARABIC SPEECH RECOGNITION
    Kuo, Hong-Kwang Jeff
    Mangu, Lidia
    Emami, Ahmad
    Zitouni, Imed
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5190 - 5193
  • [3] Speech Emotion Recognition Based on Arabic Features
    Meddeb, Mohamed
    Karray, Hichem
    Alimi, Adel M.
    2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 46 - 51
  • [4] TRAINING AND ADAPTING MLP FEATURES FOR ARABIC SPEECH RECOGNITION
    Park, J.
    Diehl, F.
    Gales, M. J. F.
    Tomalin, M.
    Woodland, P. C.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4461 - 4464
  • [5] A Canonicalization of Distinctive Phonetic Features to Improve Arabic Speech Recognition
    Alotaibi, Yousef A.
    Selouani, Sidh-Amed
    Yakoub, Mohammed Sidi
    Seddiq, Yasser Mohammed
    Meftah, Ali
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2019, 105 (06) : 1269 - 1277
  • [6] Efficient Generation and Use of MLP Features for Arabic Speech Recognition
    Park, J.
    Diehl, F.
    Gales, M. J. F.
    Tomalin, M.
    Woodland, P. C.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 240 - 243
  • [7] VISUAL FEATURES FOR CONTEXT-AWARE SPEECH RECOGNITION
    Gupta, Abhinav
    Miao, Yajie
    Neves, Leonardo
    Metze, Florian
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5020 - 5024
  • [8] Integration of Auxiliary Features in Hidden Markov Models for Arabic Speech Recognition
    Amrous, Anissa Imen
    Debyeche, Mohamed
    Amrouche, A.
    2009 3RD INTERNATIONAL CONFERENCE ON SIGNALS, CIRCUITS AND SYSTEMS (SCS 2009), 2009, : 612 - 616
  • [9] Prosodic Features and Formant Contribution for Arabic Speech Recognition in Noisy Environments
    Amrous, Anissa Imen
    Debyeche, Mohamed
    Amrouche, Abderrahman
    SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS, 6TH INTERNATIONAL CONFERENCE SOCO 2011, 2011, 87 : 465 - 474
  • [10] Speaker-Dependent Bottleneck Features for Egyptian Arabic Speech Recognition
    Romanenko, Aleksei
    Mendelev, Valentin
    SPEECH AND COMPUTER, 2016, 9811 : 620 - 626