Augmented Context Features for Arabic Speech Recognition

被引:0
|
作者
Emami, Ahmad [1 ]
Kuo, Hong-Kwang J. [1 ]
Zitouni, Imed [1 ]
Mangu, Lidia [1 ]
机构
[1] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
language modeling; speech recognition; clustering; syntactic features; LANGUAGE;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We investigate different types of features for language modeling in Arabic automatic speech recognition. While much effort in language modeling research has been directed at designing better models or smoothing techniques for n-gram language models, in this paper we take the approach of augmenting the context in the n-gram model with different sources of information. We start by adding word class labels to the context. The word classes are automatically derived from un-annotated training data. As a contrast, we also experiment with POS tags which require a tagger trained on annotated data. An amalgam of these two methods uses class labels defined on word and POS tag combinations. Other context features include super-tags derived from the syntactic tree structure as well as semantic features derived from Prop Bank. Experiments on the DARPA GALE Arabic speech recognition task show that augmented context features often improve both perplexity and word error rate.
引用
收藏
页码:1832 / 1835
页数:4
相关论文
共 50 条
  • [21] Arabic Phonetic Dictionaries for Speech Recognition
    Ali, Mohamed
    Elshafei, Moustafa
    Al-Ghamdi, Mansour
    Al-Muhtaseb, Husni
    Al-Najjar, Atef
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2009, 2 (04) : 67 - 80
  • [22] Literature Survey of Arabic Speech Recognition
    Al-Anzi, Fawaz S.
    AbuZeina, Dia
    PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES AND ENGINEERING (ICCSE), 2018,
  • [23] Survey on Arabic speech emotion recognition
    Iben Nasr L.
    Masmoudi A.
    Hadrich Belguith L.
    International Journal of Speech Technology, 2024, 27 (01) : 53 - 68
  • [24] Arabic Speech Recognition: Advancement and Challenges
    Rahman, Ashifur
    Kabir, Md. Mohsin
    Mridha, M. F.
    Alatiyyah, Mohammed
    Alhasson, Haifa F.
    Alharbi, Shuaa S.
    IEEE ACCESS, 2024, 12 : 39689 - 39716
  • [25] Diacritics Effect on Arabic Speech Recognition
    Sa’ed Abed
    Mohammad Alshayeji
    Sari Sultan
    Arabian Journal for Science and Engineering, 2019, 44 : 9043 - 9056
  • [26] A Comparative Study of Arabic Speech Recognition
    Ali, Onsy Abdel Alim
    Moselhy, Mohamed M.
    Bzeih, Aya
    2012 16TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (MELECON), 2012, : 884 - 887
  • [27] An Investigation in Speech Recognition for Colloquial Arabic
    Al-Shareef, Sarah
    Hain, Thomas
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2880 - 2883
  • [28] Arabic speech synthesis and diacritic recognition
    Rebai, Ilyes
    BenAyed, Yassine
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (03) : 485 - 494
  • [29] Diacritics Effect on Arabic Speech Recognition
    Abed, Sa'ed
    Alshayeji, Mohammad
    Sultan, Sari
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (11) : 9043 - 9056
  • [30] Arabic Automatic Speech Recognition Enhancement
    Ahmed, Basem H. A.
    Ghabayen, Ayman S.
    2017 PALESTINIAN INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (PICICT), 2017, : 98 - 102