Augmented Context Features for Arabic Speech Recognition

Cited by: 0
Authors
Emami, Ahmad [1]
Kuo, Hong-Kwang J. [1]
Zitouni, Imed [1]
Mangu, Lidia [1]
Affiliations
[1] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
Keywords
language modeling; speech recognition; clustering; syntactic features; LANGUAGE;
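DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract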
We investigate different types of features for language modeling in Arabic automatic speech recognition. While much effort in language modeling research has been directed at designing better models or smoothing techniques for n-gram language models, in this paper we take the approach of augmenting the context in the n-gram model with different sources of information. We start by adding word class labels to the context. The word classes are automatically derived from unannotated training data. As a contrast, we also experiment with part-of-speech (POS) tags, which require a tagger trained on annotated data. An amalgam of these two methods uses class labels defined on word and POS tag combinations. Other context features include super-tags derived from the syntactic tree structure as well as semantic features derived from PropBank. Experiments on the DARPA GALE Arabic speech recognition task show that augmented context features often improve both perplexity and word error rate.
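As a rough illustration of what adding class labels to the n-gram context can mean, the sketch below interpolates a bigram estimate conditioned on the previous word with one conditioned on the previous word's class. It is not the authors' model: the linear interpolation, the weight `lam`, the function names, the hand-written class map, and the English toy data are all assumptions made for illustration. The abstract describes classes derived automatically from unannotated data, which this sketch does not attempt.

```python
from collections import Counter, defaultdict


def train(sentences, word_class):
    """Count next words under two contexts: previous word and its class."""
    word_ctx = defaultdict(Counter)   # previous word -> next-word counts
    class_ctx = defaultdict(Counter)  # class of previous word -> next-word counts
    for sent in sentences:
        tokens = ["<s>"] + sent
        for prev, cur in zip(tokens, tokens[1:]):
            word_ctx[prev][cur] += 1
            class_ctx[word_class.get(prev, "UNK")][cur] += 1
    return word_ctx, class_ctx


def rel_freq(counter, word):
    """Relative frequency of `word` in a Counter (0.0 if the Counter is empty)."""
    total = sum(counter.values())
    return counter[word] / total if total else 0.0


def prob(word, prev, word_ctx, class_ctx, word_class, lam=0.7):
    """Interpolate the word-context and class-context estimates.

    `lam` is a hypothetical interpolation weight, not a value from the paper.
    """
    p_word = rel_freq(word_ctx[prev], word)
    p_class = rel_freq(class_ctx[word_class.get(prev, "UNK")], word)
    return lam * p_word + (1 - lam) * p_class


# Toy data (hypothetical): the class map is written by hand here.
word_class = {
    "<s>": "BOS", "the": "DET", "a": "DET",
    "cat": "NOUN", "dog": "NOUN", "sleeps": "VERB", "barks": "VERB",
}
sentences = [
    ["the", "cat", "sleeps"],
    ["a", "dog", "sleeps"],
    ["the", "dog", "barks"],
]
word_ctx, class_ctx = train(sentences, word_class)

# "cat barks" never occurs in the toy data, but "dog barks" does, and both
# "cat" and "dog" are NOUN, so the class context gives it non-zero probability.
p_unseen = prob("barks", "cat", word_ctx, class_ctx, word_class)
p_seen = prob("sleeps", "cat", word_ctx, class_ctx, word_class)
```

In this toy setup the class context acts as a coarser history that shares statistics across words in the same class, which is one common motivation for class-based features in language modeling.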
Pages: 1832-1835
Page count: 4
Related papers
50 in total
  • [41] Generation of Arabic Phonetic Dictionaries for Speech Recognition
    Ali, Mohamed
    Elshafei, Moustafa
    Al-Ghamdi, Mansour
    Al-Muhtaseb, Husni
    Al-Najjar, Atef
    IIT: 2008 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION TECHNOLOGY, 2008, : 434 - +
  • [42] The impact of phonological rules on Arabic speech recognition
    Al-Anzi F.S.
    AbuZeina D.
    International Journal of Speech Technology, 2017, 20 (3) : 715 - 723
  • [43] Arabic Speech Recognition System based on CMUSphinx
    Satori, H.
    Harti, M.
    Chenfour, N.
    ISCIII '07: 3RD INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, PROCEEDINGS, 2007, : 31 - +
  • [44] Lexicon Free Arabic Speech Recognition Recipe
    Ahmed, Abdelrahman
    Hifny, Yasser
    Shaalan, Khaled
    Toral, Sergio
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 147 - 159
  • [45] Recognition of Arabic speech sound error in children
    Nacereddine Hammami
    Isah A. Lawal
    Mouldi Bedda
    Nadir Farah
    International Journal of Speech Technology, 2020, 23 : 705 - 711
  • [46] Bidirectional deep architecture for Arabic speech recognition
    Zerari, Naima
    Abdelhamid, Samir
    Bouzgou, Hassen
    Raymond, Christian
    OPEN COMPUTER SCIENCE, 2019, 9 (01) : 92 - 102
  • [47] Experiments on Automatic Recognition of Nonnative Arabic Speech
    Alotaibi, Yousef Ajami
    Selouani, Sid-Ahmed
    O'Shaughnessy, Douglas
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2008, 2008 (1)
  • [48] Arabic speech recognition using SPHINX engine
    Hyassat, Hussein
    Abu Zitar, Raed
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2006, 9 (3-4) : 133 - 150
  • [49] Experiments on Automatic Recognition of Nonnative Arabic Speech
    Yousef Ajami Alotaibi
    Sid-Ahmed Selouani
    Douglas O'Shaughnessy
    EURASIP Journal on Audio, Speech, and Music Processing, 2008
  • [50] Arabic corpus Implementation: Application to Speech Recognition
    Helali, Wafa
    Hajaiej, Zied
    Cherif, Adnane
    2018 INTERNATIONAL CONFERENCE ON ADVANCED SYSTEMS AND ELECTRICAL TECHNOLOGIES (IC_ASET), 2017, : 50 - 53