Syntactic Features for Arabic Speech Recognition

被引:10
|
作者
Kuo, Hong-Kwang Jeff [1 ]
Mangu, Lidia [1 ]
Emami, Ahmad [1 ]
Zitouni, Imed [1 ]
Lee, Young-Suk [1 ]
机构
[1] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
LANGUAGE;
D O I
10.1109/ASRU.2009.5373470
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We report word error rate improvements with syntactic features using a neural probabilistic language model through N-best re-scoring. The syntactic features we use include exposed head words and their non-terminal labels both before and after the predicted word. Neural network LMs generalize better to unseen events by modeling words and other context features in continuous space. They are suitable for incorporating many different types of features, including syntactic features, where there is no pre-defined back-off order. We choose an N-best re-scoring framework to be able to take full advantage of the complete parse tree of the entire sentence. Using syntactic features, along with morphological features, improves the word error rate (WER) by up to 5.5% relative, from 9.4% to 8.6%, on the latest GALE evaluation test set.
引用
收藏
页码:327 / 332
页数:6
相关论文
共 50 条
  • [1] MORPHOLOGICAL AND SYNTACTIC FEATURES FOR ARABIC SPEECH RECOGNITION
    Kuo, Hong-Kwang Jeff
    Mangu, Lidia
    Emami, Ahmad
    Zitouni, Imed
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5190 - 5193
  • [2] Augmented Context Features for Arabic Speech Recognition
    Emami, Ahmad
    Kuo, Hong-Kwang J.
    Zitouni, Imed
    Mangu, Lidia
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1832 - 1835
  • [3] Speech Emotion Recognition Based on Arabic Features
    Meddeb, Mohamed
    Karray, Hichem
    Alimi, Adel M.
    [J]. 2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 46 - 51
  • [4] TRAINING AND ADAPTING MLP FEATURES FOR ARABIC SPEECH RECOGNITION
    Park, J.
    Diehl, F.
    Gales, M. J. F.
    Tomalin, M.
    Woodland, P. C.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4461 - 4464
  • [5] Efficient Generation and Use of MLP Features for Arabic Speech Recognition
    Park, J.
    Diehl, F.
    Gales, M. J. F.
    Tomalin, M.
    Woodland, P. C.
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 240 - 243
  • [6] A Canonicalization of Distinctive Phonetic Features to Improve Arabic Speech Recognition
    Alotaibi, Yousef A.
    Selouani, Sidh-Amed
    Yakoub, Mohammed Sidi
    Seddiq, Yasser Mohammed
    Meftah, Ali
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2019, 105 (06) : 1269 - 1277
  • [7] Prosodic Features and Formant Contribution for Arabic Speech Recognition in Noisy Environments
    Amrous, Anissa Imen
    Debyeche, Mohamed
    Amrouche, Abderrahman
    [J]. SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS, 6TH INTERNATIONAL CONFERENCE SOCO 2011, 2011, 87 : 465 - 474
  • [8] Integration of Auxiliary Features in Hidden Markov Models for Arabic Speech Recognition
    Amrous, Anissa Imen
    Debyeche, Mohamed
    Amrouche, A.
    [J]. 2009 3RD INTERNATIONAL CONFERENCE ON SIGNALS, CIRCUITS AND SYSTEMS (SCS 2009), 2009, : 612 - 616
  • [9] Speaker-Dependent Bottleneck Features for Egyptian Arabic Speech Recognition
    Romanenko, Aleksei
    Mendelev, Valentin
    [J]. SPEECH AND COMPUTER, 2016, 9811 : 620 - 626
  • [10] Emotion Recognition in Arabic Speech
    Klaylat, Samira
    Hamandi, Lama
    Osman, Ziad
    Zantout, Rached
    [J]. 2017 SENSORS NETWORKS SMART AND EMERGING TECHNOLOGIES (SENSET), 2017,