A large vocabulary continuous speech recognition system for Persian language

Authors
Hossein Sameti
Hadi Veisi
Mohammad Bahrani
Bagher Babaali
Khosro Hosseinzadeh
Affiliation
[1] Sharif University of Technology, Department of Computer Engineering
Keywords
Language Model; Automatic Speech Recognition; Speech Recognition System; Word Error Rate; Voice Activity Detector
DOI
Not available
Abstract
This paper introduces the first large vocabulary speech recognition system for the Persian language. This continuous speech recognition system uses most standard and state-of-the-art speech and language modeling techniques. Development of the system, called Nevisa, started in 2003 with a predominantly academic focus. The engine incorporates customized versions of the established components of traditional continuous speech recognizers, and its parameters have been optimized for real applications of the Persian language. For this purpose, we had to identify the computational challenges of the Persian language, especially for text processing, and extract statistical and grammatical language models for it. To achieve this, we had to either generate the necessary speech and text corpora or modify the primitive corpora available for the Persian language.
Related papers
  • [1] A large vocabulary continuous speech recognition system for Persian language
    Sameti, Hossein
    Veisi, Hadi
    Bahrani, Mohammad
    Babaali, Bagher
    Hosseinzadeh, Khosro
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 12
  • [2] Connectionist language modeling for large vocabulary continuous speech recognition
    Schwenk, H
    Gauvain, JL
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 765 - 768
  • [3] The RWTH large vocabulary continuous speech recognition system
    Ney, H
    Welling, L
    Ortmanns, S
    Beulen, K
    Wessel, F
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 853 - 856
  • [4] A Myanmar Large Vocabulary Continuous Speech Recognition System
    Naing, Hay Mar Soe
    Hlaing, Aye Mya
    Pa, Win Pa
    Hu, Xinhui
    Thu, Ye Kyaw
    Hori, Chiori
    Kawai, Hisashi
    [J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 320 - 327
  • [5] A unified language model for large vocabulary continuous speech recognition of Turkish
    Arisoy, Ebru
    Dutagaci, Helin
    Arslan, Levent M.
    [J]. SIGNAL PROCESSING, 2006, 86 (10) : 2844 - 2862
  • [6] Automatic language identification using large vocabulary continuous speech recognition
    Mendoza, S
    Gillick, L
    Ito, Y
    Lowe, S
    Newmann, M
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 785 - 788
  • [7] A LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION SYSTEM WITH HIGH PREDICTABILITY
    SHIGENAGA, M
    SEKIGUCHI, Y
    YAMAGUCHI, T
    MASUDA, R
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07) : 1817 - 1825
  • [8] A large-vocabulary continuous speech recognition system for Hindi
    Kumar, M
    Rajput, N
    Verma, A
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2004, 48 (5-6) : 703 - 715
  • [9] LARGE-VOCABULARY SPEECH RECOGNITION - A SYSTEM FOR THE ITALIAN LANGUAGE
    DORTA, P
    FERRETTI, M
    MARTELLI, A
    MELECRINIS, S
    SCARCI, S
    VOLPI, G
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1988, 32 (02) : 217 - 226
  • [10] Syllable based language model for large vocabulary continuous speech recognition of Uyghur
    [J]. Silamu, W. (wushour@xju.edu.cn), 1600, Tsinghua University (53):