Subword analysis of small vocabulary and large vocabulary ASR for Punjabi language

被引:0
|
作者
Puneet Mittal
Navdeep Singh
机构
[1] BBSBEC,
[2] Mata Gujri College,undefined
关键词
Subword modeling; Pronunciation dictionary; WER; Acoustic modeling;
D O I
暂无
中图分类号
学科分类号
摘要
Modeling of words into phones should be done quite carefully, as these phones or sound units are used to build the acoustic model. Various techniques have been proposed for modeling the acoustic unit like phone, character, syllable, subword etc. Problem occurs when too many unique subwords/phones are generated in dictionary; it makes the automatic speech recognition process difficult. Various researchers have formulated diverse techniques to deal with it. In this paper, subword based dictionary has been explored for Punjabi language. For large vocabulary, number of subwords generated is quite more than the number permissible for computation. To reduce the number of subwords to be modeled, an algorithm has been proposed to replace least occurring subword with subword having similar sound. Acoustic model has been developed using the small and large vocabulary data. WER and size comparison has been done. Results reveal that large vocabulary models give high recognition rate having only 6% of WER.
引用
收藏
页码:71 / 78
页数:7
相关论文
共 50 条
  • [41] An Analysis on the Vocabulary of Learners of Turkish as a Foreign Language
    Tufekcioglu, Burak
    HACETTEPE UNIVERSITESI EGITIM FAKULTESI DERGISI-HACETTEPE UNIVERSITY JOURNAL OF EDUCATION, 2020, 35 (01): : 1 - 19
  • [42] Subword RNNLM Approximations for Out-Of-Vocabulary Keyword Search
    Singh, Mittul
    Virpioja, Sami
    Smit, Peter
    Kurimo, Mikko
    INTERSPEECH 2019, 2019, : 4235 - 4239
  • [43] A real-time prototype for small-vocabulary audio-visual ASR
    Connell, JH
    Haas, N
    Marcheret, E
    Neti, C
    Potamianos, G
    Velipasalar, S
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL II, PROCEEDINGS, 2003, : 469 - 472
  • [44] Language identification through large vocabulary continous speech recognition
    Lim, BP
    Li, HZ
    Chen, Y
    2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 49 - 52
  • [45] A large vocabulary continuous speech recognition system for Persian language
    Sameti, Hossein
    Veisi, Hadi
    Bahrani, Mohammad
    Babaali, Bagher
    Hosseinzadeh, Khosro
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 12
  • [46] Large vocabulary speech recognition with multispan statistical language models
    Bellegarda, JR
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (01): : 76 - 84
  • [47] A large vocabulary continuous speech recognition system for Persian language
    Hossein Sameti
    Hadi Veisi
    Mohammad Bahrani
    Bagher Babaali
    Khosro Hosseinzadeh
    EURASIP Journal on Audio, Speech, and Music Processing, 2011
  • [48] LARGE-VOCABULARY SPEECH RECOGNITION - A SYSTEM FOR THE ITALIAN LANGUAGE
    DORTA, P
    FERRETTI, M
    MARTELLI, A
    MELECRINIS, S
    SCARCI, S
    VOLPI, G
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1988, 32 (02) : 217 - 226
  • [49] A multispan language modeling framework for large vocabulary speech recognition
    Bellegarda, JR
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (05): : 456 - 467
  • [50] TEACHING A LARGE RUSSIAN LANGUAGE VOCABULARY BY MNEMONIC KEYWORD METHOD
    RAUGH, MR
    SCHUPBACH, RD
    ATKINSON, RC
    INSTRUCTIONAL SCIENCE, 1977, 6 (03) : 199 - 221