Subword analysis of small vocabulary and large vocabulary ASR for Punjabi language

被引:0
|
作者
Puneet Mittal
Navdeep Singh
机构
[1] BBSBEC,
[2] Mata Gujri College,undefined
关键词
Subword modeling; Pronunciation dictionary; WER; Acoustic modeling;
D O I
暂无
中图分类号
学科分类号
摘要
Modeling of words into phones should be done quite carefully, as these phones or sound units are used to build the acoustic model. Various techniques have been proposed for modeling the acoustic unit like phone, character, syllable, subword etc. Problem occurs when too many unique subwords/phones are generated in dictionary; it makes the automatic speech recognition process difficult. Various researchers have formulated diverse techniques to deal with it. In this paper, subword based dictionary has been explored for Punjabi language. For large vocabulary, number of subwords generated is quite more than the number permissible for computation. To reduce the number of subwords to be modeled, an algorithm has been proposed to replace least occurring subword with subword having similar sound. Acoustic model has been developed using the small and large vocabulary data. WER and size comparison has been done. Results reveal that large vocabulary models give high recognition rate having only 6% of WER.
引用
收藏
页码:71 / 78
页数:7
相关论文
共 50 条
  • [1] Subword analysis of small vocabulary and large vocabulary ASR for Punjabi language
    Mittal, Puneet
    Singh, Navdeep
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (01) : 71 - 78
  • [2] Corrective language modeling for large vocabulary ASR with the perceptron algorithm
    Roark, B
    Saraclar, M
    Collins, M
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 749 - 752
  • [3] LARGE VOCABULARY SPEECH RECOGNITION USING SUBWORD UNITS
    LEE, CH
    GAUVAIN, JL
    PIERACCINI, R
    RABINER, LR
    SPEECH COMMUNICATION, 1993, 13 (3-4) : 263 - 279
  • [4] EXPLORING MULTIDIMENSIONAL LSTMS FOR LARGE VOCABULARY ASR
    Li, Jinyu
    Mohamed, Abdelrahman
    Zweig, Geoffrey
    Gong, Yifan
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 4940 - 4944
  • [5] Very Large Vocabulary ASR for Spoken Russian with Syntactic and Morphemic Analysis
    Karpov, Alexey
    Kipyatkova, Irina
    Ronzhin, Andrey
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3168 - +
  • [6] SUBWORD-BASED LARGE-VOCABULARY SPEECH RECOGNITION
    LEE, CH
    GAUVAIN, JL
    PIERACCINI, R
    RABINER, LR
    AT&T TECHNICAL JOURNAL, 1993, 72 (05): : 25 - 36
  • [7] A STUDY ON MULTILINGUAL ACOUSTIC MODELING FOR LARGE VOCABULARY ASR
    Lin, Hui
    Deng, Li
    Yu, Dong
    Gong, Yi-fan
    Acero, Alex
    Lee, Chin-Hui
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4333 - +
  • [8] Sparse imputation for large vocabulary noise robust ASR
    Gemmeke, Jort Florent
    Cranen, Bert
    Remes, Ulpu
    COMPUTER SPEECH AND LANGUAGE, 2011, 25 (02): : 462 - 479
  • [9] DUAL LEARNING FOR LARGE VOCABULARY ON-DEVICE ASR
    Peyser, Cal
    Huang, Ronny
    Sainath, Tara
    Prabhavalkar, Rohit
    Picheny, Michael
    Cho, Kyunghyun
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 245 - 251
  • [10] An Exploration of Large Vocabulary Tools for Small Vocabulary Phonetic Recognition
    Sainath, Tara N.
    Ramabhadran, Bhuvana
    Picheny, Michael
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 359 - 364