Subword analysis of small vocabulary and large vocabulary ASR for Punjabi language

Cited by: 0

Authors:

Puneet Mittal

Navdeep Singh

Affiliations:

[1] BBSBEC

[2] Mata Gujri College

Source:

International Journal of Speech Technology | 2020 / Vol. 23

Keywords:

Subword modeling; Pronunciation dictionary; WER; Acoustic modeling

DOI:

Not available

Abstract:

Words must be modeled as phones carefully, because these phones, or sound units, are used to build the acoustic model. Various acoustic units have been proposed for this purpose, such as phones, characters, syllables, and subwords. A problem arises when too many unique subwords or phones are generated in the dictionary, which makes automatic speech recognition (ASR) difficult, and researchers have formulated diverse techniques to deal with it. In this paper, a subword-based dictionary is explored for the Punjabi language. For a large vocabulary, the number of subwords generated is considerably higher than the number permissible for computation. To reduce the number of subwords to be modeled, an algorithm is proposed that replaces the least frequently occurring subword with a subword that has a similar sound. Acoustic models are developed using the small-vocabulary and large-vocabulary data, and their word error rate (WER) and size are compared. Results reveal that the large-vocabulary models give a high recognition rate, with a WER of only 6%.
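
The replacement step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm, whose details are not given in this record. It assumes a pronunciation dictionary stored as a mapping from words to subword lists, a hand-built `similar` table of similar-sounding substitutes, and a `max_units` cap on distinct subwords; the function name, the table, and the toy units are all hypothetical.

```python
from collections import Counter


def merge_rare_subwords(lexicon, similar, max_units):
    """Replace the rarest subwords with similar-sounding ones (illustrative only).

    lexicon   -- dict mapping each word to its list of subword units
    similar   -- dict mapping a subword to a similar-sounding substitute
                 (hypothetical, assumed to be prepared by hand)
    max_units -- largest number of distinct subwords we are willing to model
    """
    # Work on a copy so the caller's dictionary is left untouched.
    lexicon = {word: list(units) for word, units in lexicon.items()}
    while True:
        counts = Counter(u for units in lexicon.values() for u in units)
        if len(counts) <= max_units:
            break
        # A unit can be merged only if its substitute is a different unit
        # that is still in use; each merge then removes one distinct unit.
        candidates = [
            u for u in counts
            if u in similar and similar[u] != u and similar[u] in counts
        ]
        if not candidates:
            break  # nothing left that can be merged
        # Least frequent unit first; ties are broken alphabetically.
        rare = min(candidates, key=lambda u: (counts[u], u))
        target = similar[rare]
        for units in lexicon.values():
            units[:] = [target if u == rare else u for u in units]
    return lexicon


# Toy data with made-up romanized units, not taken from the paper.
toy_lexicon = {
    "w1": ["ka", "ra"],
    "w2": ["ka", "la"],
    "w3": ["kha", "ra"],
    "w4": ["ka", "ra"],
}
toy_similar = {"kha": "ka", "la": "ra"}

merged = merge_rare_subwords(toy_lexicon, toy_similar, max_units=3)
merged_units = {u for units in merged.values() for u in units}
```

In this toy run the dictionary starts with four distinct units, so one merge is needed to meet the cap of three. Breaking ties alphabetically is an arbitrary choice made here only to keep the result deterministic.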

Pages: 71 - 78

Page count: 7

Related records: 50 in total

[41] An Analysis on the Vocabulary of Learners of Turkish as a Foreign Language
Tufekcioglu, Burak
HACETTEPE UNIVERSITESI EGITIM FAKULTESI DERGISI-HACETTEPE UNIVERSITY JOURNAL OF EDUCATION, 2020, 35 (01): 1 - 19
[42] Subword RNNLM Approximations for Out-Of-Vocabulary Keyword Search
Singh, Mittul
Virpioja, Sami
Smit, Peter
Kurimo, Mikko
INTERSPEECH 2019, 2019: 4235 - 4239
[43] A real-time prototype for small-vocabulary audio-visual ASR
Connell, JH
Haas, N
Marcheret, E
Neti, C
Potamianos, G
Velipasalar, S
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL II, PROCEEDINGS, 2003: 469 - 472
[44] Language identification through large vocabulary continous speech recognition
Lim, BP
Li, HZ
Chen, Y
2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004: 49 - 52
[45] A large vocabulary continuous speech recognition system for Persian language
Sameti, Hossein
Veisi, Hadi
Bahrani, Mohammad
Babaali, Bagher
Hosseinzadeh, Khosro
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011: 1 - 12
[46] Large vocabulary speech recognition with multispan statistical language models
Bellegarda, JR
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (01): 76 - 84
[47] A large vocabulary continuous speech recognition system for Persian language
Hossein Sameti
Hadi Veisi
Mohammad Bahrani
Bagher Babaali
Khosro Hosseinzadeh
EURASIP Journal on Audio, Speech, and Music Processing, 2011
[48] LARGE-VOCABULARY SPEECH RECOGNITION - A SYSTEM FOR THE ITALIAN LANGUAGE
DORTA, P
FERRETTI, M
MARTELLI, A
MELECRINIS, S
SCARCI, S
VOLPI, G
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1988, 32 (02) : 217 - 226
[49] A multispan language modeling framework for large vocabulary speech recognition
Bellegarda, JR
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (05): 456 - 467
[50] TEACHING A LARGE RUSSIAN LANGUAGE VOCABULARY BY MNEMONIC KEYWORD METHOD
RAUGH, MR
SCHUPBACH, RD
ATKINSON, RC
INSTRUCTIONAL SCIENCE, 1977, 6 (03) : 199 - 221