Enhancing Large Vocabulary Continuous Speech Recognition System for Urdu-English Conversational Code-Switched Speech

被引:0
|
作者
Farooq, Muhammad Umar [1 ]
Adeeba, Farah [1 ]
Hussain, Sarmad [1 ]
Rauf, Sahar [1 ]
Khalid, Maryam [1 ]
机构
[1] Univ Engn & Technol, Ctr Language Engn, Al Khawarizmi Inst Comp Sci, Lahore, Pakistan
关键词
Urdu-English code-switching; Urdu speech recognition; under-resourced language;
D O I
10.1109/o-cocosda50338.2020.9295036
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents first step towards Large Vocabulary Continuous Speech Recognition (LVCSR) system for Urdu-English code-switched conversational speech. Urdu is the national language and lingua franca of Pakistan, with 100 million speakers worldwide. English, on the other hand, is official language of Pakistan and commonly mixed with Urdu in daily communication. Urdu, being under-resourced language, have no substantial Urdu-English code-switched corpus in hand to develop speech recognition system. In this research, readily available spontaneous Urdu speech corpus (25 hours) is revised to use it for enhancement of read speech Urdu LVCSR to recognize code-switched speech. This data set is split into 20 hours of train and 5 hours of test set. 10 hours of Urdu BroadCast (BC) data are collected and annotated in a semi-supervised way to enhance the system further. For acoustic modeling, state-of-the-art DNN-HMM modeling technique is used without any prior GMM-HMM training and alignments. Various techniques to improve language model using monolingual data are investigated. The overall percent Word Error Rate (WER) is reduced from 40.71% to 26.95% on test set.
引用
收藏
页码:155 / 159
页数:5
相关论文
共 50 条
  • [41] Experimenting with lipreading for large vocabulary continuous speech recognition
    Karel Paleček
    Journal on Multimodal User Interfaces, 2018, 12 : 309 - 318
  • [42] Large-Vocabulary Continuous Speech Recognition Systems
    Saon, George
    Chien, Jen-Tzung
    IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 18 - 33
  • [43] Recent Developments in Large Vocabulary Continuous Speech Recognition
    Saon, George
    Chien, Jen-Tzung
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [44] Development of Large Vocabulary Continuous Speech Recognition for Polish
    Demenko, G.
    Szymanski, M.
    Cecko, R.
    Kusmierek, E.
    Lange, M.
    Wegner, K.
    Klessa, K.
    Owsianny, M.
    ACTA PHYSICA POLONICA A, 2012, 121 (1A) : A86 - A91
  • [45] Investigation on large vocabulary continuous Kannada speech recognition
    Vanajakshi, Puttaswamy Gowda
    Mathivanan, M.
    Kumaran, T. Senthil
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2021, 36 (01) : 1 - 24
  • [46] JNAS: Japanese speech corpus for large vocabulary continuous speech recognition research
    Itou, Katunobu
    Yamamoto, Mikio
    Takeda, Kazuya
    Takezawa, Toshiyuki
    Matsuoka, Tatsuo
    Kobayashi, Tetsunori
    Shikano, Kiyohiro
    Itahashi, Shuichi
    Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 1999, 20 (03): : 199 - 206
  • [47] Code-switched end-to-end Marathi speech recognition for especially abled people
    Hore, Praveen
    Sharma, Amit
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2022, 25 (03): : 771 - 784
  • [48] AUTOMATIC DETECTION OF NEW WORDS IN A LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION SYSTEM
    ASADI, A
    SCHWARTZ, R
    MAKHOUL, J
    SPEECH AND NATURAL LANGUAGE, 1989, : 263 - 265
  • [49] Modeling word-level rate-of-speech variation in large vocabulary conversational speech recognition
    Zheng, J
    Franco, H
    Stolcke, A
    SPEECH COMMUNICATION, 2003, 41 (2-3) : 273 - 285
  • [50] Capitalising on North American speech resources for the development of a South African English large vocabulary speech recognition system
    Kamper, Herman
    de Wet, Febe
    Hain, Thomas
    Niesler, Thomas
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (06): : 1255 - 1268