Dual Script E2E Framework for Multilingual and Code-Switching ASR

被引:1
|
作者
Kumar, Mari Ganesh [1 ]
Kuriakose, Jom [1 ]
Thyagachandran, Anand [1 ]
Kumar, Arun A. [1 ]
Seth, Ashish [1 ]
Prasad, Lodagala V. S. V. Durga [1 ]
Jaiswal, Saish [1 ]
Prakash, Anusha [1 ]
Murthy, Hema A. [1 ]
机构
[1] Indian Inst Technol Madras, Chennai, Tamil Nadu, India
来源
关键词
speech recognition; low-resource; multilingual; common label set; dual script;
D O I
10.21437/Interspeech.2021-978
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
India is home to multiple languages, and training automatic speech recognition (ASR) systems is challenging. Over time, each language has adopted words from other languages, such as English, leading to code-mixing. Most Indian languages also have their own unique scripts, which poses a major limitation in training multilingual and code-switching ASR systems. Inspired by results in text-to-speech synthesis, in this paper, we use an in-house rule-based phoneme-level common label set (CLS) representation to train multilingual and code-switching ASR for Indian languages. We propose two end-to-end (E2E) ASR systems. In the first system, the E2E model is trained on the CLS representation, and we use a novel data-driven back-end to recover the native language script. In the second system, we propose a modification to the E2E model, wherein the CLS representation and the native language characters are used simultaneously for training. We show our results on the multilingual and code-switching (MUCS) ASR challenge 2021. Our best results achieve approximate to 6% and 5% improvement in word error rate over the baseline system for the multilingual and code-switching tasks, respectively, on the challenge development data.
引用
收藏
页码:2441 / 2445
页数:5
相关论文
共 50 条
  • [1] Improving Recognition of Out-of-vocabulary Words in E2E Code-switching ASR by Fusing Speech Generation Methods
    Ye, Lingxuan
    Cheng, Gaofeng
    Yang, Runyan
    Yang, Zehui
    Tian, Sanli
    Zhang, Pengyuan
    Yan, Yonghong
    [J]. INTERSPEECH 2022, 2022, : 3163 - 3167
  • [2] Code-Switching in Multilingual Picturebooks
    Kuemmerling-Meibauer, Bettina
    [J]. BOOKBIRD-A JOURNAL OF INTERNATIONAL CHILDRENS LITERATURE, 2013, 51 (03) : 12 - 21
  • [3] The CSTR system for multilingual and code-switching ASR challenges for low resource Indian languages
    Centre for Speech Technology Research, University of Edinburgh, United Kingdom
    [J]. Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH, 2308, (1001-1005):
  • [4] The CSTR System for Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages
    Klejch, Ondrej
    Wallington, Electra
    Bell, Peter
    [J]. INTERSPEECH 2021, 2021, : 2881 - 2885
  • [5] MUCS 2021: Multilingual and code-switching ASR challenges for low resource Indian languages
    Diwan, Anuj
    Vaideeswaran, Rakesh
    Shah, Sanket
    Singh, Ankita
    Raghavan, Srinivasa
    Khare, Shreya
    Unni, Vinit
    Vyas, Saurabh
    Rajpuria, Akash
    Yarra, Chiranjeevi
    Mittal, Ashish
    Ghosh, Prasanta Kumar
    Jyothi, Preethi
    Bali, Kalika
    Seshadri, Vivek
    Sitaram, Sunayana
    Bharadwaj, Samarth
    Nanavati, Jai
    Nanavati, Raoul
    Sankaranarayanan, Karthik
    [J]. INTERSPEECH 2021, 2021, : 2446 - 2450
  • [6] ZERO-SHOT CODE-SWITCHING ASR AND TTS WITH MULTILINGUAL MACHINE SPEECH CHAIN
    Nakayama, Sahoko
    Tjandra, Andros
    Sakti, Sakriani
    Nakamura, Satoshi
    [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 964 - 971
  • [7] Towards One Model to Rule All: Multilingual Strategy for Dialectal Code-Switching Arabic ASR
    Chowdhury, Shammur Absar
    Hussein, Amir
    Abdelali, Ahmed
    Ali, Ahmed
    [J]. INTERSPEECH 2021, 2021, : 2466 - 2470
  • [8] CODE-SWITCHING DETECTION USING MULTILINGUAL DNNS
    Yilmaz, Emre
    van den Heuvel, Henk
    van Leeuwen, David
    [J]. 2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 610 - 616
  • [9] An Investigation of Acoustic Models for Multilingual Code-Switching
    White, Christopher M.
    Khudanpur, Sanjeev
    Baker, James K.
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2691 - +
  • [10] To switch or not to switch: Code-switching in a multilingual country
    Shay, Orit
    [J]. 3RD INTERNATIONAL CONFERENCE - EDUCATION, REFLECTION, DEVELOPMENT, 2015, 209 : 462 - 469