IMPROVING LANGUAGE IDENTIFICATION FOR MULTILINGUAL SPEAKERS

Cited by: 0
Authors
Titus, Andrew [1]
Silovsky, Jan [1]
Chen, Nanxin [1, 2]
Hsiao, Roger [1]
Young, Mary [1]
Ghoshal, Arnab [1]
Affiliations
[1] Apple, Cupertino, CA 95014 USA
[2] Johns Hopkins Univ, Baltimore, MD 21218 USA
Keywords
Language identification; multilingual; recognition
DOI
10.1109/icassp40776.2020.9053057
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Spoken language identification (LID) technologies have improved in recent years from discriminating largely distinct languages to discriminating highly similar languages or even dialects of the same language. One aspect that has been mostly neglected, however, is discrimination of languages for multilingual speakers, despite being a primary target audience of many systems that utilize LID technologies. As we show in this work, LID systems can have a high average accuracy for most combinations of languages while greatly underperforming for others when accented speech is present. We address this by using coarser-grained targets for the acoustic LID model and integrating its outputs with interaction context signals in a context-aware model to tailor the system to each user. This combined system achieves an average 97% accuracy across all language combinations while improving worst-case accuracy by over 60% relative to our baseline.
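The abstract does not specify how the acoustic model's outputs are combined with context signals. As an illustration only, the minimal sketch below assumes a simple log-linear fusion of acoustic posteriors with a per-user prior over that user's languages. The function name `fuse_lid_scores`, the `weight` parameter, and all numbers are hypothetical and are not taken from the paper.

```python
import math


def fuse_lid_scores(acoustic_posteriors, context_prior, weight=1.0):
    """Hypothetical fusion of acoustic LID scores with a per-user context prior.

    acoustic_posteriors: dict mapping language code -> acoustic posterior.
    context_prior: dict mapping language code -> prior derived from context
        (assumed here to cover only the languages the user actually uses).
    weight: assumed exponent controlling how strongly the prior counts.
    Returns a normalized distribution over the languages present in both inputs.
    """
    log_scores = {}
    for lang, posterior in acoustic_posteriors.items():
        prior = context_prior.get(lang, 0.0)
        # Languages with zero acoustic or prior mass are excluded outright.
        if posterior <= 0.0 or prior <= 0.0:
            continue
        log_scores[lang] = math.log(posterior) + weight * math.log(prior)

    if not log_scores:
        raise ValueError("no language has both acoustic and prior support")

    # Subtract the maximum before exponentiating for numerical stability.
    top = max(log_scores.values())
    unnormalized = {lang: math.exp(s - top) for lang, s in log_scores.items()}
    total = sum(unnormalized.values())
    return {lang: value / total for lang, value in unnormalized.items()}


# Made-up example: the acoustic model slightly prefers English, but the
# context prior for this user favors Hindi and rules out German.
acoustic = {"en": 0.40, "hi": 0.35, "de": 0.25}
prior = {"en": 0.30, "hi": 0.70}
fused = fuse_lid_scores(acoustic, prior)
best_language = max(fused, key=fused.get)
```

In this made-up case the prior shifts the decision from English to Hindi and removes German from consideration, which is the kind of per-user tailoring the abstract describes in general terms.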
Pages: 8284-8288
Number of pages: 5
Related papers
50 in total
  • [21] A unified system for multilingual speech recognition and language identification
    Liu, Danyang
    Xu, Ji
    Zhang, Pengyuan
    Yan, Yonghong
    SPEECH COMMUNICATION, 2021, 127 : 17 - 28
  • [22] Enhancing multilingual recognition of emotion in speech by language identification
    Sagha, Hesam
    Matejka, Pavel
    Gavryukova, Maryna
    Povolny, Filip
    Marchi, Erik
    Schuller, Bjoern
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2949 - 2953
  • [23] Improving Multilingual Transformer Transducer Models by Reducing Language Confusions
    Sun, Eric
    Li, Jinyu
    Meng, Zhong
    Wu, Yu
    Xue, Jian
    Liu, Shujie
    Gong, Yifan
    INTERSPEECH 2021, 2021, : 3470 - 3474
  • [24] Task Rebalancing: Improving Multilingual Communication with Native Speakers-Generated Highlights on Automated Transcripts
    Pan, Mei-Hua
    Yamashita, Naomi
    Wang, Hao-Chuan
    CSCW'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, 2017, : 310 - 321
  • [25] The speakers of minority languages are more multilingual
    Dobrushina, Nina
    Moroz, George
    INTERNATIONAL JOURNAL OF BILINGUALISM, 2021, 25 (04) : 921 - 938
  • [26] Developmental acquisition of stops by multilingual speakers
    Cal, Zuzanna
    Wrembel, Magdalena
    INTERNATIONAL JOURNAL OF MULTILINGUALISM, 2025, 22 (01) : 13 - 35
  • [27] Speech-language pathologists' support for multilingual speakers' English intelligibility and participation informed by the ICF
    Blake, Helen L.
    McLeod, Sharynne
    JOURNAL OF COMMUNICATION DISORDERS, 2019, 77 : 56 - 70
  • [28] Assessing multilingual speakers' language processing through functional near-infrared spectroscopy (fNIRS)
    Farrukh, Fizza
    Nazeer, Hammad
    Minhas, Hamza Shabbir
    Naseer, Noman
    Noori, Farzan Majeed
    BEHAVIOURAL BRAIN RESEARCH, 2025, 484
  • [29] Colonial mindset, appropriation, and emotion: language attitudes towards English of multilingual speakers in southern Philippines
    Ponce, Ariel Robert C.
    INTERNATIONAL JOURNAL OF MULTILINGUALISM, 2025,
  • [30] Language Identification: A New Fast Algorithm to Identify the Language of a Text in a Multilingual Corpus
    Gadri, Said
    Moussaoui, Abdelouahab
    Belabdelouahab-Fernini, Linda
    2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 321 - 326