LANGUAGE DIARIZATION FOR CODE-SWITCH CONVERSATIONAL SPEECH

Authors
Lyu, Dau-Cheng [1]
Chng, Eng-Siong [1]
Li, Haizhou [1]
Affiliations
[1] Nanyang Technological University, Temasek Laboratories, Singapore 639798, Singapore
Keywords
language diarization; language recognition; code-switch; conversational speech; IDENTIFICATION;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
This paper examines language diarization, the task of performing language segmentation and recognition, in code-switched speech. For this task, we have developed a 63-hour conversational code-switch corpus recorded from Singaporean and Malaysian speakers. We show that code-switching can occur frequently and that the average language interval may be as short as one second, which makes language diarization a challenging task. To process such short segments, we propose a language diarization system that uses long-term context features spanning several phone-based segments and combines acoustic and phonotactic information. The system achieves a frame error rate of 14.7% for language diarization on a Mandarin-English code-switch corpus. To evaluate our system, we also measured its language recognition performance on monolingual segments extracted from the code-switch corpus against published language identification (LID) techniques. We obtained relative equal error rate reductions of 5.2%, 13.8%, 15.1% and 17.9% on speech durations of 0.1 to 0.5 s, 0.5 to 1 s, 1 to 3 s and 3 to 9 s, respectively.
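The two figures of merit quoted in the abstract, frame error rate and relative error rate reduction, can be illustrated with a minimal Python sketch. This is not the authors' scoring code: the function names `frame_error_rate` and `relative_reduction`, the per-frame label format, and all numbers in the toy example are assumptions made purely for illustration, and the paper's actual scoring protocol (frame length, handling of silence) is not given in this record.

```python
from typing import Sequence


def frame_error_rate(reference: Sequence[str], hypothesis: Sequence[str]) -> float:
    """Fraction of frames whose hypothesized language label differs from the reference.

    Assumes one language label per frame and equal-length sequences
    (hypothetical format; the paper's exact protocol is not stated here).
    """
    if len(reference) != len(hypothesis):
        raise ValueError("reference and hypothesis must have the same number of frames")
    if len(reference) == 0:
        raise ValueError("at least one frame is required")
    errors = sum(r != h for r, h in zip(reference, hypothesis))
    return errors / len(reference)


def relative_reduction(baseline: float, system: float) -> float:
    """Relative reduction of an error rate with respect to a baseline error rate."""
    if baseline <= 0:
        raise ValueError("baseline error rate must be positive")
    return (baseline - system) / baseline


# Toy example with invented labels (not data from the paper).
ref = ["zh", "zh", "zh", "en", "en", "zh", "zh", "en", "en", "en"]
hyp = ["zh", "zh", "en", "en", "en", "zh", "en", "en", "en", "en"]

fer = frame_error_rate(ref, hyp)
print(f"frame error rate: {fer:.1%}")

# Invented baseline and system error rates, only to show the arithmetic.
print(f"relative reduction: {relative_reduction(0.20, 0.17):.1%}")
```

In this sketch a relative reduction compares two error rates as a ratio, so a system that lowers a baseline error rate from 20% to 17% achieves a 15% relative reduction even though the absolute difference is 3 percentage points.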
Pages: 7314-7318
Page count: 5