Effects of Dialectal Code-Switching on Speech Modules: A Study using Egyptian Arabic Broadcast Speech

被引:4
|
作者
Chowdhury, Shammur A. [1 ]
Samih, Younes [1 ]
Eldesouki, Mohamed [2 ]
Ali, Ahmed [1 ]
机构
[1] HBKU, Qatar Comp Res Inst, Doha, Qatar
[2] Concordia Univ, Montreal, PQ, Canada
来源
关键词
code-switching; dialect identification; corpus; code mixing index;
D O I
10.21437/Interspeech.2020-2271
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
The intra-utterance code-switching (CS) is defined as the alternation between two or more languages within the same utterance. Despite the fact that spoken dialectal code-switching (DCS) is more challenging than CS, it remains largely unexplored. In this study, we describe a method to build the first spoken DCS corpus. The corpus is annotated at the token-level minding both linguistic and acoustic cues for dialectal Arabic. For detailed analysis, we study Arabic automatic speech recognition (ASR), Arabic dialect identification (ADI), and natural language processing (NLP) modules for the DCS corpus. Our results highlight the importance of lexical information for discriminating the DCS labels. We observe that the performance of different models is highly dependent on the degree of code-mixing at the token-level as well as its complexity at the utterance-level.
引用
收藏
页码:2382 / 2386
页数:5
相关论文
共 50 条
  • [1] Investigations on speech recognition systems for low-resource dialectal Arabic-English code-switching speech
    Hamed, Injy
    Denisov, Pavel
    Li, Chia-Yu
    Elmahdy, Mohamed
    Abdennadher, Slim
    Ngoc Thang Vu
    COMPUTER SPEECH AND LANGUAGE, 2022, 72
  • [2] Arabic Code-Switching Speech Recognition using Monolingual Data
    Ali, Ahmed
    Chowdhur, Shammur
    Hussein, Amir
    Hifny, Yasser
    INTERSPEECH 2021, 2021, : 3475 - 3479
  • [3] Addressing Code-Switching in French/Algerian Arabic Speech
    Amazota, Djegdjiga
    Adda-Decker, Martine
    Lamel, Lori
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 62 - 66
  • [4] Code-switching in reported speech
    Leisiö, L
    SELECTED PAPERS FROM THE 6TH INTERNATIONAL PRAGMATICS CONFERENCE, VOL 2: PRAGMATICS IN 1998, 1999, : 349 - 362
  • [5] Code-switching in Indic Speech Synthesisers
    Thomas, Anju Leela
    Prakash, Anusha
    Baby, Arun
    Murthy, Hema A.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1948 - 1952
  • [6] Developing an Automatic Speech Recognizer For Filipino with English Code-Switching in News Broadcast
    Lim, Mark Louis
    Xu, Aaron John
    Lin, Charles Stepven
    Chen, Zishi
    Pascual, Ronald
    2022-14TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST 2022), 2022, : 13 - 17
  • [8] TEXTUAL DATA AUGMENTATION FOR ARABIC-ENGLISH CODE-SWITCHING SPEECH RECOGNITION
    Hussein, Amir
    Chowdhury, Shammur Absar
    Abdelali, Ahmed
    Dehak, Najim
    Ali, Ahmed
    Khudanpur, Sanjeev
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 777 - 784
  • [9] Look at the gato! Code-switching in speech to toddlers
    Bail, Amelie
    Morini, Giovanna
    Newman, Rochelle S.
    JOURNAL OF CHILD LANGUAGE, 2015, 42 (05) : 1073 - 1101
  • [10] Direct Speech in the context of discussion on code-switching
    Barciela, Lois Xacobe Atanes
    ESTUDOS DE LINGUISTICA GALEGA, 2023, 15