Disentangled Representations in Local-Global Contexts for Arabic Dialect Identification

被引:0
|
作者
Alhakeem, Zainab [1 ]
Jang, Se-In [2 ]
Kang, Hong-Goo [1 ]
机构
[1] Yonsei Univ, Dept Elect & Elect Engn, Seoul 03722, South Korea
[2] Harvard Med Sch, Massachusetts Gen Hosp, Gordon Ctr Med Imaging, Boston, MA 02115 USA
关键词
Arabic dialect identification; disentangled representation; supervised clustering; global context; transformer; SPEAKER; NETWORKS;
D O I
10.1109/TASLP.2023.3341006
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this article, we propose a locally and globally informed disentanglement network for Arabic dialect identification (ADI). Our proposed disentanglement network aims to detach all irrelevant information (e.g., speaker, gender and channel) from the source utterance and extract only dialect-related representations fitted for the ADI problem. The proposed network consists of local convolutional backbone modules to learn low-resolution feature maps and self-attention-based bottleneck transformers to efficiently aggregate the local information to represent the global context as the learned dialect embeddings. We propose a novel supervised clustering loss to minimize intra-class variations and maximize inter-class variations in a latent space. Our model achieves state-of-the-art results in qualitative and quantitative evaluations by outperforming other competitive solutions on ADI-17 datasets. Specifically, we have shown that local-global awareness from our proposed network boosts feature representation and enhances identification performance.
引用
收藏
页码:879 / 890
页数:12
相关论文
共 50 条
  • [31] Using Prosody and Phonotactics in Arabic Dialect Identification
    Biadsy, Fadi
    Hirschberg, Julia
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 208 - 211
  • [32] Hierarchical Deep Learning for Arabic Dialect Identification
    de Francony, Gael
    Guichard, Victor
    Joshi, Praveen
    Afli, Haithem
    Bouchekif, Abdessalam
    FOURTH ARABIC NATURAL LANGUAGE PROCESSING WORKSHOP (WANLP 2019), 2019, : 249 - 253
  • [33] Multi-scale local-global architecture for person re-identification
    Liu, Jing
    Tiwari, Prayag
    Tri Gia Nguyen
    Gupta, Deepak
    Band, Shahab S.
    SOFT COMPUTING, 2022, 26 (16) : 7967 - 7977
  • [34] Policies, powers and local-global challenges
    Fernando Baron, Luis
    REVISTA CS EN CIENCIAS SOCIALES, 2016, 20 : 11 - 13
  • [35] LESSONS FOR LOCAL-GLOBAL ACTION AND RESEARCH
    BRYANT, D
    JOURNAL OF COMMUNITY PSYCHOLOGY, 1995, 23 (03) : 250 - 255
  • [36] Local-global aware-transformer for occluded person re-identification
    Liu, Jing
    Zhou, Guoqing
    ALEXANDRIA ENGINEERING JOURNAL, 2023, 84 : 71 - 78
  • [37] Ordinary p-adic representations of GL2(Qp) and local-global compatibility
    Breuil, Christophe
    Emerton, Matthew
    ASTERISQUE, 2010, (331) : 255 - 315
  • [38] Polar Codes with Local-Global Decoding
    Zhu, Ziyuan
    Wu, Wei
    Siegel, Paul H.
    2022 56TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2022, : 392 - 396
  • [39] A local-global principle for the telescope conjecture
    Antieau, Benjamin
    ADVANCES IN MATHEMATICS, 2014, 254 : 280 - 299
  • [40] Local-global divisibility on algebraic tori
    Alessandri, Jessica
    Chirivi, Rocco
    Paladino, Laura
    BULLETIN OF THE LONDON MATHEMATICAL SOCIETY, 2024, 56 (02) : 803 - 816