A Deep Learning Approach for Arabic Spoken Command Spotting

被引:0
|
作者
Salhab, Mahmoud [1 ]
Harmanani, Haidar [1 ]
机构
[1] Lebanese Amer Univ, Dept Comp Sci & Math, Byblos 14012010, Lebanon
关键词
Speech Recognition; Conformer; synthetic data generation; SPEECH;
D O I
10.1109/CCECE59415.2024.10667309
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Keyword spotting (KWS) in spoken language involves identifying specific terms within an audio stream. While this task is widely employed in edge devices, achieving high accuracy is challenging due to the need for efficiency on low-power and resource-constrained systems. This paper presents a novel approach for Arabic keyword spotting using a ConformerGRU model architecture. The model's performance is further enhanced by training a text-to-speech model for synthetic data generation. The proposed method demonstrates state-of-the-art results with 99.59% accuracy.
引用
收藏
页码:330 / 334
页数:5
相关论文
共 50 条
  • [1] AraSpot: Arabic Spoken Command Spotting
    Salhab, Mahmoud
    Harmanani, Haidar
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (07)
  • [2] A Deep Learning Approach for Identifying and Discriminating Spoken Arabic Among Other Languages
    Alashban, Adal A.
    Alotaibi, Yousef A.
    IEEE ACCESS, 2023, 11 : 11613 - 11628
  • [3] Spoken Arabic Digits Recognition Using Deep Learning
    Wazir, Abdulaziz Saleh Mahfoudh B. A.
    Chuah, Joon Huang
    2019 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INTELLIGENT SYSTEMS (I2CACIS), 2019, : 339 - 344
  • [4] Recognition of Arabic Accents From English Spoken Speech Using Deep Learning Approach
    Habbash, Mansoor
    Mnasri, Sami
    Alghamdi, Mansoor
    Alrashidi, Malek
    Tarawneh, Ahmad S.
    Gumair, Abdullah
    Hassanat, Ahmad B.
    IEEE ACCESS, 2024, 12 : 37219 - 37230
  • [5] Deep Spoken Keyword Spotting: An Overview
    Lopez-Espejo, Ivan
    Tan, Zheng-Hua
    Hansen, John H. L.
    Jensen, Jesper
    IEEE ACCESS, 2022, 10 : 4169 - 4199
  • [6] A Robust Model for Translating Arabic Sign Language into Spoken Arabic Using Deep Learning
    Nahar, Khalid M. O.
    Almomani, Ammar
    Shatnawi, Nahlah
    Alauthman, Mohammad
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (02): : 2037 - 2057
  • [7] PROGRESSIVE CONTINUAL LEARNING FOR SPOKEN KEYWORD SPOTTING
    Huang, Yizheng
    Hou, Nana
    Chen, Nancy F.
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7552 - 7556
  • [8] A Deep Learning Approach for Arabic Manuscripts Classification
    Al-homed, Lutfieh S.
    Jambi, Kamal M.
    Al-Barhamtoshy, Hassanin M.
    SENSORS, 2023, 23 (19)
  • [9] A Deep Learning Approach for Arabic Text Classification
    Sundus, Katrina
    Al-Haj, Fatima
    Hammo, Bassam
    2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2019, : 258 - 264
  • [10] A DEEP LEARNING APPROACH TO MODELING COMPETITIVENESS IN SPOKEN CONVERSATIONS
    Chowdhury, Shammur Absar
    Riccardi, Giuseppe
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5680 - 5684