A Hybrid Convolutional Bi-Directional Gated Recurrent Unit System for Spoken Languages of JK and Ladakhi

被引:0
|
作者
Thukroo, Irshad Ahmad [1 ]
Bashir, Rumaan [1 ]
Giri, Kaiser J. J. [1 ]
机构
[1] Islamic Univ Sci & Technol, Dept Comp Sci, 1 Univ Ave, Pulwama 192122, Jammu & Kashmir, India
关键词
Language identification; convolutional neural network; long short-term memory; Bi-directional gated recurrent unit; IIITH-ILSC; FEATURE-SELECTION METHOD; NEURAL-NETWORK; IDENTIFICATION; RECOGNITION;
D O I
10.1142/S0219649223500284
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Spoken language identification is the process of recognising language in an audio segment and is the precursor for several technologies such as automatic call routing, language recognition, multilingual conversation, language parsing, and sentimental analysis. Language identification has become a challenging task for low-resource languages like Kashmiri and Ladakhi spoken in the UT's of Jammu and Kashmir (JK) and Ladakh, India. This is mainly due to speaker variations like duration, moderator, and ambiance particularly when training and testing are done on different datasets whilst analysing the accuracy of language identification system in actual implementation, thus producing low accuracy results. In order to tackle this problem, we propose a hybrid convolutional bi-directional gated recurrent unit (Bi-GRU) utilising the effects of both static and dynamic behaviour of the audio signal in order to achieve better results as compared to state-of-the-art models. The audio signals are first converted into two-dimensional structures called Mel-spectrograms to represent the frequency distribution over time. To investigate the spectral behaviour of audio signals, we employ a convolutional neural network (CNN) that perceives Mel-spectrograms in multiple dimensions. The CNN-learned feature vector serves as input to the Bi-GRU that maintains the dynamic behaviour of the audio signal. Experiments are done on six spoken languages, i.e. Ladakhi, Kashmiri, Hindi, Urdu, English, and Dogri. The data corpora used for experimentation are the International Institute of Information Technology Hyderabad-Indian Language Speech Corpus (IIITH-ILSC) and the self-created data corpus for the Ladakhi language. The model is tested on two datasets, i.e. speaker-dependent and speaker-independent. Results show that when validating the efficiency of our proposed model on both speaker-dependent and speaker-independent datasets, we achieve optimal accuracies of 99% and 91%, respectively, thus achieving promising results in comparison to the state-of-the-art models available.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Bi-directional photonic switch and optical data storage in a hybrid optomechanical system
    Yadav, Surabhi
    Bhattacherjee, Aranya B.
    [J]. JOURNAL OF NONLINEAR OPTICAL PHYSICS & MATERIALS, 2022, 31 (02)
  • [32] Generating Popularity-Aware Reciprocal Recommendations Using Siamese Bi-Directional Gated Recurrent Units Network
    Kumari, Tulika
    Sharma, Ravish
    Gupta, Bhavna
    Bedi, Punam
    [J]. VIETNAM JOURNAL OF COMPUTER SCIENCE, 2023, 10 (03) : 273 - 301
  • [33] Tissue-border Detection in Volumetric Laser Endomicroscopy using Bi-directional Gated Recurrent Neural Networks
    Okel, Sanne E.
    van Der Sommen, Fons
    Selmanaj, Endi
    van Der Putten, Joost
    Struyvenberg, Maarten R.
    Bergman, Jacques J. G. H. M.
    de With, Peter H. N.
    [J]. MEDICAL IMAGING 2021: COMPUTER-AIDED DIAGNOSIS, 2021, 11597
  • [34] Developing a Novel Hybrid Model Double Exponential Smoothing and Dual Attention Encoder-Decoder Based Bi-Directional Gated Recurrent Unit Enhanced With Bayesian Optimization to Forecast Stock Price
    Jayanth, Talabathula
    Manimaran, A.
    [J]. IEEE ACCESS, 2024, 12 : 114760 - 114785
  • [35] Unit Commitment of Integrated Electricity and Heat System with Bi-directional Variable Mass Flow
    Wu, Xuewei
    Chen, Zhe
    Fang, Jiakun
    [J]. 2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,
  • [36] Multi-directional gated recurrent unit and convolutional neural network for load and energy forecasting: A novel hybridization
    Abid, Fazeel
    Alam, Muhammad
    Alamri, Faten S.
    Siddique, Imran
    [J]. AIMS MATHEMATICS, 2023, 8 (09): : 19993 - 20017
  • [37] Design and implementation of a PEMFC/battery hybrid generation system with bi-directional power flow
    Hua, Chih-Chiang
    Chuang, Chih-Wei
    Huang, Chi-Lun
    [J]. 2006 IEEE POWER ELECTRONICS SPECIALISTS CONFERENCE, VOLS 1-7, 2006, : 1898 - +
  • [38] Research on bi-directional inverter of wind-solar-diesel hybrid generation system
    Li, Xiaoying
    Zhao, Zhang
    Zhang, Litao
    [J]. ADVANCES IN ENERGY SCIENCE AND TECHNOLOGY, PTS 1-4, 2013, 291-294 : 2553 - +
  • [39] Get Up!: Assessing Postural Activity & Transitions using Bi-Directional Gated Recurrent Units (Bi-GRUs) on Smartphone Motion Data
    Chandrasekaran, Kavin
    Buquicchio, Luke
    Gerych, Walter
    Agu, Emmanuel
    Rundensteiner, Elke
    [J]. 2019 IEEE HEALTHCARE INNOVATIONS AND POINT OF CARE TECHNOLOGIES (HI-POCT), 2019, : 25 - 28
  • [40] System Studies for a Bi-Directional Advanced Hybrid Drive System (AHDS) for Application on a Future Surface Combatant
    Langston, James
    Andrus, Michael
    Steurer, Michael
    Alexander, Dwight
    Buck, Jeffrey
    Robinson, George
    Wieczenski, Don
    [J]. 2013 IEEE ELECTRIC SHIP TECHNOLOGIES SYMPOSIUM (ESTS), 2013, : 509 - 513