Kazakh and Russian Languages Identification Using Long Short-Term Memory Recurrent Neural Networks

被引:0
|
作者
Kozhirbayev, Zhanibek [1 ,2 ]
Yessenbayev, Zhandos [2 ]
Karabalayeva, Muslima [2 ]
机构
[1] LN Gumilyov Eurasian Natl Univ, Fac Informat Technol, Astana, Kazakhstan
[2] Nazarbayev Univ, Natl Lab Astana, Astana, Kazakhstan
关键词
Language identification; Long Short-Term Memory Recurrent Neural Networks;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Automatic language identification (LID) belongs to the automatic process whereby the identity of the language spoken in a speech sample can be distinguished. In recent decades, LID has made significant advancement in spoken language identification which received an advantage from technological achievements in related areas, such as signal processing, pattern recognition, machine learning and neural networks. This work investigates the employment of Long Short Term Memory (LSTM) recurrent neural networks (RNNs) for automatic language identification. The main reason of applying LSTM RNNs to the current task is their reasonable capacity in handling sequences. This study shows that LSTM RNNs can efficiently take advantage of temporal dependencies in acoustic data in order to learn relevant features for language recognition tasks. In this paper, we show results for conducted language identification experiments for Kazakh and Russian languages and the presented LSTM RNN model can deal with short utterances (2s). The model was trained using open-source highlevel neural networks API Keras on limited computational resources.
引用
收藏
页码:342 / 346
页数:5
相关论文
共 50 条
  • [1] Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks
    Zazo, Ruben
    Lozano-Diez, Alicia
    Gonzalez-Dominguez, Javier
    Toledano, Doroteo T.
    Gonzalez-Rodriguez, Joaquin
    [J]. PLOS ONE, 2016, 11 (01):
  • [2] Long Short-Term Memory Recurrent Neural Networks for Antibacterial Peptide Identification
    Youmans, Michael
    Spainhour, Christian
    Qiu, Peng
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 498 - 502
  • [3] LATE REVERBERATION SUPPRESSION USING RECURRENT NEURAL NETWORKS WITH LONG SHORT-TERM MEMORY
    Zhao, Yan
    Wang, DeLiang
    Xu, Buye
    Zhang, Tao
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5434 - 5438
  • [4] LOMBARD SPEECH SYNTHESIS USING LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORKS
    Bollepalli, Bajibabu
    Airaksinen, Manu
    Alku, Paavo
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5505 - 5509
  • [5] Classification of Antibacterial Peptides Using Long Short-Term Memory Recurrent Neural Networks
    Youmans, Michael
    Spainhour, John C. G.
    Qiu, Peng
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (04) : 1134 - 1140
  • [6] Session Based Recommendations Using Recurrent Neural Networks - Long Short-Term Memory
    Dobrovolny, Michal
    Selamat, Ali
    Krejcar, Ondrej
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2021, 2021, 12672 : 53 - 65
  • [7] Statistical downscaling of precipitation using long short-term memory recurrent neural networks
    Saptarshi Misra
    Sudeshna Sarkar
    Pabitra Mitra
    [J]. Theoretical and Applied Climatology, 2018, 134 : 1179 - 1196
  • [8] Statistical downscaling of precipitation using long short-term memory recurrent neural networks
    Misra, Saptarshi
    Sarkar, Sudeshna
    Mitra, Pabitra
    [J]. THEORETICAL AND APPLIED CLIMATOLOGY, 2018, 134 (3-4) : 1179 - 1196
  • [9] Industrial Financial Forecasting using Long Short-Term Memory Recurrent Neural Networks
    Ali, Muhammad Mohsin
    Babar, Muhammad Imran
    Hamza, Muhammad
    Jehanzeb, Muhammad
    Habib, Saad
    Khan, Muhammad Sajid
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (04) : 88 - 99
  • [10] On Speaker Adaptation of Long Short-Term Memory Recurrent Neural Networks
    Miao, Yajie
    Metze, Florian
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1101 - 1105