A Recurrent Neural Network-Based Approach to Automatic Language Identification from Speech

被引:1
|
作者
Mukherjee, Himadri [1 ]
Dhar, Ankita [1 ]
Obaidullah, Sk Md [2 ]
Santosh, K. C. [3 ]
Phadikar, Santanu [4 ]
Roy, Kaushik [1 ]
机构
[1] West Bengal State Univ, Dept Comp Sci, Kolkata, India
[2] Aliah Univ, Dept Comp Sci & Engn, Kolkata, India
[3] Univ South Dakota, Dept Comp Sci, Brookings, SD USA
[4] Maulana Abul Kalam Azad Univ Technol, Dept Comp Sci & Engn, Kolkata, India
关键词
Language identification; Recurrent neural network; Long short-term memory; Line spectral frequency;
D O I
10.1007/978-981-15-0829-5_43
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The task of automatically identifying the used language from speech signals is known as automatic language identification. It is very much important prior to speech recognition in multilingual scenarios where speakers use more than a single language in course of communication. In this paper, a recurrent neural network (RNN)-based system with long short-term memory (LSTM) along with handcrafted line spectral frequency-based features is proposed for language identification. Experiments were performed on as many as 21908 clips (more than 30 h of data) from the top three spoken languages of the world, namely, English, Chinese, and Spanish, and a highest average accuracy of 95.22% has been obtained.
引用
收藏
页码:441 / 450
页数:10
相关论文
共 50 条
  • [1] Recurrent Neural Network-based Language Modeling for an Automatic Russian Speech Recognition System
    Kipyatkova, Irina
    Karpov, Alexey
    2015 ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE AND INFORMATION EXTRACTION, SOCIAL MEDIA AND WEB SEARCH FRUCT CONFERENCE (AINL-ISMW FRUCT), 2015, : 33 - 38
  • [2] A Novel Optimized Recurrent Network-Based Automatic System for Speech Emotion Identification
    Neeraja Koppula
    Koppula Srinivas Rao
    Shaik Abdul Nabi
    Allam Balaram
    Wireless Personal Communications, 2023, 128 : 2217 - 2243
  • [3] A Novel Optimized Recurrent Network-Based Automatic System for Speech Emotion Identification
    Koppula, Neeraja
    Rao, Koppula Srinivas
    Nabi, Shaik Abdul
    Balaram, Allam
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 128 (03) : 2217 - 2243
  • [4] BIDIRECTIONAL RECURRENT NEURAL NETWORK LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION
    Arisoy, Ebru
    Sethy, Abhinav
    Ramabhadran, Bhuvana
    Chen, Stanley
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5421 - 5425
  • [5] A Comparative Study of Khasi Speech Recognition Systems with Recurrent Neural Network-Based Language Model
    Deepajothi, S.
    Rao, Vuda Sreenivasa
    Ambhika, C.
    Mandala, Vishwanadham
    Rao, R. V. V. N. Bheema
    Kumar, Shailendra
    Gera, Venkateswara Rao
    Nagaraju, D.
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (06) : 1296 - 1305
  • [6] Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition
    Masumura, Ryo
    Asami, Taichi
    Oba, Takanobu
    Sakauchi, Sumitaka
    Ito, Akinori
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (12) : 2557 - 2567
  • [7] Recurrent Neural Network-Based Dictionary Learning for Compressive Speech Sensing
    Ji, Yunyun
    Zhu, Wei-Ping
    Champagne, Benoit
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (08) : 3616 - 3643
  • [8] Recurrent Neural Network-Based Dictionary Learning for Compressive Speech Sensing
    Yunyun Ji
    Wei-Ping Zhu
    Benoit Champagne
    Circuits, Systems, and Signal Processing, 2019, 38 : 3616 - 3643
  • [9] A dynamic programming framework for neural network-based automatic speech segmentation
    van Vuuren, Van Zyl
    ten Bosch, Louis
    Niesler, Thomas
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2286 - 2290
  • [10] Deep Recurrent Neural Network-Based Identification of Precursor microRNAs
    Park, Seunghyun
    Min, Seonwoo
    Choi, Hyun-Soo
    Yoon, Sungroh
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30