Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks

被引:85
|
作者
Zazo, Ruben [1 ]
Lozano-Diez, Alicia [1 ]
Gonzalez-Dominguez, Javier [1 ]
Toledano, Doroteo T. [1 ]
Gonzalez-Rodriguez, Joaquin [1 ]
机构
[1] Univ Autonoma Madrid, ATVS Biometr Recognit Grp, Madrid, Spain
来源
PLOS ONE | 2016年 / 11卷 / 01期
关键词
SPEAKER;
D O I
10.1371/journal.pone.0146917
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Long Short Term Memory (LSTM) Recurrent Neural Networks (RNNs) have recently outperformed other state-of-the-art approaches, such as i-vector and Deep Neural Networks (DNNs), in automatic Language Identification (LID), particularly when dealing with very short utterances (similar to 3s). In this contribution we present an open-source, end-to-end, LSTM RNN system running on limited computational resources (a single GPU) that outperforms a reference i-vector system on a subset of the NIST Language Recognition Evaluation (8 target languages, 3s task) by up to a 26%. This result is in line with previously published research using proprietary LSTM implementations and huge computational resources, which made these former results hardly reproducible. Further, we extend those previous experiments modeling unseen languages (out of set, OOS, modeling), which is crucial in real applications. Results show that a LSTM RNN with OOS modeling is able to detect these languages and generalizes robustly to unseen OOS languages. Finally, we also analyze the effect of even more limited test data (from 2.25s to 0.1s) proving that with as little as 0.5s an accuracy of over 50% can be achieved.
引用
下载
收藏
页数:17
相关论文
共 50 条
  • [31] Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) Power Forecasting
    Alsabban, Maha S.
    Salem, Nema
    Malik, Hebatullah M.
    APPEEC 2021: 2021 13TH IEEE PES ASIA PACIFIC POWER & ENERGY ENGINEERING CONFERENCE (APPEEC), 2021,
  • [32] Using a Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) to Classify Network Attacks
    Muhuri, Pramita Sree
    Chatterjee, Prosenjit
    Yuan, Xiaohong
    Roy, Kaushik
    Esterline, Albert
    INFORMATION, 2020, 11 (05)
  • [33] Using a long short-term memory recurrent neural network (LSTM-RNN) to classify network attacks
    Muhuri P.S.
    Chatterjee P.
    Yuan X.
    Roy K.
    Esterline A.
    Information (Switzerland), 2020, 11 (05):
  • [34] WELL INTEGRITY OPERATIONS EXPERIENCE TRANSFER USING LONG SHORT-TERM MEMORY (LSTM) RECURRENT NEURAL NETWORK
    Semwogerere, David
    Pavlov, Alexey
    Sangesland, Sigbjorn
    PROCEEDINGS OF ASME 2023 42ND INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE & ARCTIC ENGINEERING, OMAE2023, VOL 9, 2023,
  • [35] Predicting Short-term Traffic Flow by Long Short-Term Memory Recurrent Neural Network
    Tian, Yongxue
    Pan, Li
    2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 153 - 158
  • [36] Approximate Computing for Long Short Term Memory (LSTM) Neural Networks
    Sen, Sanchari
    Raghunathan, Anand
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (11) : 2266 - 2276
  • [37] Age Estimation in Short Speech Utterances Based on LSTM Recurrent Neural Networks
    Zazo, Ruben
    Nidadavolu, Phani Sankar
    Chen, Nanxin
    Gonzalez-Rodriguez, Joaquin
    Dehak, Najim
    IEEE ACCESS, 2018, 6 : 22524 - 22530
  • [38] Comparing Long Short-Term Memory (LSTM) and bidirectional LSTM deep neural networks for power consumption prediction
    da Silva, Davi Guimaraes
    Meneses, Anderson Alvarenga de Moura
    ENERGY REPORTS, 2023, 10 : 3315 - 3334
  • [39] Rainfall-runoff modelling using Long Short-Term Memory (LSTM) networks
    Kratzert, Frederik
    Klotz, Daniel
    Brenner, Claire
    Schulz, Karsten
    Herrnegger, Mathew
    HYDROLOGY AND EARTH SYSTEM SCIENCES, 2018, 22 (11) : 6005 - 6022
  • [40] Predicting machine's performance record using the stacked long short-term memory (LSTM) neural networks
    Ma, Min
    Liu, Chenbin
    Wei, Ran
    Liang, Bin
    Dai, Jianrong
    JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2022, 23 (03):