Spoken Language Identification with Deep Convolutional Neural Network and Data Augmentation

被引:0
|
作者
Korkut, Can [1 ]
Haznedaroglu, Ali [1 ]
Arslan, Levent M. [1 ,2 ]
机构
[1] Sestek, Istanbul, Turkey
[2] Bogazici Univ, Elekt Elekt Muhendisligi Bolumu, Istanbul, Turkey
关键词
Spoken Language Identification; CNN; Data Augmentation; SPEECH;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a spoken language detection system based on deep convolutional neural networks is presented. The neural network model is trained and tested on a speech dataset containing five languages. Speech signals are first converted into mel-spectrogram features and these features are fed into the deep convolutional neural network. Flattened outputs of the deep convolutional network are then fed into a recurrent layer, and a dense layer with softmax activation function is used as an output layer to predict the output language probabilities. This network results in 0.89 F1-score in our test data. We also used a data augmentation method, namely Spec Augment, which increased the F1-score to 0.94.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Spoken Language Identification Using Convolutional Neural Network In Nepalese Context
    Sapkota, Shiva Sagar
    Shakya, Aman
    Joshi, Basanta
    Proceedings of 2023 26th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardization of Speech Databases and Assessment Techniques, O-COCOSDA 2023, 2023,
  • [2] Spoken Language Identification System Using Convolutional Recurrent Neural Network
    Alashban, Adal A.
    Qamhan, Mustafa A.
    Meftah, Ali H.
    Alotaibi, Yousef A.
    APPLIED SCIENCES-BASEL, 2022, 12 (18):
  • [3] DATA AUGMENTATION FOR DEEP CONVOLUTIONAL NEURAL NETWORK ACOUSTIC MODELING
    Cui, Xiaodong
    Goel, Vaibhava
    Kingsbury, Brian
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4545 - 4549
  • [4] Ethio-Semitic language identification using convolutional neural networks with data augmentation
    Amlakie Aschale Alemu
    Malefia Demilie Melese
    Ayodeji Olalekan Salau
    Multimedia Tools and Applications, 2024, 83 : 34499 - 34514
  • [5] Ethio-Semitic language identification using convolutional neural networks with data augmentation
    Alemu, Amlakie Aschale
    Melese, Malefia Demilie
    Salau, Ayodeji Olalekan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 34499 - 34514
  • [6] Identification of natural selection in genomic data with deep convolutional neural network
    Fadja, Arnaud Nguembang
    Riguzzi, Fabrizio
    Bertorelle, Giorgio
    Trucchi, Emiliano
    BIODATA MINING, 2021, 14 (01)
  • [7] Identification of natural selection in genomic data with deep convolutional neural network
    Arnaud Nguembang Fadja
    Fabrizio Riguzzi
    Giorgio Bertorelle
    Emiliano Trucchi
    BioData Mining, 14
  • [8] Image recognition of interference fringes in polishing by convolutional neural network with data augmentation by deep convolutional generative adversarial network
    Chen, Yi-Huei
    Lin, Wei-Ting
    Liu, Chun-Wei
    OPTICAL ENGINEERING, 2022, 61 (04)
  • [9] Data augmentation based morphological classification of galaxies using deep convolutional neural network
    Ansh Mittal
    Anu Soorya
    Preeti Nagrath
    D. Jude Hemanth
    Earth Science Informatics, 2020, 13 : 601 - 617
  • [10] Environmental sound classification using a regularized deep convolutional neural network with data augmentation
    Mushtaq, Zohaib
    Su, Shun-Feng
    APPLIED ACOUSTICS, 2020, 167