Cross-Corpora Language Recognition: A Preliminary Investigation with Indian Languages

被引:7
|
作者
Dey, Spandan [1 ]
Saha, Goutam [1 ]
Sahidullah, Md [2 ]
机构
[1] Indian Inst Technol, Dept E&ECE, Kharagpur, W Bengal, India
[2] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
关键词
Cross-corpora; language recognition; channel compensation; long-term average spectrum; TDNN; IDENTIFICATION;
D O I
10.23919/EUSIPCO54536.2021.9616273
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we conduct one of the very first studies for cross-corpora performance evaluation in the spoken language identification (LID) problem. Cross-corpora evaluation was not explored much in LID research, especially for the Indian languages. We have selected three Indian spoken language corpora: IIITH-ILSC, LDC South Asian, and IITKGP-MLILSC. For each of the corpus, LID systems are trained on the state-of-the-art time-delay neural network (TDNN) based architecture with MFCC features. We observe that the LID performance degrades drastically for cross-corpora evaluation. For example, the system trained on the IIITH-ILSC corpus shows an average EER of 11.80 % and 43.34 % when evaluated with the same corpora and LDC South Asian corpora, respectively. Our preliminary analysis shows the significant differences among these corpora in terms of mismatch in the long-term average spectrum (LTAS) and signal-to-noise ratio (SNR). Subsequently, we apply different feature level compensation methods to reduce the cross-corpora acoustic mismatch. Our results indicate that these feature normalization schemes can help to achieve promising LID performance on cross-corpora experiments.
引用
收藏
页码:546 / 550
页数:5
相关论文
共 50 条
  • [1] Cross-corpora spoken language identification with domain diversification and generalization
    Dey, Spandan
    Sahidullah, Md
    Saha, Goutam
    COMPUTER SPEECH AND LANGUAGE, 2023, 81
  • [2] VOICE-BASED SADNESS AND ANGER RECOGNITION WITH CROSS-CORPORA EVALUATION
    Toledo-Ronen, Orith
    Sorin, Alexander
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7517 - 7521
  • [3] Towards Cross-Corpora Generalization for Low-Resource Spoken Language Identification
    Dey, Spandan
    Sahidullah, Md
    Saha, Goutam
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 5040 - 5050
  • [4] Cross-Corpora Comparisons of Topics and Topic Trends
    Bystrov, Victor
    Naboka, Viktoriia
    Staszewska-Bystrova, Anna
    Winker, Peter
    JAHRBUCHER FUR NATIONALOKONOMIE UND STATISTIK, 2022, 242 (04): : 433 - 469
  • [5] An Investigation of Deep Neural Network Architectures for Language Recognition in Indian Languages
    Mounika, K., V
    Achanta, Sivanand
    Lakshmi, H. R.
    Gangashetty, Suryakanth V.
    Vuppala, Anil Kumar
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2930 - 2933
  • [6] Parameter optimization issues for cross-corpora emotion classification
    Vlasenko, Bogdan
    Philippou-Huebner, David
    Wendemuth, Andreas
    2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 454 - 459
  • [7] Cross-Corpora Unsupervised Learning of Trajectories in Autism Spectrum Disorders
    Elibol, Huseyin Melih
    Nguyen, Vincent
    Linderman, Scott
    Johnson, Matthew
    Hashmi, Amna
    Doshi-Velez, Finale
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [8] Development of speech corpora for speaker recognition research and evaluation in Indian languages
    Patil, Hemant
    Basu, T.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2008, 11 (01) : 17 - 32
  • [9] Cross-corpora unsupervised learning of trajectories in autism spectrum disorders
    Elibol, Huseyin Melih
    Nguyen, Vincent
    Linderman, Scott
    Johnson, Matthew
    Hashmi, Amna
    Doshi-Velez, Finale
    Journal of Machine Learning Research, 2016, 17 : 1 - 38
  • [10] Investigating heterogeneous protein annotations toward cross-corpora utilization
    Wang Y.
    Kim J.-D.
    Sætre R.
    Pyysalo S.
    Tsujii J.
    BMC Bioinformatics, 10 (1)