Cross-Corpora Language Recognition: A Preliminary Investigation with Indian Languages

被引:7
|
作者
Dey, Spandan [1 ]
Saha, Goutam [1 ]
Sahidullah, Md [2 ]
机构
[1] Indian Inst Technol, Dept E&ECE, Kharagpur, W Bengal, India
[2] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
关键词
Cross-corpora; language recognition; channel compensation; long-term average spectrum; TDNN; IDENTIFICATION;
D O I
10.23919/EUSIPCO54536.2021.9616273
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we conduct one of the very first studies for cross-corpora performance evaluation in the spoken language identification (LID) problem. Cross-corpora evaluation was not explored much in LID research, especially for the Indian languages. We have selected three Indian spoken language corpora: IIITH-ILSC, LDC South Asian, and IITKGP-MLILSC. For each of the corpus, LID systems are trained on the state-of-the-art time-delay neural network (TDNN) based architecture with MFCC features. We observe that the LID performance degrades drastically for cross-corpora evaluation. For example, the system trained on the IIITH-ILSC corpus shows an average EER of 11.80 % and 43.34 % when evaluated with the same corpora and LDC South Asian corpora, respectively. Our preliminary analysis shows the significant differences among these corpora in terms of mismatch in the long-term average spectrum (LTAS) and signal-to-noise ratio (SNR). Subsequently, we apply different feature level compensation methods to reduce the cross-corpora acoustic mismatch. Our results indicate that these feature normalization schemes can help to achieve promising LID performance on cross-corpora experiments.
引用
收藏
页码:546 / 550
页数:5
相关论文
共 50 条
  • [21] Large Web Corpora of High Quality for Indian Languages
    Quasthoff, Uwe
    Mitra, Ritwik
    Mitra, Sunny
    Eckart, Thomas
    Goldhahn, Dirk
    Goyal, Pawan
    Mukherjee, Animesh
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [22] IndicNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian Languages
    Kakwani, Divyanshu
    Kunchukuttan, Anoop
    Golla, Satish
    Gokul, N. C.
    Bhattacharyya, Avik
    Khapra, Mitesh M.
    Kumar, Pratyush
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4948 - 4961
  • [23] End-to-end Argument Mining with Cross-corpora Multi-task Learning
    Morio, Gaku
    Ozaki, Hiroaki
    Morishita, Terufumi
    Yanai, Kohsuke
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 639 - 658
  • [24] An SVM Based Approach to Cross-Language Adaptation for Indian Languages
    Raju, A. Vijaya Rama
    Sekhar, C. Chandra
    ADVANCES IN NEURO-INFORMATION PROCESSING, PT II, 2009, 5507 : 394 - 401
  • [25] Corpus-Based Translation Induction in Indian Languages Using Auxiliary Language Corpora from Wikipedia
    Tholpadi, Goutham
    Bhattacharyya, Chiranjib
    Shevade, Shirish
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2017, 16 (03)
  • [26] Scaling Multilingual Corpora and Language Models to 500 Languages
    Imani, Ayyoob
    Lin, Peiqin
    Kargaran, Amir Hossein
    Severini, Silvia
    Sabet, Masoud Jalili
    Kassner, Nora
    Ma, Chunlan
    Schmid, Helmut
    Martins, Andre F. T.
    Yvon, Francois
    Schuetze, Hinrich
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 1082 - 1117
  • [27] How Bad are PoS Taggers in Cross-Corpora Settings? Evaluating Annotation Divergence in the UD Project
    Wisniewski, Guillaume
    Yvon, Francois
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 218 - 227
  • [28] Development of multi-lingual spoken corpora of Indian languages
    Samudravijaya, K.
    Chinese Spoken Language Processing, Proceedings, 2006, 4274 : 792 - 801
  • [29] Cross-Corpora Convolutional Deep Neural Network Dereverberation Preprocessing for Speaker Verification and Speech Enhancement
    Guzewich, Peter
    Zahorian, Stephen
    Chen, Xiao
    Zhang, Hao
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1329 - 1333
  • [30] Multilingual Speech Recognition Using Language-Specific Phoneme Recognition as Auxiliary Task for Indian Languages
    Sailor, Hardik B.
    Hain, Thomas
    INTERSPEECH 2020, 2020, : 4756 - 4760