Cross-Corpora Language Recognition: A Preliminary Investigation with Indian Languages

被引:7
|
作者
Dey, Spandan [1 ]
Saha, Goutam [1 ]
Sahidullah, Md [2 ]
机构
[1] Indian Inst Technol, Dept E&ECE, Kharagpur, W Bengal, India
[2] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
关键词
Cross-corpora; language recognition; channel compensation; long-term average spectrum; TDNN; IDENTIFICATION;
D O I
10.23919/EUSIPCO54536.2021.9616273
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we conduct one of the very first studies for cross-corpora performance evaluation in the spoken language identification (LID) problem. Cross-corpora evaluation was not explored much in LID research, especially for the Indian languages. We have selected three Indian spoken language corpora: IIITH-ILSC, LDC South Asian, and IITKGP-MLILSC. For each of the corpus, LID systems are trained on the state-of-the-art time-delay neural network (TDNN) based architecture with MFCC features. We observe that the LID performance degrades drastically for cross-corpora evaluation. For example, the system trained on the IIITH-ILSC corpus shows an average EER of 11.80 % and 43.34 % when evaluated with the same corpora and LDC South Asian corpora, respectively. Our preliminary analysis shows the significant differences among these corpora in terms of mismatch in the long-term average spectrum (LTAS) and signal-to-noise ratio (SNR). Subsequently, we apply different feature level compensation methods to reduce the cross-corpora acoustic mismatch. Our results indicate that these feature normalization schemes can help to achieve promising LID performance on cross-corpora experiments.
引用
收藏
页码:546 / 550
页数:5
相关论文
共 50 条
  • [31] Cross-Corpora Evaluation and Analysis of Grammatical Error Correction Models - Is Single-Corpus Evaluation Enough?
    Mita, Masato
    Mizumoto, Tomoya
    Kaneko, Masahiro
    Nagata, Ryo
    Inui, Kentaro
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1309 - 1314
  • [32] Real-life emotion-related states detection in call centers: a cross-corpora study
    Devillers, Laurence
    Vaudable, Christophe
    Chastagnol, Clement
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2350 - 2353
  • [33] Sign Language Recognition: Working with Limited Corpora
    Cooper, Helen
    Bowden, Richard
    UNIVERSAL ACCESS IN HUMAN-COMPUTER INTERACTION: APPLICATIONS AND SERVICES, PT III, 2009, 5616 : 472 - 481
  • [34] Arabic Offensive and Hate Speech Detection Using a Cross-Corpora Multi-Task Learning Model
    Aldjanabi, Wassen
    Dahou, Abdelghani
    Al-qaness, Mohammed A. A.
    Abd Elaziz, Mohamed
    Helmi, Ahmed Mohamed
    Damasevicius, Robertas
    INFORMATICS-BASEL, 2021, 8 (04):
  • [35] Multilingual Speaker Recognition on Indian Languages
    Sarkar, Sourjya
    Rao, K. Sreenivasa
    Nandi, Dipanjan
    Kumar, Sunil S. B.
    2013 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2013,
  • [36] Indian Languages Corpus for Speech Recognition
    Basu, Joyanta
    Khan, Soma
    Roy, Rajib
    Saxena, Babita
    Ganguly, Dipankar
    Arora, Sunita
    Arora, Karunesh Kumar
    Bansal, Shweta
    Agrawal, Shyam Sunder
    2019 22ND CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2019, : 13 - 18
  • [37] The TDIL program and the Indian Language Corpora Initiative (ILCI)
    Jha, Girish Nath
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,
  • [38] Determination Of Linguistic Differences And Statistical Analysis Of Large Corpora Of Indian Languages
    Bansal, Shweta
    Mahajan, Minakshi
    Agrawal, S. S.
    2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [39] Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT
    Chronopoulou, Alexandra
    Stojanovski, Dario
    Fraser, Alexander
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2703 - 2711
  • [40] Using Games to Augment Corpora for Language Recognition and Confusability
    Cieri, Christopher
    Fiumara, James
    Wright, Jonathan
    INTERSPEECH 2021, 2021, : 1887 - 1891