Cross-Corpora Language Recognition: A Preliminary Investigation with Indian Languages

被引：7

作者：

Dey, Spandan ^{[1
]}

Saha, Goutam ^{[1
]}

Sahidullah, Md ^{[2
]}

机构：

[1] Indian Inst Technol, Dept E&ECE, Kharagpur, W Bengal, India

[2] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France

来源：

29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021) | 2021年

关键词：

Cross-corpora; language recognition; channel compensation; long-term average spectrum; TDNN; IDENTIFICATION;

D O I：

10.23919/EUSIPCO54536.2021.9616273

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we conduct one of the very first studies for cross-corpora performance evaluation in the spoken language identification (LID) problem. Cross-corpora evaluation was not explored much in LID research, especially for the Indian languages. We have selected three Indian spoken language corpora: IIITH-ILSC, LDC South Asian, and IITKGP-MLILSC. For each of the corpus, LID systems are trained on the state-of-the-art time-delay neural network (TDNN) based architecture with MFCC features. We observe that the LID performance degrades drastically for cross-corpora evaluation. For example, the system trained on the IIITH-ILSC corpus shows an average EER of 11.80 % and 43.34 % when evaluated with the same corpora and LDC South Asian corpora, respectively. Our preliminary analysis shows the significant differences among these corpora in terms of mismatch in the long-term average spectrum (LTAS) and signal-to-noise ratio (SNR). Subsequently, we apply different feature level compensation methods to reduce the cross-corpora acoustic mismatch. Our results indicate that these feature normalization schemes can help to achieve promising LID performance on cross-corpora experiments.

引用

页码：546 / 550

页数：5

共 50 条

[31] Cross-Corpora Evaluation and Analysis of Grammatical Error Correction Models - Is Single-Corpus Evaluation Enough?
Mita, Masato
Mizumoto, Tomoya
Kaneko, Masahiro
Nagata, Ryo
Inui, Kentaro
2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1309 - 1314
[32] Real-life emotion-related states detection in call centers: a cross-corpora study
Devillers, Laurence
Vaudable, Christophe
Chastagnol, Clement
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2350 - 2353
[33] Sign Language Recognition: Working with Limited Corpora
Cooper, Helen
Bowden, Richard
UNIVERSAL ACCESS IN HUMAN-COMPUTER INTERACTION: APPLICATIONS AND SERVICES, PT III, 2009, 5616 : 472 - 481
[34] Arabic Offensive and Hate Speech Detection Using a Cross-Corpora Multi-Task Learning Model
Aldjanabi, Wassen
Dahou, Abdelghani
Al-qaness, Mohammed A. A.
Abd Elaziz, Mohamed
Helmi, Ahmed Mohamed
Damasevicius, Robertas
INFORMATICS-BASEL, 2021, 8 (04):
[35] Multilingual Speaker Recognition on Indian Languages
Sarkar, Sourjya
Rao, K. Sreenivasa
Nandi, Dipanjan
Kumar, Sunil S. B.
2013 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2013,
[36] Indian Languages Corpus for Speech Recognition
Basu, Joyanta
Khan, Soma
Roy, Rajib
Saxena, Babita
Ganguly, Dipankar
Arora, Sunita
Arora, Karunesh Kumar
Bansal, Shweta
Agrawal, Shyam Sunder
2019 22ND CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2019, : 13 - 18
[37] The TDIL program and the Indian Language Corpora Initiative (ILCI)
Jha, Girish Nath
LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,
[38] Determination Of Linguistic Differences And Statistical Analysis Of Large Corpora Of Indian Languages
Bansal, Shweta
Mahajan, Minakshi
Agrawal, S. S.
2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
[39] Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT
Chronopoulou, Alexandra
Stojanovski, Dario
Fraser, Alexander
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2703 - 2711
[40] Using Games to Augment Corpora for Language Recognition and Confusability
Cieri, Christopher
Fiumara, James
Wright, Jonathan
INTERSPEECH 2021, 2021, : 1887 - 1891

← 1 2 3 4 5 →