KALAKA: a TV Broadcast Speech Database for the Evaluation of Language Recognition Systems

Cited by: 0

Authors:

Rodriguez-Fuentes, Luis J. [1]

Penagarikano, Mikel [1]

Bordel, German [1]

Varona, Amparo [1]

Diez, Mireia [1]

Affiliation:

[1] Univ Basque Country, Dept Elect & Elect, Software Technol Working Grp, Leioa 48940, Spain

Source:

LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2010

Keywords:

SUPPORT VECTOR MACHINES; SPEAKER;

DOI:

Not available

Chinese Library Classification:

H [Languages, Writing];

Discipline classification code:

05 ;

Abstract:

A speech database, named KALAKA, was created to support the Albayzin 2008 Evaluation of Language Recognition Systems, organized by the Spanish Network on Speech Technologies from May to November 2008. This evaluation, designed according to the criteria and methodology applied in the NIST Language Recognition Evaluations, involved four target languages: Basque, Catalan, Galician and Spanish (official languages in Spain), and included speech signals in other (unknown) languages to allow open-set verification trials. In this paper, the process of designing, collecting data and building the train, development and evaluation datasets of KALAKA is described. Results attained in the Albayzin 2008 LRE are presented as a means of evaluating the database. The performance of a state-of-the-art language recognition system on a closed-set evaluation task is also presented for reference. Future work includes extending KALAKA by adding Portuguese and English as target languages and renewing the set of unknown languages needed to carry out open-set evaluations.

Pages: 1678-1685

Page count: 8
