KALAKA: a TV Broadcast Speech Database for the Evaluation of Language Recognition Systems

Cited by: 0
Authors
Rodriguez-Fuentes, Luis J. [1]
Penagarikano, Mikel [1]
Bordel, German [1]
Varona, Amparo [1]
Diez, Mireia [1]
Affiliations
[1] Univ Basque Country, Dept Elect & Elect, Software Technol Working Grp, Leioa 48940, Spain
Keywords
SUPPORT VECTOR MACHINES; SPEAKER
DOI
Not available
Chinese Library Classification number
H [Language, Writing]
Subject classification code
05
Abstract
A speech database, named KALAKA, was created to support the Albayzin 2008 Evaluation of Language Recognition Systems, organized by the Spanish Network on Speech Technologies from May to November 2008. This evaluation, designed according to the criteria and methodology applied in the NIST Language Recognition Evaluations, involved four target languages: Basque, Catalan, Galician and Spanish (official languages in Spain), and included speech signals in other (unknown) languages to allow open-set verification trials. In this paper, the process of designing, collecting data and building the train, development and evaluation datasets of KALAKA is described. Results attained in the Albayzin 2008 LRE are presented as a means of evaluating the database. The performance of a state-of-the-art language recognition system on a closed-set evaluation task is also presented for reference. Future work includes extending KALAKA by adding Portuguese and English as target languages and renewing the set of unknown languages needed to carry out open-set evaluations.
Pages: 1678-1685
Page count: 8