STBU system for the NIST 2006 speaker recognition evaluation

Cited by: 0
Authors
Matejka, P. [1]
Burget, L. [1]
Schwarz, P. [1]
Glembek, O. [1]
Karafiat, M. [1]
Grezl, F. [1]
Cernocky, J. [1]
van Leeuwen, D. A. [2]
Bruemmer, N. [3]
Strasheim, A. [4]
Affiliations
[1] Brno Univ Technol, Fac Informat Technol, Speech FIT, Brno, Czech Republic
[2] TNO Human Factors, Soesterberg, Netherlands
[3] Spescom Data Voice, Stellenbosch, South Africa
[4] Univ Stellenbosch, Dept Elect & Elect Engn, ZA-7600 Stellenbosch, South Africa
Keywords
speaker recognition; GMM; SVM; eigen-channel; NAP
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
This paper describes the STBU 2006 speaker recognition system, which performed well in the NIST 2006 speaker recognition evaluation. STBU is a consortium of four partners: Spescom DataVoice (South Africa), TNO (Netherlands), BUT (Czech Republic) and the University of Stellenbosch (South Africa). The primary system is a combination of three main kinds of systems: (1) GMM, with short-time MFCC or PLP features; (2) GMM-SVM, using GMM mean supervectors as input; and (3) MLLR-SVM, using MLLR speaker adaptation coefficients derived from an English LVCSR system. In this paper, we describe these sub-systems and present results for each system alone and in combination on the NIST Speaker Recognition Evaluation (SRE) 2006 development and evaluation data sets.
Pages: 221 / +
Page count: 2