SPEAKER RECOGNITION USING SYLLABLE-BASED CONSTRAINTS FOR CEPSTRAL FRAME SELECTION

被引:14
|
作者
Bocklet, Tobias [1 ]
Shriberg, Elizabeth [2 ]
机构
[1] Univ Erlangen Nurnberg, D-8520 Erlangen, Germany
[2] SRI Int, Menlo Pk, CA 94025 USA
关键词
Speaker recognition; higher-level features; GMMs; cepstral features; MFCCs; syllables;
D O I
10.1109/ICASSP.2009.4960636
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We, describe a new GMM-UBM speaker recognition system that uses standard cepstral features, but selects different frames of speech for different subsystems. Subsystems, or "constraints", are based on syllable-level information and combined at the score level. Results on both the NIST 2006 and 2008 test data sets for the English telephone train and test condition reveal that a set of eight constraints performs extremely well, resulting in better performance than other commonly-used cepstral models. Given the still largely-unexplored world of possible constraints and combinations, it is likely that the approach can be even further improved.
引用
收藏
页码:4525 / +
页数:2
相关论文
共 50 条
  • [1] Syllable-Based Speech Recognition Using EMG
    Lopez-Larraz, Eduardo
    Mozos, Oscar M.
    Antelis, Javier M.
    Minguez, Javier
    [J]. 2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 4699 - 4702
  • [2] Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMM
    Nakagawa, S
    Zhang, W
    Takahashi, M
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 81 - 84
  • [3] Automatic syllable-based phoneme recognition using ESTER corpus
    Le Blouch, Olivier
    Collen, Patrice
    [J]. PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTATIONAL GEOMETRY AND ARTIFICIAL VISION (ISCGAV'-07), 2007, : 77 - +
  • [4] SYLLABLE-BASED SPEECH RECOGNITION USING ELECTROMYOGRAPHY AND DECISION SET CLASSIFIER
    Topalovic, Marko
    Damnjanovic, Dorde
    Peulic, Aleksandar
    Blagojevic, Milan
    Filipovic, Nenad
    [J]. BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2015, 27 (02):
  • [5] Syllable-based automatic Arabic speech recognition
    Azmi, Mohamed Mostafa
    Tolba, Hesham
    Mahdy, Sherif
    Fashal, Mervat
    [J]. PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, ROBOTICS AND AUTOMATION: ADVANCED TOPICS ON SIGNAL PROCESSING, ROBOTICS AND AUTOMATION, 2008, : 246 - +
  • [6] Syllable-based Myanmar Language Model for Speech Recognition
    Soe, Wunna
    Thein, Yadana
    [J]. 2015 IEEE/ACIS 14TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2015, : 291 - 296
  • [7] Text-independent/text-prompted speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMM
    Nakagawa, S
    Zhang, W
    Takahashi, M
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (03): : 1058 - 1065
  • [8] A CEPSTRAL BASED SPEAKER RECOGNITION SYSTEM
    SETHURAMAN, R
    GOWDY, JN
    [J]. PROCEEDINGS : THE TWENTY-FIRST SOUTHEASTERN SYMPOSIUM ON SYSTEM THEORY, 1989, : 503 - 507
  • [9] Syllable-based large vocabulary continuous speech recognition
    Ganapathiraju, A
    Hamaker, J
    Picone, J
    Ordowski, M
    Doddington, GR
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (04): : 358 - 366
  • [10] Syllable-Based Recognition of Arabic & English Digits in Noisy Environment
    Azmi, Mohamed M.
    Tolba, Hesham
    [J]. ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 583 - +