A Comparison of Session Variability Compensation Approaches for Speaker Verification

Cited by: 8
Authors
McLaren, Mitchell [1]
Vogt, Robert [1]
Baker, Brendan [1]
Sridharan, Sridha [1]
Affiliations
[1] Queensland Univ Technol, SAIVT Grp, Speech & Audio Res Lab, Brisbane, Qld 4001, Australia
Funding
Australian Research Council
Keywords
Factor analysis; nuisance attribute projection (NAP); session variation; speaker verification; support vector machine (SVM)
DOI
10.1109/TIFS.2010.2068290
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
This paper compares two of the leading techniques for session variability compensation in the context of support vector machine (SVM) speaker verification using Gaussian mixture model (GMM) mean supervectors: joint factor analysis (JFA) modeling and nuisance attribute projection (NAP). Motivation for this comparison comes from the distinctly different domains in which these techniques are employed: the probabilistic GMM domain versus the discriminative SVM kernel. A theoretical analysis is given comparing the JFA and NAP approaches to variability compensation. The role of speaker factors in the factor analysis model is also contrasted against the scatter difference NAP objective of retaining speaker information in the SVM kernel space. These methods for retaining speaker variation are found to provide improved verification performance over the removal of channel effects alone. Overall, experimental results on the NIST 2006 and 2008 SRE corpora demonstrate the effectiveness of both JFA and NAP techniques for reducing the effects of session variability. However, the overheads associated with the implementation of JFA may make NAP a more attractive technique due to its simple yet effective approach to variability compensation.
Pages: 802 - 809
Page count: 8
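
The abstract describes NAP as a simple way to remove session (channel) variability from GMM mean supervectors before SVM scoring. As a rough illustration only, the sketch below builds a basic NAP projection from labeled supervectors: it removes each speaker's mean, takes the leading directions of the remaining within-speaker variation, and projects them out. It does not implement the scatter difference variant or JFA discussed in the paper, and the function name `nap_projection`, the `rank` parameter, and the synthetic data are all assumptions made for this example.

```python
import numpy as np


def nap_projection(supervectors, speaker_ids, rank):
    """Return a basic NAP projection matrix P = I - U U^T (illustrative sketch).

    U holds the top `rank` directions of within-speaker (session) variation.
    """
    X = np.asarray(supervectors, dtype=float)
    ids = np.asarray(speaker_ids)

    # Subtract each speaker's mean so only session variation remains.
    centered = np.empty_like(X)
    for spk in np.unique(ids):
        mask = ids == spk
        centered[mask] = X[mask] - X[mask].mean(axis=0)

    # Right singular vectors give the principal session-variation directions.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    U = vt[:rank].T  # shape (dim, rank), orthonormal columns

    # Forming the full matrix is fine for a toy dimension; real supervectors
    # are large enough that one would apply x - U (U^T x) instead.
    return np.eye(X.shape[1]) - U @ U.T


# Synthetic data (hypothetical): 5 speakers, 8 sessions each, with strong
# session noise along a single known nuisance direction.
rng = np.random.default_rng(0)
dim, n_speakers, n_sessions = 20, 5, 8
nuisance = np.zeros(dim)
nuisance[0] = 1.0
speaker_means = rng.normal(size=(n_speakers, dim))
ids = np.repeat(np.arange(n_speakers), n_sessions)
X = (
    speaker_means[ids]
    + 5.0 * rng.normal(size=(len(ids), 1)) * nuisance
    + 0.1 * rng.normal(size=(len(ids), dim))
)

P = nap_projection(X, ids, rank=1)
X_comp = X @ P  # P is symmetric, so this projects each row
```

In an SVM system the projection would typically be applied to both training and test supervectors before the kernel is evaluated. The rank of the nuisance subspace is a tuning choice and is set to 1 here only because the toy data has a single nuisance direction.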