A PARTIAL LEAST SQUARES FRAMEWORK FOR SPEAKER RECOGNITION

被引:0
|
作者
Srinivasan, Balaji Vasan [1 ]
Zotkin, Dmitry N.
Duraiswami, Ramani
机构
[1] Univ Maryland, Dept Comp Sci, Perceptual Interfaces & Real Lab, College Pk, MD 20742 USA
关键词
Partial least squares; speaker recognition; latent vector; GMM supervectors;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Modern approaches to speaker recognition (verification) operate in a space of "supervectors" created via concatenation of the mean vectors of a Gaussian mixture model (GMM) adapted from a universal background model (UBM). In this space, a number of approaches to model inter-class separability and nuisance attribute variability have been proposed. We develop a method for modeling the variability associated with each class (speaker) by using partial-least-squares - a latent variable modeling technique, which isolates the most informative subspace for each speaker. The method is tested on NIST SRE 2008 data and provides promising results. The method is shown to be noise-robust and to be able to efficiently learn the subspace corresponding to a speaker on training data consisting of multiple utterances.
引用
收藏
页码:5276 / 5279
页数:4
相关论文
共 50 条
  • [1] A Symmetric Kernel Partial Least Squares Framework for Speaker Recognition
    Srinivasan, Balaji Vasan
    Luo, Yuancheng
    Garcia-Romero, Daniel
    Zotkin, Dmitry N.
    Duraiswami, Ramani
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (07): : 1415 - 1423
  • [2] Kernel partial least squares for speaker recognition
    Srinivasan, Balaji Vasan
    Garcia-Romero, Daniel
    Zotkin, Dmitry N.
    Duraiswami, Ramani
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 500 - +
  • [3] Speaker recognition using least squares IOHMMs
    Mukherjee, N
    PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 276 - 279
  • [4] Mitigating Effects of Recording Condition Mismatch in Speaker Recognition Using Partial Least Squares
    Remus, Jeremiah J.
    Estrada, Jenniffer M.
    Schuckers, Stephanie A. C.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2673 - 2676
  • [5] MKPLS: Manifold Kernel Partial Least Squares for Lipreading and Speaker Identification
    Bakry, Amr
    Elgammal, Ahmed
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 684 - 691
  • [6] Face recognition using partial least squares components
    Baek, J
    Kim, M
    PATTERN RECOGNITION, 2004, 37 (06) : 1303 - 1306
  • [7] A Joint Bayesian Framework Based on Partial Least Squares Discriminant Analysis for Finger Vein Recognition
    Zhang, Liping
    Sun, Linjun
    Li, Weijun
    Zhang, Jianming
    Cai, Weiwei
    Cheng, Chuantong
    Ning, Xin
    IEEE SENSORS JOURNAL, 2022, 22 (01) : 785 - 794
  • [8] Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition
    Liu, Mengyi
    Wang, Ruiping
    Huang, Zhiwu
    Shan, Shiguang
    Chen, Xilin
    ICMI'13: PROCEEDINGS OF THE 2013 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2013, : 525 - 530
  • [9] Infrared face recognition based on DCT and partial least squares
    Xie, Zhihua
    Liu, Guodong
    Communications in Computer and Information Science, 2014, 437 : 67 - 73
  • [10] Two Dimension Nonnegative Partial Least Squares for Face Recognition
    Ge, Yongxin
    Bu, Wenbin
    Yang, Dan
    Feng, Xin
    Zhang, Xiaohong
    2013 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2013,