GROUP NONNEGATIVE MATRIX FACTORISATION WITH SPEAKER AND SESSION VARIABILITY COMPENSATION FOR SPEAKER IDENTIFICATION

被引:0
|
作者
Serizel, Romain [1 ]
Essid, Slim [1 ]
Richard, Gael [1 ]
机构
[1] Univ Paris Saclay, CNRS, LTCI, Telecom ParisTech, F-75013 Paris, France
关键词
Nonnegative matrix factorisation; spectrogram factorisation; feature learning; speaker variability; speaker identification;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a feature learning approach for speaker identification that is based on nonnegative matrix factorisation. Recent studies have shown that with such models, the dictionary atoms can represent well the speaker identity. The approaches proposed so far focused only on speaker variability and not on session variability. However, this later point is a crucial aspect in the success of the I-vector approach that is now the state-of-the-art in speaker identification. This paper proposes a method that relies on group nonnegative matrix factorisation and that is inspired by the I-vector training procedure. By doing so the proposed approach intends to capture both the speaker variability and the session variability. Results on a small corpus prove that the proposed approach can be competitive with I-vectors.
引用
收藏
页码:5470 / 5474
页数:5
相关论文
共 50 条
  • [1] SUPERVISED GROUP NONNEGATIVE MATRIX FACTORISATION WITH SIMILARITY CONSTRAINTS AND APPLICATIONS TO SPEAKER IDENTIFICATION
    Serizel, Romain
    Bisot, Victor
    Essid, Slim
    Richard, Gael
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 36 - 40
  • [2] A Comparison of Session Variability Compensation Approaches for Speaker Verification
    McLaren, Mitchell
    Vogt, Robert
    Baker, Brendan
    Sridharan, Sridha
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2010, 5 (04) : 802 - 809
  • [3] Robust Session Variability Compensation for SVM Speaker Verification
    Seo, Hyunson
    Jung, Chi-Sang
    Kang, Hong-Goo
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1631 - 1641
  • [4] Unsupervised Compensation of Intra-Session Intra-Speaker Variability for Speaker Diarization
    Aronowitz, Hagai
    [J]. ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 138 - 145
  • [5] Session variability subspace projection based model compensation for speaker verification
    Deng, Jing
    Zheng, Thomas Fang
    Wu, Wenhu
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 57 - +
  • [6] Speaker and session variability in GMM-based speaker verification
    Kenny, Patrick
    Boulianne, Gilles
    Ouellet, Pierre
    Dumouchel, Pierre
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1448 - 1460
  • [7] A bilevel framework for joint optimization of session compensation and classification for speaker identification
    Chen, Chen
    Wang, Wei
    He, Yongjun
    Han, Jiqing
    [J]. DIGITAL SIGNAL PROCESSING, 2019, 89 : 104 - 115
  • [8] Session Variability in Automatic Speaker Verification
    Hayet, Djellali
    Radia, Amirouche
    Akila, Djebbar
    Tayeb, Laskri Mohamed
    [J]. 2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 185 - 190
  • [9] Speaker recognition using temporal information and session variability compensation in a binary framework
    Hernandez-Sierra, Gabriel
    Calvo, Jose R.
    Bonastre, Jean-Francois
    [J]. INTELLIGENT DATA ANALYSIS, 2016, 20 : S83 - S94
  • [10] A Comparison of Session Variability Compensation Techniques for SVM-Based Speaker Recognition
    McLaren, Mitchell
    Vogt, Robbie
    Baker, Brendan
    Sridharan, Sridha
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2500 - 2503