GROUP NONNEGATIVE MATRIX FACTORISATION WITH SPEAKER AND SESSION VARIABILITY COMPENSATION FOR SPEAKER IDENTIFICATION

被引：0

作者：

Serizel, Romain ^{[1
]}

Essid, Slim ^{[1
]}

Richard, Gael ^{[1
]}

机构：

[1] Univ Paris Saclay, CNRS, LTCI, Telecom ParisTech, F-75013 Paris, France

来源：

2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年

关键词：

Nonnegative matrix factorisation; spectrogram factorisation; feature learning; speaker variability; speaker identification;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper presents a feature learning approach for speaker identification that is based on nonnegative matrix factorisation. Recent studies have shown that with such models, the dictionary atoms can represent well the speaker identity. The approaches proposed so far focused only on speaker variability and not on session variability. However, this later point is a crucial aspect in the success of the I-vector approach that is now the state-of-the-art in speaker identification. This paper proposes a method that relies on group nonnegative matrix factorisation and that is inspired by the I-vector training procedure. By doing so the proposed approach intends to capture both the speaker variability and the session variability. Results on a small corpus prove that the proposed approach can be competitive with I-vectors.

引用

页码：5470 / 5474

页数：5

共 50 条

[1] SUPERVISED GROUP NONNEGATIVE MATRIX FACTORISATION WITH SIMILARITY CONSTRAINTS AND APPLICATIONS TO SPEAKER IDENTIFICATION
Serizel, Romain
Bisot, Victor
Essid, Slim
Richard, Gael
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 36 - 40
[2] A Comparison of Session Variability Compensation Approaches for Speaker Verification
McLaren, Mitchell
Vogt, Robert
Baker, Brendan
Sridharan, Sridha
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2010, 5 (04) : 802 - 809
[3] Robust Session Variability Compensation for SVM Speaker Verification
Seo, Hyunson
Jung, Chi-Sang
Kang, Hong-Goo
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1631 - 1641
[4] Unsupervised Compensation of Intra-Session Intra-Speaker Variability for Speaker Diarization
Aronowitz, Hagai
[J]. ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 138 - 145
[5] Session variability subspace projection based model compensation for speaker verification
Deng, Jing
Zheng, Thomas Fang
Wu, Wenhu
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 57 - +
[6] Speaker and session variability in GMM-based speaker verification
Kenny, Patrick
Boulianne, Gilles
Ouellet, Pierre
Dumouchel, Pierre
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1448 - 1460
[7] A bilevel framework for joint optimization of session compensation and classification for speaker identification
Chen, Chen
Wang, Wei
He, Yongjun
Han, Jiqing
[J]. DIGITAL SIGNAL PROCESSING, 2019, 89 : 104 - 115
[8] Session Variability in Automatic Speaker Verification
Hayet, Djellali
Radia, Amirouche
Akila, Djebbar
Tayeb, Laskri Mohamed
[J]. 2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 185 - 190
[9] Speaker recognition using temporal information and session variability compensation in a binary framework
Hernandez-Sierra, Gabriel
Calvo, Jose R.
Bonastre, Jean-Francois
[J]. INTELLIGENT DATA ANALYSIS, 2016, 20 : S83 - S94
[10] A Comparison of Session Variability Compensation Techniques for SVM-Based Speaker Recognition
McLaren, Mitchell
Vogt, Robbie
Baker, Brendan
Sridharan, Sridha
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2500 - 2503

← 1 2 3 4 5 →