SOFT NONNEGATIVE MATRIX CO-FACTORIZATION WITH APPLICATION TO MULTIMODAL SPEAKER DIARIZATION

Authors
Seichepine, N. [1]
Essid, S. [1]
Fevotte, C. [2]
Cappe, O. [3]
Affiliations
[1] Telecom ParisTech, Inst Mines Telecom, CNRS LTCI, Paris, France
[2] Univ Nice, CNRS, OCA, Lab Lagrange, Nice, France
[3] Telecom ParisTech, CNRS LTCI, Paris, France
Keywords
Nonnegative matrix factorization; co-factorization; multimodality; speaker diarization
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper presents a new method for bimodal nonnegative matrix factorization (NMF). The method is well suited to situations where two streams of data are analyzed concurrently and are expected to be related by loosely common factors. It performs a soft co-factorization, which takes into account the relationship between the modalities being processed but returns different factors for distinct modalities. The data associated with each modality need not live in the same feature space, nor have the same dimensionality. The co-factorization is obtained via a majorization-minimization (MM) algorithm. The behavior of the method is illustrated on both synthetic and real-world data. In particular, we show that exploiting the correlation between the audio and video modalities in edited talk-show videos improves speaker diarization results.
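The idea of a soft co-factorization can be illustrated with a minimal sketch. The abstract does not state the cost function, so everything specific here is an assumption rather than the paper's formulation: a squared Euclidean data fit for each modality, a squared l2 penalty coupling the two activation matrices, a shared number of components `K`, and a common time axis. The names `soft_cofactorize`, `objective` and `lam` are hypothetical. The multiplicative updates below are the MM updates for this assumed cost; the scale normalization of the factors that a practical method would need is left out.

```python
import numpy as np


def objective(V1, V2, W1, H1, W2, H2, lam):
    """Assumed cost: two squared-error fits plus a soft coupling of H1 and H2."""
    fit1 = np.sum((V1 - W1 @ H1) ** 2)
    fit2 = np.sum((V2 - W2 @ H2) ** 2)
    coupling = lam * np.sum((H1 - H2) ** 2)
    return fit1 + fit2 + coupling


def soft_cofactorize(V1, V2, K, lam=1.0, n_iter=200, seed=0, eps=1e-12):
    """Factorize V1 ~ W1 @ H1 and V2 ~ W2 @ H2 with H1 softly tied to H2.

    V1 is (F1, N) and V2 is (F2, N): the feature dimensions may differ,
    only the number of frames N must match. Illustrative sketch only.
    """
    if V1.shape[1] != V2.shape[1]:
        raise ValueError("V1 and V2 must have the same number of columns")
    rng = np.random.default_rng(seed)
    F1, N = V1.shape
    F2 = V2.shape[0]
    # Strictly positive initialization keeps multiplicative updates well defined.
    W1 = rng.random((F1, K)) + 0.1
    W2 = rng.random((F2, K)) + 0.1
    H1 = rng.random((K, N)) + 0.1
    H2 = rng.random((K, N)) + 0.1

    history = []
    for _ in range(n_iter):
        # Activations: the coupling adds lam * H_other to the numerator
        # and lam * H_self to the denominator of the usual NMF update.
        H1 *= (W1.T @ V1 + lam * H2) / (W1.T @ W1 @ H1 + lam * H1 + eps)
        W1 *= (V1 @ H1.T) / (W1 @ (H1 @ H1.T) + eps)
        H2 *= (W2.T @ V2 + lam * H1) / (W2.T @ W2 @ H2 + lam * H2 + eps)
        W2 *= (V2 @ H2.T) / (W2 @ (H2 @ H2.T) + eps)
        history.append(objective(V1, V2, W1, H1, W2, H2, lam))
    return W1, H1, W2, H2, history


if __name__ == "__main__":
    # Synthetic bimodal data: related, but not identical, activations.
    rng = np.random.default_rng(1)
    H_true = rng.random((3, 60))
    V1 = rng.random((20, 3)) @ H_true
    V2 = rng.random((8, 3)) @ (H_true + 0.05 * rng.random((3, 60)))
    W1, H1, W2, H2, history = soft_cofactorize(V1, V2, K=3, lam=0.5)
    print("first and last objective values:", history[0], history[-1])
```

With `lam=0` the sketch reduces to two independent NMFs; larger values pull `H1` and `H2` toward each other without forcing them to be equal, which is the sense in which the coupling is soft.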
Pages: 3537-3541
Number of pages: 5