Speaker Clustering Based on Non-Negative Matrix Factorization Using Gaussian Mixture Model in Complementary Subspace

被引:0
|
作者
Nishida, Masafumi [1 ]
Yamamoto, Seiichi [2 ]
机构
[1] Shizuoka Univ, Dept Informat, Shizuoka, Japan
[2] Doshisha Univ, Dept Informat & Comp Sci, Kyoto, Japan
关键词
ACM proceedings; text tagging; DIARIZATION;
D O I
10.1145/3095713.3095721
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech feature variations are mainly attributed to variations in phonetic and speaker information included in speech data. If these two types of information are separated from each other, more robust speaker clustering can be achieved. Principal component analysis transformation can separate speaker information from phonetic information, under the assumption that a space with large within-speaker variance is a "phonetic subspace" and a space within-speaker variance is a "phonetic sub-space". We propose a speaker clustering method based on non-negative matrix factorization using a Gaussian mixture model trained in the speaker subspace. We carried out comparative experiments of the proposed method with conventional methods based on Bayesian information criterion and Gaussian mixture model in an observation space. The experimental results showed that the proposed method can achieve higher clustering accuracy than conventional methods.
引用
下载
收藏
页数:5
相关论文
共 50 条
  • [21] Rapid speaker adaptation in latent speaker space with non-negative matrix factorization
    Zhang, Xueru
    Demuynck, Kris
    Van Hamme, Hugo
    SPEECH COMMUNICATION, 2013, 55 (09) : 893 - 908
  • [22] DCCNMF: Deep Complementary and Consensus Non-negative Matrix Factorization for multi-view clustering
    Gunawardena, Sohan
    Luong, Khanh
    Balasubramaniam, Thirunavukarasu
    Nayak, Richi
    KNOWLEDGE-BASED SYSTEMS, 2024, 285
  • [23] Consensus and complementary regularized non-negative matrix factorization for multi-view image clustering
    Li, Guopeng
    Song, Dan
    Bai, Wei
    Han, Kun
    Tharmarasa, Ratnasingham
    INFORMATION SCIENCES, 2023, 623 : 524 - 538
  • [24] A modular non-negative matrix factorization for parts-based object recognition using subspace representation
    Bajla, Ivan
    Soukup, Daniel
    IMAGE PROCESSING: MACHINE VISION APPLICATIONS, 2008, 6813
  • [25] Enhanced clustering of biomedical documents using ensemble non-negative matrix factorization
    Huang, Xiaodi
    Zheng, Xiaodong
    Yuan, Wei
    Wang, Fei
    Zhu, Shanfeng
    INFORMATION SCIENCES, 2011, 181 (11) : 2293 - 2302
  • [26] Document Clustering Based on Non-Negative Matrix Factorization and Affinity Propagation Using Preference Estimation
    Chen, Jiawei
    Li, Fei
    Wu, Xiaofan
    Zhang, Qinqin
    INDUSTRIAL ENGINEERING, MACHINE DESIGN AND AUTOMATION (IEMDA 2014) & COMPUTER SCIENCE AND APPLICATION (CCSA 2014), 2015, : 380 - 385
  • [27] Query based summarization using non-negative matrix factorization
    Park, Sun
    Lee, Ju-Hong
    Ahn, Chan-Min
    Hong, Jun Sik
    Chun, Seok-Ju
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS, 2006, 4253 : 84 - 89
  • [28] General subspace constrained non-negative matrix factorization for data representation
    Liu, Yong
    Liao, Yiyi
    Tang, Liang
    Tang, Feng
    Liu, Weicong
    NEUROCOMPUTING, 2016, 173 : 224 - 232
  • [29] FACIAL EXPRESSION RECOGNITION USING CLUSTERING DISCRIMINANT NON-NEGATIVE MATRIX FACTORIZATION
    Nikitidis, Symeon
    Tefas, Anastasios
    Nikolaidis, Nikos
    Pitas, Ioannis
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [30] Transferred Subspace Learning Based on Non-negative Matrix Factorization for EEG Signal Classification
    Dong, Aimei
    Li, Zhigang
    Zheng, Qiuyu
    FRONTIERS IN NEUROSCIENCE, 2021, 15