A speaker recognition method based on GMM using non-negative matrix factorization

Cited by: 0
Authors
Huang, Liming [1]
Liu, Dongbo [1]
Fang, Yu [1]
Wang, Weibo [1]
Institutions
[1] Xi Hua Univ, Sch Elect & Elect Informat, Chengdu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
NMF; speaker recognition; GMM; feature fusion; ALGORITHMS;
DOI
10.1109/ICIEA54703.2022.10006158
Chinese Library Classification
T [Industrial Technology];
Discipline classification code
08;
Abstract
Speaker recognition improves the efficiency of dialogue-based communication. In this paper, we propose a speaker recognition method based on feature fusion with non-negative matrix factorization (NMF). The method uses NMF to decompose the speaker's speech spectrogram, extracts features from the resulting basis matrix, and fuses them with Mel-frequency cepstral coefficients (MFCC). Experimental results show that the fused features improve speaker recognition accuracy in low signal-to-noise ratio (SNR) environments. The method is compared with other speaker recognition methods that use MFCC features.
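The decomposition and fusion steps described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say which NMF algorithm is used or how features are derived from the basis matrix, so the sketch assumes Lee-Seung multiplicative updates for the factorization and, as a placeholder, uses the column-normalized basis matrix flattened into a vector. The function names `nmf` and `fuse_features`, the rank of 4, and the random stand-ins for the spectrogram and MFCCs are all hypothetical. The GMM modelling stage is omitted.

```python
import numpy as np


def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Approximate a non-negative matrix V (freq x frames) as W @ H.

    Assumption: Lee-Seung multiplicative updates minimizing the
    Frobenius error; the paper may use a different NMF variant.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W = rng.random((n_freq, rank)) + eps   # basis matrix
    H = rng.random((rank, n_frames)) + eps  # activation matrix
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def fuse_features(mfcc, W, eps=1e-9):
    """Concatenate an utterance-level MFCC summary with basis features.

    Assumption: the basis-derived feature is simply the flattened,
    column-normalized basis matrix; this is a placeholder choice.
    mfcc has shape (frames, n_mfcc).
    """
    mfcc_mean = mfcc.mean(axis=0)
    W_norm = W / (W.sum(axis=0, keepdims=True) + eps)
    return np.concatenate([mfcc_mean, W_norm.ravel()])


# Synthetic stand-ins: a 32-bin x 50-frame magnitude spectrogram
# and 13 MFCCs per frame (real inputs would come from audio).
rng = np.random.default_rng(1)
V = rng.random((32, 4)) @ rng.random((4, 50))
mfcc = rng.standard_normal((50, 13))

W, H = nmf(V, rank=4)
fused = fuse_features(mfcc, W)
```

In a full pipeline, fused vectors of this kind (or frame-level equivalents) would be passed to a per-speaker GMM for scoring; that stage depends on details the abstract does not give.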
Pages: 870-875
Number of pages: 6
Related papers
50 in total
  • [1] Speaker Clustering Based on Non-negative Matrix Factorization
    Nishida, Masafumi
    Yamamoto, Seiichi
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 956 - 959
  • [2] Speaker conversion using kernel non-negative matrix factorization
    Xu Qinyu
    Lu Guanming
    Yan Jingjie
    Li Haibo
    Cheng Xiao
    [J]. The Journal of China Universities of Posts and Telecommunications, 2017, (05) : 60 - 67
  • [5] Fast speaker adaptation using non-negative matrix factorization
    Duchateau, Jacques
    Leroy, Tobias
    Demuynck, Kris
    Van hamme, Hugo
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4269 - 4272
  • [6] Ear recognition using improved Non-Negative Matrix Factorization
    Yuan, Li
    Mu, Zhi-Chun
    Zhang, Yu
    Liu, Ke
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 501 - +
  • [7] Chinese Character Recognition Using Non-negative Matrix Factorization
    Voon, Chen Huey
    Shin, Ker
    Shean, Ng Wei
    [J]. JURNAL KEJURUTERAAN, 2024, 36 (02): : 653 - 660
  • [8] Human Action Recognition Based on Non-negative Matrix Factorization
    Lin, Chih-Yang
    Chen, Bo-You
    Wu, Wen-Chuan
    Lin, Wei-Yang
    Tsai, Chia-Ling
    [J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 1091 - 1093
  • [9] Non-negative matrix factorization based methods for object recognition
    Liu, WX
    Zheng, NN
    [J]. PATTERN RECOGNITION LETTERS, 2004, 25 (08) : 893 - 897
  • [10] RAPID SPEAKER ADAPTATION WITH SPEAKER ADAPTIVE TRAINING AND NON-NEGATIVE MATRIX FACTORIZATION
    Zhang, Xueru
    Demuynck, Kris
    Van Hamme, Hugo
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4456 - 4459