Discovering speech phones using convolutive non-negative matrix factorisation with a sparseness constraint

被引:44
|
作者
O'Grady, Paul D. [1 ]
Pearlmutter, Barak A. [2 ]
机构
[1] Univ Coll Dublin, Complex & Adapt Syst Lab, Dublin 4, Ireland
[2] Natl Univ Ireland Maynooth, Hamilton Inst, Kildare, Ireland
基金
爱尔兰科学基金会;
关键词
Non-negative matrix factorisation; Sparse representations; Convolutive dictionaries; Speech phone analysis;
D O I
10.1016/j.neucom.2008.01.033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discovering a representation that allows auditory data to be parsimoniously represented is useful for many machine learning and signal processing tasks. Such a representation can be constructed by non-negative matrix factorisation (NMF), a method for finding parts-based representations of non-negative data. Here, we present an extension to convolutive NMF that includes a sparseness constraint, where the resultant algorithm has multiplicative updates and utilises the beta divergence as its reconstruction objective. In combination with a spectral magnitude transform of speech, this method discovers auditory objects that resemble speech phones along with their associated sparse activation patterns. We use these in a supervised separation scheme for monophonic mixtures, finding improved separation performance in comparison to standard convolutive NMF. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:88 / 101
页数:14
相关论文
共 50 条
  • [21] Online Convolutive Non-Negative Bases Learning for Speech Enhancement
    Li, Yinan
    Zhang, Xiongwei
    Sun, Meng
    Hu, Yonggang
    Li, Li
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (08) : 1609 - 1613
  • [22] Convex Hull Convolutive Non-negative Matrix Factorization Based Speech Enhancement For Multimedia Communication
    Wang, Dongxia
    Cui, Jie
    Wang, Jinghua
    Tan, Huan
    Xu, Ming
    2022 6TH INTERNATIONAL CONFERENCE ON CRYPTOGRAPHY, SECURITY AND PRIVACY, CSP 2022, 2022, : 138 - 142
  • [23] Face recognition using Fisher non-negative matrix factorization with sparseness constraints
    Pu, XR
    Yi, Z
    Zheng, ZM
    Zhou, W
    Ye, M
    ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 112 - 117
  • [24] Unsupervised learning of auditory filter banks using non-negative matrix factorisation
    Bertrand, Alexander
    Demuynck, Kris
    Stouten, Veronique
    Van Hamme, Hugo
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4713 - 4716
  • [25] Underwater reverberation suppression based on non-negative matrix factorisation
    Jia, Hongjian
    Li, Xiukun
    JOURNAL OF SOUND AND VIBRATION, 2021, 506
  • [26] Shifted non-negative matrix factorisation for sound source separation
    FitzGerald, Derry
    Cranitch, Matt
    Coyle, Eugene
    2005 IEEE/SP 13TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), VOLS 1 AND 2, 2005, : 1061 - 1065
  • [27] LEARNING SPEECH FEATURES IN THE PRESENCE OF NOISE: SPARSE CONVOLUTIVE ROBUST NON-NEGATIVE MATRIX FACTORIZATION
    de Frein, Ruairi
    Rickard, Scott T.
    2009 16TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 1248 - 1253
  • [28] Non-negative matrix factorisation of large mass spectrometry datasets
    Trindade, Gustavo F.
    Abel, Marie-Laure
    Watts, John F.
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2017, 163 : 76 - 85
  • [29] SIMILARITY INDUCED GROUP SPARSITY FOR NON-NEGATIVE MATRIX FACTORISATION
    Hurmalainen, Antti
    Saeidi, Rahim
    Virtanen, Tuomas
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4425 - 4429
  • [30] Relative Pairwise Relationship Constrained Non-Negative Matrix Factorisation
    Jiang, Shuai
    Li, Kan
    Xu, Richard Yi Da
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (08) : 1595 - 1609