Discovering speech phones using convolutive non-negative matrix factorisation with a sparseness constraint

被引:44
|
作者
O'Grady, Paul D. [1 ]
Pearlmutter, Barak A. [2 ]
机构
[1] Univ Coll Dublin, Complex & Adapt Syst Lab, Dublin 4, Ireland
[2] Natl Univ Ireland Maynooth, Hamilton Inst, Kildare, Ireland
基金
爱尔兰科学基金会;
关键词
Non-negative matrix factorisation; Sparse representations; Convolutive dictionaries; Speech phone analysis;
D O I
10.1016/j.neucom.2008.01.033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discovering a representation that allows auditory data to be parsimoniously represented is useful for many machine learning and signal processing tasks. Such a representation can be constructed by non-negative matrix factorisation (NMF), a method for finding parts-based representations of non-negative data. Here, we present an extension to convolutive NMF that includes a sparseness constraint, where the resultant algorithm has multiplicative updates and utilises the beta divergence as its reconstruction objective. In combination with a spectral magnitude transform of speech, this method discovers auditory objects that resemble speech phones along with their associated sparse activation patterns. We use these in a supervised separation scheme for monophonic mixtures, finding improved separation performance in comparison to standard convolutive NMF. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:88 / 101
页数:14
相关论文
共 50 条
  • [1] Discovering convolutive speech phones using sparseness and non-negativity
    O'Grady, Paul D.
    Pearlmutter, Barak A.
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2007, 4666 : 520 - +
  • [2] Denoising of Facial Images using Non-negative Matrix Factorization with Sparseness Constraint
    Varghese, Kitty
    Kolhekar, Megha M.
    Hande, Smita
    2018 3RD INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [3] Discovering Latent Blockmodels in Sparse and Noisy Graphs using Non-Negative Matrix Factorisation
    Chan, Jeffrey
    Liu, Wei
    Kan, Andrey
    Leckie, Christopher
    Bailey, James
    Ramamohanarao, Kotagiri
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 811 - 816
  • [4] Automatically Learning the Units of Speech by Non-negative Matrix Factorisation
    Stouten, Veronique
    Demuynck, Kris
    Van Hamme, Hugo
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1733 - 1736
  • [5] Object Recognition Using Non-Negative Matrix Factorization with Sparseness Constraint and Neural Network
    Lei, Songze
    Zhang, Boxing
    Wang, Yanhong
    Dong, Baihua
    Li, Xiaoping
    Xiao, Feng
    INFORMATION, 2019, 10 (02)
  • [6] Convolutive Sparse Non-negative Matrix Factorization for Windy Speech
    Lai Xiaoqiang
    Li Shuangtian
    Yang Jie
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 494 - 497
  • [7] Molecular cancer class discovery using non-negative matrix factorization with sparseness constraint
    Kong, Xiangzhen
    Zheng, Chunhou
    Wu, Yuqiang
    Shang, Li
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2007, 4681 : 792 - +
  • [8] Collaborative filtering using non-negative matrix factorisation
    Aghdam, Mehdi Hosseinzadeh
    Analoui, Morteza
    Kabiri, Peyman
    JOURNAL OF INFORMATION SCIENCE, 2017, 43 (04) : 567 - 579
  • [9] Speech Enhancement Using Sparse Convolutive Non-negative Matrix Factorization with Basis Adaptation
    Carlin, Michael A.
    Malyska, Nicolas
    Quatieri, Thomas F.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 582 - 585
  • [10] CONSTRAINT NON-NEGATIVE MATRIX FACTORIZATION WITH SPARSENESS AND PIECEWISE SMOOTHNESS FOR HYPERSPECTRAL UNMIXING
    Sun, Xu
    Peng, Qian
    Zhang, Bing
    Gao, Lianru
    Yang, Lina
    2018 9TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2018,