Discovering speech phones using convolutive non-negative matrix factorisation with a sparseness constraint

被引:44
|
作者
O'Grady, Paul D. [1 ]
Pearlmutter, Barak A. [2 ]
机构
[1] Univ Coll Dublin, Complex & Adapt Syst Lab, Dublin 4, Ireland
[2] Natl Univ Ireland Maynooth, Hamilton Inst, Kildare, Ireland
基金
爱尔兰科学基金会;
关键词
Non-negative matrix factorisation; Sparse representations; Convolutive dictionaries; Speech phone analysis;
D O I
10.1016/j.neucom.2008.01.033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discovering a representation that allows auditory data to be parsimoniously represented is useful for many machine learning and signal processing tasks. Such a representation can be constructed by non-negative matrix factorisation (NMF), a method for finding parts-based representations of non-negative data. Here, we present an extension to convolutive NMF that includes a sparseness constraint, where the resultant algorithm has multiplicative updates and utilises the beta divergence as its reconstruction objective. In combination with a spectral magnitude transform of speech, this method discovers auditory objects that resemble speech phones along with their associated sparse activation patterns. We use these in a supervised separation scheme for monophonic mixtures, finding improved separation performance in comparison to standard convolutive NMF. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:88 / 101
页数:14
相关论文
共 50 条
  • [31] A Unifying Framework and Comparison of Algorithms for Non-negative Matrix Factorisation
    Hobolth, Asger
    Guo, Qianyun
    Kousholt, Astrid
    Jensen, Jens Ledet
    INTERNATIONAL STATISTICAL REVIEW, 2020, 88 (01) : 29 - 53
  • [32] Overlapping Community Detection Using Non-Negative Matrix Factorization With Orthogonal and Sparseness Constraints
    Chen, Naiyue
    Liu, Yun
    Chao, Han-Chieh
    IEEE ACCESS, 2018, 6 : 21266 - 21274
  • [33] Using non-negative matrix factorisation to facilitate efficient bird species richness surveys
    Zhang, Liang
    Towsey, Michael
    Zhang, Jinglan
    Roe, Paul
    ECOLOGICAL INDICATORS, 2017, 80 : 297 - 302
  • [34] Research and Application of Non-negative Matrix Factorization with Sparseness Constraint in Recognition of Traditional Chinese Medicine Pulse Condition
    Guo Rui
    Wang Yiqin
    Yan Haixia
    Li Fufeng
    Xu Zhaoxia
    Yan Jianjun
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2010, : 682 - 685
  • [35] Automatic Non-negative Matrix Factorization Clustering with Competitive Sparseness Constraints
    Liu, Chenglin
    Ma, Jinwen
    INTELLIGENT COMPUTING METHODOLOGIES, 2014, 8589 : 118 - 125
  • [36] Unsupervised learning of phonemes of whispered speech in a noisy environment based on convolutive non-negative matrix factorization
    Zhou, Jian
    Liang, Ruiyu
    Zhao, Li
    Tao, Liang
    Zou, Cairong
    INFORMATION SCIENCES, 2014, 257 : 115 - 126
  • [37] Speech Enhancement Using Convolutive Non negative Matrix Factorization with Multiple Dictionaries
    He, Shan
    Wu, Jiawen
    Li, Lin
    2015 IEEE 9TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2015, : 105 - 108
  • [38] Mutually exclusive grounding for weakly supervised Non-negative Matrix Factorisation
    Renkens, Vincent
    Van Hamme, Hugo
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1388 - 1392
  • [39] Transcribing Bach chorales: Limitations and potentials of non-negative matrix factorisation
    Somnuk Phon-Amnuaisuk
    EURASIP Journal on Audio, Speech, and Music Processing, 2012
  • [40] Relevance feedback based on non-negative matrix factorisation for image retrieval
    Liang, D.
    Yang, J.
    Chang, Y.
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2006, 153 (04): : 436 - 444