SEMI-SUPERVISED MULTICHANNEL SPEECH ENHANCEMENT WITH VARIATIONAL AUTOENCODERS AND NON-NEGATIVE MATRIX FACTORIZATION

被引:0
|
作者
Leglaive, Simon [1 ]
Girin, Laurent [1 ,2 ]
Horaud, Radu [1 ]
机构
[1] Inria Grenoble Rhone Alpes, Montbonnot St Martin, France
[2] Univ Grenoble Alpes, Grenoble INP, GIPSA Lab, Grenoble, France
关键词
Multichannel speech enhancement; local Gaussian modeling; variational autoencoders; non-negative matrix factorization; Monte Carlo expectation-maximization; AUDIO SOURCE SEPARATION; INFORMATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we address speaker-independent multichannel speech enhancement in unknown noisy environments. Our work is based on a well-established multichannel local Gaussian modeling framework. We propose to use a neural network for modeling the speech spectro-temporal content. The parameters of this supervised model are learned using the framework of variational autoencoders. The noisy recording environment is supposed to be unknown, so the noise spectro-temporal modeling remains unsupervised and is based on non-negative matrix factorization (NMF). We develop a Monte Carlo expectation-maximization algorithm and we experimentally show that the proposed approach outperforms its NMF-based counterpart, where speech is modeled using supervised NMF.
引用
收藏
页码:101 / 105
页数:5
相关论文
共 50 条
  • [1] Guided Semi-Supervised Non-Negative Matrix Factorization
    Li, Pengyu
    Tseng, Christine
    Zheng, Yaxuan
    Chew, Joyce A.
    Huang, Longxiu
    Jarman, Benjamin
    Needell, Deanna
    ALGORITHMS, 2022, 15 (05)
  • [2] Transfer Semi-Supervised Non-Negative Matrix Factorization for Speech Emotion Recognition
    Song, Peng
    Ou, Shifeng
    Zhang, Xinran
    Jin, Yun
    Zheng, Wenming
    Liu, Jinglei
    Yu, Yanwei
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (10) : 2647 - 2650
  • [3] A STATISTICAL APPROACH TO SEMI-SUPERVISED SPEECH ENHANCEMENT WITH LOW-ORDER NON-NEGATIVE MATRIX FACTORIZATION
    Mohammed, Shoaib
    Tashev, Ivan
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 546 - 550
  • [4] Non-negative matrix factorization for semi-supervised data clustering
    Chen, Yanhua
    Rege, Manjeet
    Dong, Ming
    Hua, Jing
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 17 (03) : 355 - 379
  • [5] Non-negative matrix factorization for semi-supervised data clustering
    Yanhua Chen
    Manjeet Rege
    Ming Dong
    Jing Hua
    Knowledge and Information Systems, 2008, 17 : 355 - 379
  • [6] Discriminative semi-supervised non-negative matrix factorization for data clustering
    Xing, Zhiwei
    Wen, Meng
    Peng, Jigen
    Feng, Jinqian
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 103
  • [7] Semi-Supervised Projective Non-Negative Matrix Factorization for Cancer Classification
    Zhang, Xiang
    Guan, Naiyang
    Jia, Zhilong
    Qiu, Xiaogang
    Luo, Zhigang
    PLOS ONE, 2015, 10 (09):
  • [8] Semi-Supervised Non-Negative Matrix Factorization With Dissimilarity and Similarity Regularization
    Jia, Yuheng
    Kwong, Sam
    Hou, Junhui
    Wu, Wenhui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2510 - 2521
  • [9] Robust Semi-Supervised Non-Negative Matrix Factorization With Structured Normalization
    Wang, Liujing
    Guan, Naiyang
    Shi, Dianxi
    Fan, Zunlin
    Su, Longfei
    IEEE ACCESS, 2019, 7 : 133996 - 134013
  • [10] A Fast Optimized Semi-Supervised Non-Negative Matrix Factorization Algorithm
    Lopes, Noel
    Ribeiro, Bernardete
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 2495 - 2500