共 29 条
- [1] WANG Deliang, BROWN G J., Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, pp. 1-14, (2006)
- [2] SCHMIDT M N, OLSSON R K., Single-channel speech separation using sparse non-negative matrix factorization, The INTERSPEECH 2006, (2006)
- [3] ZHOU Weili, Zhen ZHU, LIANG Peiying, Speech denoising using Bayesian NMF with online base update[J], Multimedia Tools and Applications, 78, 11, pp. 15647-15664, (2019)
- [4] SUN Lei, DU Jun, DAI Lirong, Et al., Multiple-target deep learning for LSTM-RNN based speech enhancement[C], 2017 Hands-free Speech Communications and Microphone Arrays, pp. 136-140, (2017)
- [5] HERSHEY J R, CHEN Zhuo, ROUX J L, Et al., Deep clustering: Discriminative embeddings for segmentation and separation[C], 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 31-35, (2016)
- [6] YU Dong, KOLBAEK M, TAN Zhenghua, Et al., Permutation invariant training of deep models for speaker-independent multi-talker speech separation[C], 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 241-245, (2017)
- [7] GOLUMBIC E Z, COGAN G B, SCHROEDER C E, Et al., Visual input enhances selective speech envelope tracking in auditory cortex at a “cocktail party”[J], The Journal of Neuroscience, 33, 4, pp. 1417-1426, (2013)
- [8] SUSSMAN E S., Integration and segregation in auditory scene analysis[J], The Journal of the Acoustical Society of America, 117, 3, pp. 1285-1298, (2005)
- [9] TAO Ruijie, PAN Zexu, DAS R K, Et al., Is someone speaking?: Exploring long-term temporal features for audiovisual active speaker detection[C], The ACM Multimedia Conference, pp. 3927-3935, (2021)
- [10] LAKHAN A, MOHAMMED M A, KADRY S, Et al., Federated Learning-Aware Multi-Objective Modeling and blockchain-enable system for IIoT applications, Computers and Electrical Engineering, 100, (2022)