Adaptive unscented particle filter algorithm based on multi-feature for speaker tracking in noisy and reverberant environments

被引:0
|
作者
Liu W. [1 ]
Pan H. [1 ]
Wang M. [2 ]
机构
[1] School of Mechanical Engineering and Automation, Zhejiang Sci-Tech University, Hangzhou
[2] Key Laboratory of Special Purpose Equipment and Advanced Processing Technology, Zhejiang University of Technology, Ministry of Education, Hangzhou
关键词
AUPF algorithm; Microphone array; Multi-feature; Room reverberation; Speaker tracking;
D O I
10.19650/j.cnki.cjsi.J2108759
中图分类号
学科分类号
摘要
To improve the accuracy and robustness of the speaker tracking system in noisy and reverberant environments, an adaptive unscented particle filter (AUPF) algorithm based on multi-feature is proposed. The multi-feature of the speech signal is regarded as the observation information in this algorithm, where the multi-hypothesis and frequency selection function is applied to the mechanisms of time delay selection and beam output energy optimization. Subsequently, the likelihood function is constructed by combining these two mechanisms, which makes up for the deficiency that noise and reverberation cannot be restrained simultaneously by a single feature. Considering the randomness of speaker motion, a new proposal distribution is utilized in the particle filter algorithm, which combines the unscented Kalman filter (UKF) and the robust estimation theory based on the adaptive constant speed model to improve the adaptability of the model. The simulation and experimental results show that based on AUPF, the position average RMSE of multi feature algorithm is reduced by more than 18% compared with that of SBFSRP, and under multi-feature observation, the position average RMSE of AUPF algorithm is reduced by more than 14% compared with that of CV algorithm. It has the characteristics of high tracking accuracy and strong numerical stability. © 2022, Science Press. All right reserved.
引用
收藏
页码:224 / 233
页数:9
相关论文
共 22 条
  • [1] EVERS C, LOELLMANN H, MELLMANN H, Et al., The LOCATA challenge: Acoustic source localization and tracking, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, pp. 1620-1643, (2020)
  • [2] CHAKRABARTY S, HABETS E A P., Multi-speaker DOA estimation using deep convolutional networks trained with noise signals, IEEE Journal of Selected Topics in Signal Processing, 13, 1, pp. 8-21, (2019)
  • [3] VERA-DIAZ J M, PIZARRO D, MACIAS-GUARASA J., Acoustic source localization with deep generalized cross correlations, Signal Processing, 187, 2, pp. 1-22, (2021)
  • [4] LI B W, ZHANG X G., Improved microphone array sound source localization method based on generalized cross correlation, Journal of Nanjing University (Natural Sciences), 56, 6, pp. 917-922, (2020)
  • [5] CHENG F X, LIU L, YAO Q H, Et al., Acoustic source localization algorithm based on improved time delay estimation, Journal of Jilin University(Science Edition), 56, 3, pp. 681-687, (2018)
  • [6] MARXIM R B B, MOHANTY A R., Time delay estimation in reverberant and low SNR environment by EMD based maximum likelihood method, Measurement, 137, pp. 655-663, (2019)
  • [7] PERTILA P, PARVIAINEN M., Time difference of arrival estimation of speech signals using deep neural networks with integrated time-frequency masking, IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP), pp. 436-440, (2019)
  • [8] ZHAO X Y, CHEN SH W, ZHOU L., Sound source localization using SNR-based frequency weighting with microphone array, Journal of Signal Processing, 36, 3, pp. 449-456, (2020)
  • [9] BRENDEL A, ALTMANN I, KELLERMANN W., Acoustic source position estimation based on multi-feature Gaussian processes, European Signal Processing Conference, pp. 1-5, (2019)
  • [10] KWAK Y, KIM D, HAM H, Et al., Convolutional neural network trained with synthetic pseudo-images for detecting an acoustic source, Applied Acoustics, 179, 6, pp. 1-7, (2021)