Joint Speech Enhancement and Speaker Identification Using Monte Carlo Methods

被引:0
|
作者
Maina, Ciira Wa [1 ]
Walsh, John MacLaren [1 ]
机构
[1] Drexel Univ, Dept Elect & Comp Engn, Philadelphia, PA 19104 USA
关键词
Speaker identification; Markov chain Monte Carlo methods; speech enhancement; RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an approach to speaker identification using noisy speech observations where the speech enhancement and speaker identification tasks are performed jointly. This is motivated by the belief that human beings perform these tasks jointly and that optimality may be sacrificed if sequential processing is used. We employ a Bayesian approach where the speech features arc modeled using a mixture of Gaussians prior. A Gibbs sampler is used to estimate the speech source and the identity of the speaker. Preliminary experimental results are presented comparing our approach to a maximum likelihood approach and demonstrating the ability of our method to both enhance speech and identify speakers.
引用
收藏
页码:1359 / 1362
页数:4
相关论文
共 50 条
  • [1] Joint Speech Enhancement and Speaker Identification Using Approximate Bayesian Inference
    Maina, Ciira Wa
    Walsh, John MacLaren
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1517 - 1529
  • [2] Markov chain Monte Carlo methods for speech enhancement
    Vermaak, J
    Niranjan, M
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1013 - 1016
  • [3] Speech Enhancement for Speaker Identification
    Mahesh, R.
    [J]. 2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [4] Speaker Re-identification with Speaker Dependent Speech Enhancement
    Shi, Yanpei
    Huang, Qiang
    Hain, Thomas
    [J]. INTERSPEECH 2020, 2020, : 1530 - 1534
  • [5] Blind speech dereverberation using batch and sequential Monte Carlo methods
    Evers, Christine
    Hopgood, James R.
    Bell, Judith
    [J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 3226 - +
  • [6] DNN Based Speech Enhancement for Unseen Noises Using Monte Carlo Dropout
    Nazreen, P. M.
    Ramakrishnan, A. G.
    [J]. 2018 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2018,
  • [7] Performance enhancement of speaker identification systems using speech encryption and cancelable features
    Soliman N.F.
    Mostfa Z.
    El-Samie F.E.A.
    Abdalla M.I.
    [J]. Soliman, Naglaa F. (nagla_soliman@yahoo.com), 1600, Springer Science and Business Media, LLC (20): : 977 - 1004
  • [8] Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
    Kanda, Naoyuki
    Gaur, Yashesh
    Wang, Xiaofei
    Meng, Zhong
    Chen, Zhuo
    Zhou, Tianyan
    Yoshioka, Takuya
    [J]. INTERSPEECH 2020, 2020, : 36 - 40
  • [9] ARTICULATORY BASED SPEECH MODELS FOR BLIND SPEECH DEREVERBERATION USING SEQUENTIAL MONTE CARLO METHODS
    Evers, Christine
    Hopgood, James R.
    [J]. 18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 2131 - 2135
  • [10] Speaker Identification using Whispered Speech
    Jawarkar, Naresh P.
    Holambe, Raghunath S.
    Basu, Tapan Kumar
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT 2013), 2013, : 778 - 781