Speaker Verification Based on Single Channel Speech Separation

被引:0
|
作者
Jin, Rong [1 ]
Ablimit, Mijit [1 ]
Hamdulla, Askar [1 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi 830017, Peoples R China
关键词
Speech separation; voiceprint recognition; speaker verification; multi-tasking; IDENTIFICATION;
D O I
10.1109/ACCESS.2023.3287868
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In multi-speaker scenarios, speech processing tasks like speaker identification and speech recognition are susceptible to noise and overlapped voices. As the overlapped voices are a complicated mixture of signals, a target extraction method from this mixture is a good front-end solution for further processing like understanding and classifying. The quality of speech separation can be assessed by the noise ratio or subjective scoring and can also be assessed by accuracy of the downstream processing tasks like speaker identification. In order to make the separation model and speaker identification model more adapted to complex multi-speaker speech overlapping scenarios, this research investigates the speech separation model and incorporate with a voiceprint recognition task. This paper proposes a feature-scale single channel speech separation network connected to a back-end speaker verification network with MFCCT features, so the accuracy of speaker identification indicates the quality of speech separation task. The datasets are prepared by synthesizing Voxceleb1 data, and used for training and testing. The results show that using an objective downstream evaluation can effectively improve the overall performance, as the optimized speech separation model significantly reduced the error rate of speaker verification.
引用
收藏
页码:112631 / 112638
页数:8
相关论文
共 50 条
  • [1] Speaker Verification-Based Evaluation of Single-Channel Speech Separation
    Maciejewski, Matthew
    Watanabe, Shinji
    Khudanpur, Sanjeev
    [J]. INTERSPEECH 2021, 2021, : 3520 - 3524
  • [2] CASA BASED SUPERVISED SINGLE CHANNEL SPEAKER INDEPENDENT SPEECH SEPARATION
    Rehman, M. Fazal Ur
    Saleem, Nasir
    Nawaz, Asif
    Jan, Sadeeq
    Najam, Zeeshan
    Khattak, M. Irfan
    Ahmed, Sheeraz
    [J]. JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2019, 14 (06): : 973 - 984
  • [3] Speaker-independent model-based single channel speech separation
    Radfar, M. H.
    Dansereau, R. M.
    Sayadiyan, A.
    [J]. NEUROCOMPUTING, 2008, 72 (1-3) : 71 - 78
  • [4] Exploring single channel speech separation for short-time text-dependent speaker verification
    Han, Jiangyu
    Shi, Yan
    Long, Yanhua
    Liang, Jiaen
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 261 - 268
  • [5] Exploring single channel speech separation for short-time text-dependent speaker verification
    Jiangyu Han
    Yan Shi
    Yanhua Long
    Jiaen Liang
    [J]. International Journal of Speech Technology, 2022, 25 : 261 - 268
  • [6] JOINT SINGLE-CHANNEL SPEECH SEPARATION AND SPEAKER IDENTIFICATION
    Mowlaee, P.
    Saeidi, R.
    Tan, Z. -H.
    Christensen, M. G.
    Franti, P.
    Jensen, S. H.
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4430 - 4433
  • [7] UNIVERSAL SPEECH MODELS FOR SPEAKER INDEPENDENT SINGLE CHANNEL SOURCE SEPARATION
    Sun, Dennis L.
    Mysore, Gautham J.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 141 - 145
  • [8] A Joint Approach for Single-Channel Speaker Identification and Speech Separation
    Mowlaee, Pejman
    Saeidi, Rahim
    Christensen, Mads Grsboll
    Tan, Zheng-Hua
    Kinnunen, Tomi
    Franti, Pasi
    Jensen, Soren Holdt
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (09): : 2586 - 2601
  • [9] Multi-Channel Speaker Verification for Single and Multi-talker Speech
    Kataria, Saurabh
    Zhang, Shi-Xiong
    Yu, Dong
    [J]. INTERSPEECH 2021, 2021, : 4608 - 4612
  • [10] A generalized approach for model-based speaker-dependent single channel speech separation
    Radfar, M. H.
    Sayadiyan, A.
    Dansereau, R. M.
    [J]. IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY TRANSACTION B-ENGINEERING, 2007, 31 (B3): : 361 - 375