ENABLING IMPROVED SPEAKER RECOGNITION BY VOICE QUALITY ESTIMATION

被引:0
|
作者
Bartos, Anthony L. [1 ]
Nelson, Douglas J. [2 ]
机构
[1] Assurance Technol Corp, Chantilly, VA 20151 USA
[2] Natl Secur Agcy, Ft George G Meade, MD 20755 USA
关键词
SAD; Speech Activity Detection; VAD; voice Activity Detection; SID; Speaker ID; LID; Language ID; EER; Equal Error Rate; VQE; Voice Quality Estimate; SNR; Signal to Noise Ratio;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Presented is a method to mitigate noise and interference in automated speaker identification (SID). This process uses the MIT/LL SID module without modifications. In this process, speaker models are built for a lattice of signal to noise ratio (SNR) levels. The SNR of the received signal is estimated by first applying speech activity detection to identify portions of the signal that actually contain speech. A voice quality estimation process is then applied to estimate the SNR of the received signal. The speaker models representing the SNR of the received signal are dynamically loaded, and conventional SID is applied. In training, the SNR of each training signal is estimated, and the signal is modified by adding noise to create a signal at the desired SNR. Using this process, each signal may be used to train models at any SNR level less than or equal to the SNR of the original signal. The process has been fully implemented and is completely automated.
引用
收藏
页码:595 / 599
页数:5
相关论文
共 50 条
  • [41] Preprocessing techniques for voice-print analysis for speaker recognition
    Ramli, Dzati Athiar
    Samad, Salina Abdul
    Hussain, Aini
    [J]. 2007 5TH STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT, 2007, : 452 - 456
  • [42] Improved MFCC Algorithm in Speaker Recognition System
    Shi, Yibo
    Wang, Li
    [J]. INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
  • [43] Robust Speaker Recognition Based on Improved GFCC
    Shi, Xiaoyuan
    Yang, Haiyan
    Zhou, Ping
    [J]. 2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1927 - 1931
  • [44] RECOGNITION OF FALSETTO VOICE QUALITY
    LERMAN, JW
    DUFFY, RJ
    [J]. FOLIA PHONIATRICA, 1970, 22 (01): : 21 - 27
  • [45] Influence of Natural Voice Disguise Techniques on Automatic Speaker Recognition
    Staroniewicz, Piotr
    [J]. 2018 JOINT CONFERENCE - ACOUSTICS, 2018, : 299 - 302
  • [46] Sound Identification and Speaker Recognition for Aircraft Cockpit Voice Recorder
    Lin, Yang
    [J]. PROCEEDINGS OF 2010 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY, VOL 1 AND 2, 2010, : 260 - 263
  • [47] Robust Threshold Selection for Environment Specific Voice in Speaker Recognition
    Kanrar, Soumen
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2022, 126 (04) : 3071 - 3092
  • [48] HUMAN SPEAKER RECOGNITION PERFORMANCE OF LPC VOICE PROCESSORS.
    Uzdy, Z.
    [J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1985, ASSP-33 (03): : 752 - 753
  • [49] Disentangling Voice and Content with Self-Supervision for Speaker Recognition
    Liu, Tianchi
    Lee, Kong Aik
    Wang, Qiongqiong
    Li, Haizhou
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [50] Voice Activation Using Speaker Recognition for Controlling Humanoid Robot
    Tuasikal, Dyah Ayu Anggreini
    Fakhrurroja, Hanif
    Machbub, Carmadi
    [J]. 2018 IEEE 8TH INTERNATIONAL CONFERENCE ON SYSTEM ENGINEERING AND TECHNOLOGY (ICSET), 2018, : 79 - 84