Evaluation of a noise-robust multi-stream speaker verification method using F0 information

被引:2
|
作者
Asami, Taichi [1 ]
Iwano, Koji [1 ]
Furui, Sadaoki [1 ]
机构
[1] Tokyo Inst Technol, Dept Comp Sci, Tokyo 1528552, Japan
关键词
speaker verification; F-0; information; multi-stream HMMs; stream-weight and threshold optimization; Adaboost;
D O I
10.1093/ietisy/e91-d.3.549
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We have previously proposed a noise-robust speaker verification method using fundamental frequency (F-0) extracted using the Hough transform. The method also incorporates an automatic stream-weight and decision threshold estimation technique. It has been confirmed that the proposed method is effective for white noise at various SNR conditions. This paper evaluates the proposed method in more practical in-car and elevator-hall noise conditions. The paper first describes the noise-robust F-0 extraction method and details of our robust speaker verification method using multi-stream HMMs for integrating the extracted F-0 and cepstral features. Details of the automatic stream-weight and threshold estimation method for multi-stream speaker verification framework are also explained. This method simultaneously optimizes stream-weights and a decision threshold by combining the linear discriminant analysis (LDA) and the Adaboost technique. Experiments were conducted using Japanese connected digit speech contaminated by white, in-car, or elevator-hall noise at various SNRs. Experimental results show that the F-0 features improve the verification performance in various noisy environments, and that our stream-weight and threshold optimization method effectively estimates control parameters so that FARs and FRRs are adjusted to achieve equal error rates (EERs) under various noisy conditions.
引用
收藏
页码:549 / 557
页数:9
相关论文
共 11 条
  • [1] A stream-weight and threshold estimation method using adaboost for multi-stream speaker verification
    Asami, Taichi
    Iwano, Koji
    Furui, Sadaoki
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 5939 - 5942
  • [2] Multi-SNR GMMs-based noise-robust speaker verification using 1/fα noises
    Yang, Liping
    Gong, Weiguo
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 241 - +
  • [3] Noise robust speech recognition using F0 contour information
    Iwano, K
    Seki, T
    Furui, S
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (05): : 1102 - 1109
  • [4] MULTI-STREAM CONVOLUTIONAL NEURAL NETWORK WITH FREQUENCY SELECTION FOR ROBUST SPEAKER VERIFICATION
    Yao, Wei
    Chen, Shen
    Cui, Jiamin
    Lou, Yaolin
    [J]. COMPUTING AND INFORMATICS, 2024, 43 (04) : 819 - 848
  • [5] Multi-Task Adversarial Network Bottleneck Features for Noise-Robust Speaker Verification
    Yu, Hong
    Hu, Tianrui
    Ma, Zhanyu
    Tan, Zheng-Hua
    Guo, Jun
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 165 - 169
  • [6] NOISE-ROBUST F0 ESTIMATION USING SNR-WEIGHTED SUMMARY CORRELOGRAMS FROM MULTI-BAND COMB FILTERS
    Tan, Lee Ngee
    Alwan, Abeer
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4464 - 4467
  • [7] Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention
    Jung, Myunghun
    Jung, Youngmoon
    Goo, Jahyun
    Kim, Hoirin
    [J]. INTERSPEECH 2020, 2020, : 931 - 935
  • [8] Noise Robust Speaker Verification using GMM-UBM Multi-Condition Training
    Mekonnen, Bezawit Wubishet
    Dufera, Bisrat Derebssa
    [J]. PROCEEDINGS OF THE 2015 12TH IEEE AFRICON INTERNATIONAL CONFERENCE - GREEN INNOVATION FOR AFRICAN RENAISSANCE (AFRICON), 2015,
  • [9] Statistical Regression Models for Noise Robust F0 Estimation Using Recurrent Deep Neural Networks
    Kato, Akihiro
    Kinnunen, Tomi H.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2336 - 2349
  • [10] External Light Noise-Robust Multi-Touch Screen Using Frame Data Differential Method
    Lee, Gwang Jun
    Lee, Sang Kook
    Lyu, Hong Kun
    Jang, Jae Eun
    [J]. JOURNAL OF DISPLAY TECHNOLOGY, 2015, 11 (09): : 759 - 763