An Improved Ranking-Based Feature Enhancement Approach for Robust Speaker Recognition

被引:8
|
作者
Yan, Furong [1 ]
Men, Aidong [1 ]
Yang, Bo [1 ]
Jiang, Zhuqing [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100000, Peoples R China
来源
IEEE ACCESS | 2016年 / 4卷
关键词
Robustness; feature warping; missing data method; ranking feature; autocorrelation; rank correlation; open-set speaker recognition; TRANSFORMATIONS;
D O I
10.1109/ACCESS.2016.2607778
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although the field of automatic speaker or speech recognition has been extensively studied over the past decades, the lack of robustness has remained a major challenge. Feature warping is a promising approach and its effectiveness significantly depends on the relative positions of each of the features in a sliding window. However, the relative positions are changed due to the non-linear effect of noise. Aiming at the problem, this paper takes the advantage of ranking feature, which is obtained directly by sorting a feature sequence in descending order, to propose a method. It first labels the central frame in a sliding window as speech or noise dominant ("reliable'' or "unreliable''). In the unreliable case, the ranking of the central frame is estimated. Subsequently, the estimated ranking is mapped to a warped feature using a desired target distribution for recognition experiments. Through the theoretical analysis and experimental results, it is found that autocorrelation of a ranking sequence is larger than that of the corresponding feature sequence. What is more, rank correlation is not easily influenced by abnormal data or data that are highly variable. Thus, this paper deals with a ranking sequence rather than a feature sequence. The proposed feature enhancement approach is evaluated in an open-set speaker recognition system. The experimental results show that it outperforms missing data method based on linear interpolation and feature warping in terms of recognition performance in all noise conditions. Furthermore, the method proposed here is a feature-based method, which may be combined with other technologies, such as model-based, scores-based, to enhance the robustness of speaker or speech recognition system.
引用
收藏
页码:5258 / 5267
页数:10
相关论文
共 50 条
  • [1] A ranking-based feature selection approach for handwritten character recognition
    Cilia, Nicole Dalia
    De Stefano, Claudio
    Fontanella, Francesco
    di Freca, Alessandra Scotto
    [J]. PATTERN RECOGNITION LETTERS, 2019, 121 : 77 - 86
  • [2] Robust speaker recognition - A feature-based approach
    Mammone, RJ
    Zhang, XY
    Ramachandran, RP
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 1996, 13 (05) : 58 - 71
  • [3] A comprehensive analysis of feature ranking-based fish disease recognition
    Rajbongshi, Aditya
    Shakil, Rashiduzzaman
    Akter, Bonna
    Lata, Munira Akter
    Joarder, Md. Mahbubul Alam
    [J]. ARRAY, 2024, 21
  • [4] Neighborhood Ranking-Based Feature Selection
    Ipkovich, Adam
    Abonyi, Janos
    [J]. IEEE ACCESS, 2024, 12 : 20152 - 20168
  • [5] Robust Speaker Recognition Based on Improved GFCC
    Shi, Xiaoyuan
    Yang, Haiyan
    Zhou, Ping
    [J]. 2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1927 - 1931
  • [6] A sub-band-based feature reconstruction approach for robust speaker recognition
    Furong Yan
    Yanbin Zhang
    Jiachang Yan
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2014
  • [7] A sub-band-based feature reconstruction approach for robust speaker recognition
    Yan, Furong
    Zhang, Yanbin
    Yan, Jiachang
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014, : 1 - 13
  • [8] A robust feature based on sparse representation for speaker recognition
    Xie, Yining
    Huang, Jinjie
    Wang, Xinlei
    [J]. Journal of Computational Information Systems, 2013, 9 (09): : 3553 - 3561
  • [9] Feature enhancement by speaker-normalized splice for robust speech recognition
    Shinohara, Yusuke
    Masuko, Takashi
    Akamine, Masami
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4881 - 4884
  • [10] A COCHLEAR NEURON BASED ROBUST FEATURE FOR SPEAKER RECOGNITION
    You, Datao
    Jiang, Tao
    Han, Jiqing
    Zheng, Tieran
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5440 - 5443