Noise robust speaker identification using sub-band weighting in multi-band approach

被引:2
|
作者
Kim, Sungtak [1 ]
Ji, Mikyong [1 ]
Suh, Youngjoo [1 ]
Kim, Hoirin [1 ]
机构
[1] Informat & Commun Univ, Sch Engn, Taejon, South Korea
关键词
feature recombination; multi-band approach; speaker identification; sub-band likelihood; sub-band weighting;
D O I
10.1093/ietisy/e90-d.12.2110
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, many techniques have been proposed to improve speaker identification in noise environments. Among these techniques, we consider the feature recombination technique for the multi-band approach in noise robust speaker identification. The conventional feature recombination technique is very effective in the band-limited noise condition, but in broad-band noise condition, the conventional feature recombination technique does not provide notable performance improvement compared with the full-band system. Even though the speech is corrupted by the broad-band noise, the degree of the noise corruption on each sub-band is different from each other. In the conventional feature recombination for speaker identification, all sub-band features are used to compute multiband likelihood score, but this likelihood computation does not use a merit of multi-band approach effectively, even though the sub-band features are extracted independently. Here we propose a new technique of sub-band likelihood computation with sub-band weighting in the feature recombination method. The signal to noise ratio (SNR) is used to compute the subband weights. The proposed sub-band-weighted likelihood computation makes a speaker identification system more robust to noise. Experimental results show that the average error reduction rate (ERR) in various noise environments is more than 24% compared with the conventional feature recombination-based speaker identification system.
引用
收藏
页码:2110 / 2114
页数:5
相关论文
共 50 条
  • [41] Sub-band Modulation Spectrum Compensation for Robust Speech Recognition
    Tu, Wen-hsiang
    Huang, Sheng-Yuan
    Hung, Jeih-weih
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 261 - 265
  • [42] Improved data modeling for text-dependent speaker recognition using sub-band processing
    Finan R.A.
    Damper R.I.
    Sapeluk A.T.
    International Journal of Speech Technology, 2001, 4 (1) : 45 - 62
  • [43] Frequency Sub-band Reduction of Spatially Correlated Noise in Images
    Miroshnichenko, Oleksandr
    Ponomarenko, Mykola
    Abramov, Sergey
    Lukin, Vladimir
    INTEGRATED COMPUTER TECHNOLOGIES IN MECHANICAL ENGINEERING-2023, VOL 1, ICTM 2023, 2024, 1008 : 621 - 631
  • [44] Noise robust speech rate estimation using signal-to-noise ratio dependent sub-band selection and peak detection strategy
    Yarra, Chiranjeevi
    Nagesh, Supriya
    Deshmukh, Om D.
    Ghosh, Prasanta Kumar
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (03): : 1615 - 1628
  • [45] Prosodic modeling for speaker recognition based on sub-band energy temporal trajectories
    Adami, AG
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 189 - 192
  • [46] CHANGE DETECTION BETWEEN MULTI-BAND IMAGES USING A ROBUST FUSION-BASED APPROACH
    Ferraris, Vinicius
    Dobigeon, Nicolas
    Wei, Qi
    Chabert, Marie
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 3346 - 3350
  • [47] Robust speech recognition using compression of Mel sub-band energies and temporal filtering
    Moradi N.
    Nasersharif B.
    Akbari A.
    2010 5th International Symposium on Telecommunications, IST 2010, 2010, : 760 - 763
  • [48] Sub-band, dual-channel adaptive noise cancellation using normalised LMS
    Darlington, DJ
    Campbell, DR
    1996 IEEE DIGITAL SIGNAL PROCESSING WORKSHOP, PROCEEDINGS, 1996, : 327 - 330
  • [49] New Sub-Band Adaptive Volterra Filter for Identification of Loudspeaker
    Kinoshita, Satoshi
    Kajikawa, Yoshinobu
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2019, E102A (12) : 1946 - 1955
  • [50] Automatic adjustment of subband likelihood recombination weights for improving noise-robustness of a multi-SNR multi-band speaker identification system
    Yoshida, K
    Takagi, K
    Ozeki, K
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (11) : 2453 - 2459