Noise robust speaker identification using sub-band weighting in multi-band approach

Cited by: 2

Authors:

Kim, Sungtak [1]

Ji, Mikyong [1]

Suh, Youngjoo [1]

Kim, Hoirin [1]

Affiliation:

[1] Informat & Commun Univ, Sch Engn, Taejon, South Korea

Source:

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2007 / Vol. E90D / No. 12

Keywords:

feature recombination; multi-band approach; speaker identification; sub-band likelihood; sub-band weighting;

DOI:

10.1093/ietisy/e90-d.12.2110

Chinese Library Classification number:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

Recently, many techniques have been proposed to improve speaker identification in noisy environments. Among these, we consider the feature recombination technique for the multi-band approach to noise-robust speaker identification. Conventional feature recombination is very effective under band-limited noise, but under broad-band noise it does not provide a notable performance improvement over the full-band system. Even when speech is corrupted by broad-band noise, the degree of corruption differs from one sub-band to another. In conventional feature recombination for speaker identification, all sub-band features are used to compute the multi-band likelihood score, but this likelihood computation does not exploit the advantage of the multi-band approach effectively, even though the sub-band features are extracted independently. Here we propose a new technique for sub-band likelihood computation with sub-band weighting in the feature recombination method. The signal-to-noise ratio (SNR) is used to compute the sub-band weights. The proposed sub-band-weighted likelihood computation makes a speaker identification system more robust to noise. Experimental results show that the average error reduction rate (ERR) across various noise environments is more than 24% compared with the conventional feature recombination-based speaker identification system.
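The abstract does not give the weighting formula, so the following is only a minimal sketch of the general idea of combining per-band log-likelihoods with SNR-derived weights. The function names (`snr_to_weights`, `weighted_score`, `identify`), the clip-and-normalize mapping from SNR to weights, and the `floor_db` parameter are all assumptions made for illustration and are not taken from the paper.

```python
def snr_to_weights(snr_db, floor_db=0.0):
    """Map per-band SNR estimates (dB) to non-negative weights that sum to 1.

    Assumed mapping (not from the paper): subtract floor_db, clip at zero,
    then normalize. Falls back to uniform weights if every band is clipped.
    """
    clipped = [max(s - floor_db, 0.0) for s in snr_db]
    total = sum(clipped)
    if total == 0.0:
        return [1.0 / len(snr_db)] * len(snr_db)
    return [c / total for c in clipped]


def weighted_score(band_loglik, weights):
    """Weighted sum of per-band log-likelihoods for one speaker model."""
    return sum(w * ll for w, ll in zip(weights, band_loglik))


def identify(speaker_band_loglik, snr_db):
    """Return the speaker whose weighted multi-band score is highest.

    speaker_band_loglik: dict mapping speaker id -> list of per-band
    log-likelihoods (hypothetical input; how these are obtained is
    outside the scope of this sketch).
    """
    weights = snr_to_weights(snr_db)
    return max(
        speaker_band_loglik,
        key=lambda spk: weighted_score(speaker_band_loglik[spk], weights),
    )


# Toy example: the third band is heavily corrupted (SNR below the floor),
# so its misleading likelihoods receive zero weight.
scores = {"A": [-10.0, -12.0, -40.0], "B": [-14.0, -15.0, -5.0]}
best = identify(scores, snr_db=[20.0, 10.0, -5.0])
```

In this toy case an unweighted sum would favor speaker "B" because of the corrupted third band, while the weighted score relies on the two cleaner bands and selects "A". Other weight mappings (for example, a smooth function of SNR) would fit the same structure.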


Pages: 2110 - 2114

Number of pages: 5

50 items in total

[41] Sub-band Modulation Spectrum Compensation for Robust Speech Recognition
Tu, Wen-hsiang
Huang, Sheng-Yuan
Hung, Jeih-weih
2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009 : 261 - 265
[42] Improved data modeling for text-dependent speaker recognition using sub-band processing
Finan R.A.
Damper R.I.
Sapeluk A.T.
International Journal of Speech Technology, 2001, 4 (1) : 45 - 62
[43] Frequency Sub-band Reduction of Spatially Correlated Noise in Images
Miroshnichenko, Oleksandr
Ponomarenko, Mykola
Abramov, Sergey
Lukin, Vladimir
INTEGRATED COMPUTER TECHNOLOGIES IN MECHANICAL ENGINEERING-2023, VOL 1, ICTM 2023, 2024, 1008 : 621 - 631
[44] Noise robust speech rate estimation using signal-to-noise ratio dependent sub-band selection and peak detection strategy
Yarra, Chiranjeevi
Nagesh, Supriya
Deshmukh, Om D.
Ghosh, Prasanta Kumar
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (03) : 1615 - 1628
[45] Prosodic modeling for speaker recognition based on sub-band energy temporal trajectories
Adami, AG
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005 : 189 - 192
[46] CHANGE DETECTION BETWEEN MULTI-BAND IMAGES USING A ROBUST FUSION-BASED APPROACH
Ferraris, Vinicius
Dobigeon, Nicolas
Wei, Qi
Chabert, Marie
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017 : 3346 - 3350
[47] Robust speech recognition using compression of Mel sub-band energies and temporal filtering
Moradi N.
Nasersharif B.
Akbari A.
2010 5th International Symposium on Telecommunications, IST 2010, 2010 : 760 - 763
[48] Sub-band, dual-channel adaptive noise cancellation using normalised LMS
Darlington, DJ
Campbell, DR
1996 IEEE DIGITAL SIGNAL PROCESSING WORKSHOP, PROCEEDINGS, 1996 : 327 - 330
[49] New Sub-Band Adaptive Volterra Filter for Identification of Loudspeaker
Kinoshita, Satoshi
Kajikawa, Yoshinobu
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2019, E102A (12) : 1946 - 1955
[50] Automatic adjustment of subband likelihood recombination weights for improving noise-robustness of a multi-SNR multi-band speaker identification system
Yoshida, K
Takagi, K
Ozeki, K
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (11) : 2453 - 2459
