Pre-Processing for Performance Enhancement of Speech Recognition in Digital Communication Systems

Cited by: 0

Authors:

Seo, Jinho

Park, Hochong

Source:

JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA | 2005 / Vol. 24 / No. 07

Keywords:

speech recognition; speech codec; digital speech communication; spectral compensation

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Subject classification code:

070206 ; 082403 ;

Abstract:

Speech recognition in digital communication systems performs very poorly because of the spectral distortion introduced by speech codecs. This paper analyzes the spectral distortion caused by speech codecs and proposes a pre-processing method that compensates for it in order to improve recognition performance. Three standard speech codecs, IS-127 EVRC, ITU G.729 CS-ACELP, and IS-96 QCELP, are considered for algorithm development and evaluation, and a single method applicable to all three codecs is developed. The proposed method is evaluated on the three codecs: using speech features extracted from the compensated spectrum improves the recognition rate by up to 15.6% compared with using features extracted from the degraded speech.

Pages: 416-422

Page count: 7
