Pre-Processing for Performance Enhancement of Speech Recognition in Digital Communication Systems

被引:0
|
作者
Seo, Jinho
Park, Hochong
机构
来源
关键词
speech recognition; speech codec; digital speech communication; spectral compensation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech recognition in digital communication systems has very low performance due to the spectral distortion caused by speech codecs. In this paper, the spectral distortion by speech codecs is analyzed and a pre-processing method which compensates for the spectral distortion is proposed for performance enhancement of speech recognition. Three standard speech codecs, IS-127 EVRC, ITU G.729 CS-ACELP and IS-96 QCELP, are considered for algorithm development and evaluation, and a single method which can be applied commonly to all codecs is developed. The performance of the proposed method is evaluated for three codecs, and by using the speech features extracted from the compensated spectrum, the recognition rate is improved by the maximum of 15.6% compared with that using the degraded speech features.
引用
收藏
页码:416 / 422
页数:7
相关论文
共 50 条
  • [1] Speech enhancement using pre-processing
    Singh, L
    Sridharan, S
    IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 755 - 758
  • [2] Signal pre-processing in speech recognition
    Kolokolov, A.S.
    Avtomatika i Telemekhanika, 2002, (03): : 160 - 168
  • [3] Speech recognition by neural networks and pre-processing wavelet
    Cister, AM
    Galante, GMF
    WAVELET APPLICATIONS IN SIGNAL AND IMAGE PROCESSING V, 1997, 3169 : 575 - 578
  • [4] Pre-processing and segmentation of speech signal in frequency domain for speech recognition
    Kolokolov, A.S.
    Avtomatika i Telemekhanika, 2003, (06): : 152 - 162
  • [5] Pre-processing of the speech data
    不详
    ROBUST ADAPTATION TO NON-NATIVE ACCENTS IN AUTOMATIC SPEECH RECOGNITION, 2002, 2560 : 15 - 19
  • [6] Pre-processing Voice Signals for Voice Recognition Systems
    Berdibaeva, Gulmira K.
    Bodin, Oleg N.
    Kozlov, Valery V.
    Nefed'ev, Dmitry I.
    Ozhikenov, Kasymbek A.
    Pizhonkov, Yaroslav A.
    2017 18TH INTERNATIONAL CONFERENCE OF YOUNG SPECIALISTS ON MICRO/NANOTECHNOLOGIES AND ELECTRON DEVICES (EDM), 2017, : 242 - 245
  • [7] Pre-processing speech signals in FPGAs
    Jun, X
    Ariyaeeinia, A
    Sotudeh, R
    Ahmad, Z
    2005 6TH INTERNATIONAL CONFERENCE ON ASIC PROCEEDINGS, BOOKS 1 AND 2, 2005, : 722 - 725
  • [8] Modulation enhancement of speech by a pre-processing algorithm for improving intelligibility in reverberant environments
    Kusumoto, A
    Arai, T
    Kinoshita, K
    Hodoshima, N
    Vaughan, N
    SPEECH COMMUNICATION, 2005, 45 (02) : 101 - 113
  • [9] PRE-PROCESSING OF DATA FOR CHARACTER RECOGNITION
    ALCORN, TM
    HOGGAR, CW
    MARCONI REVIEW, 1969, 32 (172): : 61 - &
  • [10] Pre-processing of compressed digital video
    Segall, CA
    Karunaratne, P
    Katsaggelos, AK
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2001, 2001, 4310 : 163 - 174