Pre-Processing for Performance Enhancement of Speech Recognition in Digital Communication Systems

Cited by: 0
Authors
Seo, Jinho
Park, Hochong
Institution
Source
Keywords
speech recognition; speech codec; digital speech communication; spectral compensation
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Speech recognition in digital communication systems performs poorly because of the spectral distortion introduced by speech codecs. This paper analyzes that distortion and proposes a pre-processing method that compensates for it to improve recognition performance. Three standard speech codecs, IS-127 EVRC, ITU G.729 CS-ACELP, and IS-96 QCELP, are considered for algorithm development and evaluation, and a single method applicable to all three codecs is developed. The proposed method is evaluated on the three codecs: using speech features extracted from the compensated spectrum improves the recognition rate by up to 15.6% compared with using features extracted from the degraded speech.
Pages: 416-422
Page count: 7
Related papers
50 in total
  • [21] Defining properties of speech spectrogram images to allow effective pre-processing prior to pattern recognition
    Aldarkazali, Mohammed
    Young, Rupert
    Chatwin, Chris
    Birch, Philip
    OPTICAL PATTERN RECOGNITION XXIV, 2013, 8748
  • [22] Binaural pre-processing for contralateral sound field attenuation and improved speech-in-noise recognition
    Lopez-Poveda, Enrique A.
    Eustaquio-Martin, Almudena
    San-Victoriano, Fernando M.
    HEARING RESEARCH, 2022, 418
  • [23] Bilateral Histogram Equalization with Pre-processing for Contrast Enhancement
    Amil, Feroz Mahmud
    Rahman, Md Mostafijur
    Rahman, Shanto
    Dey, Emon Kumar
    Shoyaib, Mohammad
    2016 17TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2016 : 231 - 236
  • [24] Applying Enhancement Filters in the Pre-processing of Images of Lymphoma
    Silva, Sergio Henrique
    do Nascimento, Marcelo Zanchetta
    Neves, Leandro Alves
    Batista, Valerio Ramos
    3RD INTERNATIONAL CONFERENCE ON MATHEMATICAL MODELING IN PHYSICAL SCIENCES (IC-MSQUARE 2014), 2015, 574
  • [25] Identification of Pre-processing Technique for Enhancement of Mammogram Images
    Sharma, Jaya
    Rai, J. K.
    Tewari, R. P.
    2014 INTERNATIONAL CONFERENCE ON MEDICAL IMAGING, M-HEALTH & EMERGING COMMUNICATION SYSTEMS (MEDCOM), 2015 : 115 - 119
  • [26] Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance
    Ochiai, Tsubasa
    Iwamoto, Kazuma
    Delcroix, Marc
    Ikeshita, Rintaro
    Sato, Hiroshi
    Araki, Shoko
    Katagiri, Shigeru
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3589 - 3602
  • [27] Study of the Pre-processing Impact in a Facial Recognition System
    Calvo, Guillermo
    Baruque, Bruno
    Corchado, Emilio
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, 2013, 8073 : 334 - 344
  • [28] Pre-Processing Cascades and Fusion in Finger Vein Recognition
    Kauba, Christoph
    Reissig, Jakob
    Uhl, Andreas
    2014 INTERNATIONAL CONFERENCE OF THE BIOMETRICS SPECIAL INTEREST GROUP (BIOSIG), 2014
  • [29] Pre-processing algorithm of unconstrained handwritten numerals recognition
    Moshi Shibie yu Rengong Zhineng, 3 (243-250)
  • [30] Input pre-processing for transformation invariant pattern recognition
    Tascini, G
    Montesanto, A
    Fazzini, G
    Puliti, P
    ENGINEERING APPLICATIONS OF BIO-INSPIRED ARTIFICIAL NEURAL NETWORKS, VOL II, 1999, 1607 : 393 - 401