Effect of speech coders on speech recognition performance

被引:0
|
作者
Lilly, BT
Paliwal, KK
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech coders with bitrates as low as 2.4 kbits/s are now being developed for speech transmission in the telecommunications industry. For speech coders to work at this reduced bitrate, some speech information has to be removed and it is only natural to expect that the performance of speech recognition systems will deteriorate when coded speech is applied as input to a recognition system. In this paper, the results of a study to examine the effects speech coders have on speech recogntion am presented. Six different speech coders ranging from 4.8 kbits/s to 40 kbits/s are used with two different speech recognition systems 1) isolated word recogntion and 2) phoneme recogntion from continuous speech. The effects on speech recognition performance by tandeming each of the speech coders are also presented.
引用
收藏
页码:2344 / 2347
页数:4
相关论文
共 50 条
  • [21] SAMPLE ROBBING IN PREDICTIVE SPEECH CODERS
    AGRAWAL, JP
    IYER, SS
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1986, 34 (11) : 1068 - 1072
  • [22] SPEECH CODERS - FROM IDEA TO PRODUCT
    COX, RV
    KROON, P
    CHEN, JH
    THORKILDSEN, R
    ODELL, KM
    ISENBERG, DS
    [J]. AT&T TECHNICAL JOURNAL, 1995, 74 (02): : 14 - 22
  • [23] Requirements on speech coders imposed by speech service solutions in cellular systems
    Minde, TB
    Bruhn, S
    Ekudden, E
    Hermansson, H
    [J]. 1997 IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, PROCEEDINGS: BACK TO BASICS: ATTACKING FUNDAMENTAL PROBLEMS IN SPEECH CODING, 1997, : 89 - 90
  • [24] Performance comparison between VBR speech coders for adaptive VoIP applications
    Beritelli, F
    Casale, S
    Ruggeri, G
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2002, : 2578 - 2582
  • [25] Performance comparison between VBR speech coders for adaptive VoIP applications
    Beritelli, F
    Casale, S
    Ruggeri, G
    [J]. IEEE COMMUNICATIONS LETTERS, 2001, 5 (10) : 423 - 425
  • [26] DESIGN AND PERFORMANCE OF AN ANALYSIS-BY-SYNTHESIS CLASS OF PREDICTIVE SPEECH CODERS
    ROSE, RC
    BARNWELL, TP
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (09): : 1489 - 1503
  • [27] Quantifying and Improving the Performance of Speech Recognition Systems on Dysphonic Speech
    Lopez, Julio C. Hidalgo C.
    Sandeep, Shelly
    Wright, MaKayla
    Wandell, Grace M. M.
    Law, Anthony B. B.
    [J]. OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2023, 168 (05) : 1130 - 1138
  • [28] SPEECH RECOGNITION WITH NO SPEECH OR WITH NOISY SPEECH
    Krishna, Gautam
    Co Tran
    Yu, Jianguo
    Tewfik, Ahmed H.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1090 - 1094
  • [29] The effect of speech and speech intelligibility on task performance
    Venetjoki, N.
    Kaarlela-Tuomaala, A.
    Keskinen, E.
    Hongisto, V.
    [J]. ERGONOMICS, 2006, 49 (11) : 1068 - 1091
  • [30] Effect of motion on speech recognition
    Davis, Timothy J.
    Grantham, D. Wesley
    Gifford, Rene H.
    [J]. HEARING RESEARCH, 2016, 337 : 80 - 88