Variable bit-rate CELP coding of speech with phonetic classification

被引:0
|
作者
Paksoy, Erdal [1 ]
Srinivasan, Krishnaswamy [1 ]
Gersho, Allen [1 ]
机构
[1] Univ of California, Santa Barbara, United States
关键词
Algorithms - Cellular telephone systems - Digital communication systems - Speech - Speech analysis - Speech processing - Speech transmission - Vocoders;
D O I
暂无
中图分类号
学科分类号
摘要
A variable bit-rate speech coder intended for digital cellular applications is described. A voice activity detection algorithm is used to distinguish active speech from background noise. Each frame of active speech is further classified to distinguish between three phonetic categories: voiced, unvoiced, and onset. Each input frame is assigned one of five bit rates according to voice activity and phonetic classification and coded using an analysis-by-synthesis algorithm tailored to the needs of the class that it belongs to. The resulting coder, called Variable Rate Phonetic Segmentation, produces good quality speech at an average bit-rate below 3 kbit/s when operating with a voice activity factor of 0.5. Informal subjective quality assessment for speech in clean and noisy backgrounds indicates a performance that is comparable to the TIA standard QCELP algorithm while operating at a 25% to 40% lower average bit rate.
引用
收藏
页码:591 / 601
相关论文
共 50 条
  • [31] IMPROVING LOW BIT-RATE CODING
    Rumsey, Francis
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2010, 58 (12): : 1116 - 1121
  • [32] THE APPLICATION OF ARTIFICIAL NEURAL NETWORK TECHNIQUES TO LOW BIT-RATE SPEECH CODING
    KAOURI, HA
    MCCANNY, JV
    [J]. FIRST IEE INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS, 1989, : 100 - 104
  • [33] Low bit-rate speech coding by perceptually optimized noise excitation modulation
    Tsoukalas, D
    Mourjopoulos, J
    Kokkinakis, G
    [J]. SIGNAL PROCESSING, 1997, 56 (01) : 77 - 89
  • [34] LOW BIT-RATE SPEECH CODING WITH VQ-VAE AND A WAVENET DECODER
    Garbacea, Cristina
    van den Oord, Aaron
    Li, Yazhe
    Lim, Felicia S. C.
    Luebs, Alejandro
    Vinyals, Oriol
    Walters, Thomas C.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 735 - 739
  • [35] A MIXED EXCITATION LPC VOCODER MODEL FOR LOW BIT-RATE SPEECH CODING
    MCCREE, AV
    BARNWELL, TP
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (04): : 242 - 250
  • [36] On the Study of Noise Allocation for Speech Signal in Low Bit-Rate Audio Coding
    Lee, Chang-Heon
    Oh, Hyen-O
    Kang, Hong-Goo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (10) : 849 - 852
  • [37] Low bit-rate speech coding by perceptually optimized noise excitation modulation
    Univ of Patras, Patras, Greece
    [J]. Signal Process, 1 (77-89):
  • [38] A neural network-based video bit-rate control algorithm for variable bit-rate applications of versatile video coding standard
    Raufmehr, Farhad
    Salehi, Mohammad Reza
    Abiri, Ebrahim
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 96
  • [39] Low bit-rate speech coding based on multicomponent AFM signal model
    Bansal M.
    Sircar P.
    [J]. International Journal of Speech Technology, 2018, 21 (4) : 783 - 795
  • [40] Algorithms for Low Bit-Rate Coding with Adaptation to Statistical Characteristics of Speech Signal
    Saveliev, Anton
    Basov, Oleg
    Ronzhin, Andrey
    Ronzhin, Alexander
    [J]. SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 65 - 72