Interactive Multimedia Association-Adaptive Differential Pulse Code Modulation Codec With Gated Recurrent Unit Predictor

Cited by: 0
Authors
Sheferaw, Gebremichael Kibret [1]
Mwangi, Waweru [1]
Kimwele, Michael W. [1]
Mamuye, Adane [2]
Salau, Ayodeji Olalekan [3, 4]
Affiliations
[1] Jomo Kenyatta Univ Agr & Technol, Sch Comp & Informat Technol, Nairobi, Kenya
[2] Addis Ababa Inst Technol AAiT, Sch Informat Technol & Engn, Addis Ababa, Ethiopia
[3] Afe Babalola Univ, Dept Elect Elect & Comp Engn, Ado Ekiti, Nigeria
[4] Saveetha Inst Med & Tech Sci, Saveetha Sch Engn, Chennai 602105, Tamil Nadu, India
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Speech coding; IMA-ADPCM; GRU predictor; predictive model; speech compression; NETWORK
DOI
10.1109/ACCESS.2024.3493604
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Speech coding is important for the efficient storage and transmission of audio signals. However, current Interactive Multimedia Association Adaptive Differential Pulse Code Modulation (IMA-ADPCM) speech coding techniques rely on a fixed predictor, which limits how well they encode dynamic, non-stationary speech signals. This limitation is the motivation for this study. Our goal is to improve on the fixed predictor by integrating a Gated Recurrent Unit (GRU) predictor that can adapt to dynamic speech signals and predict them more accurately. We evaluated the IMA-ADPCM encoding baseline against the IMA-ADPCM codec algorithm with the GRU predictor embedded. The proposed encoding system, based on a pre-trained GRU predictor, outperformed the baseline, with a maximum Signal-to-Noise Ratio (SNR) of 43.2 dB and Mean Opinion Score (MOS) values of 3.8 to 4.3 out of 5.0, and our results demonstrated considerable improvements in audio quality. The main contribution of this study is a GRU predictor integrated into the IMA-ADPCM coding algorithm and developed using a dataset of IMA-ADPCM output speech samples and the actual PCM speech samples. With the GRU predictor model integrated in accordance with these data samples, the newly designed algorithm significantly improved the quality of the IMA-ADPCM speech codec.
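The role the predictor plays in a differential codec, and how SNR is measured on the decoded output, can be illustrated with a minimal Python sketch. It is not the authors' implementation: it uses a fixed quantizer step instead of the IMA step-size table, and the names `snr_db`, `previous_sample_predictor` and `encode_decode` are hypothetical. A learned predictor such as a GRU would be passed in as the `predictor` callable; none is implemented here.

```python
import math


def snr_db(reference, decoded):
    """Signal-to-noise ratio in dB between a reference signal and its reconstruction."""
    signal = sum(r * r for r in reference)
    noise = sum((r - d) ** 2 for r, d in zip(reference, decoded))
    if noise == 0:
        return math.inf
    return 10.0 * math.log10(signal / noise)


def previous_sample_predictor(history):
    """Fixed predictor (assumption): the next sample equals the last decoded sample."""
    return history[-1] if history else 0.0


def encode_decode(samples, predictor, step=64.0, levels=8):
    """Toy closed-loop differential coder with a uniform 4-bit quantizer.

    The predictor only sees previously decoded samples, so an encoder and a
    decoder running the same predictor stay in sync. The step size is fixed
    here; real IMA-ADPCM adapts it per sample.
    Returns (codes, reconstructed_samples).
    """
    codes, recon = [], []
    for x in samples:
        pred = predictor(recon)
        code = round((x - pred) / step)              # quantize the prediction error
        code = max(-levels, min(levels - 1, code))   # clamp to the 4-bit range [-8, 7]
        codes.append(code)
        recon.append(pred + code * step)             # decoder-side reconstruction
    return codes, recon


if __name__ == "__main__":
    pcm = [0, 100, 200, 150, 50]
    codes, recon = encode_decode(pcm, previous_sample_predictor)
    print(codes, recon, snr_db(pcm, recon))
```

A more accurate predictor leaves a smaller prediction error to quantize, which is the mechanism by which swapping the predictor can raise SNR at the same bit rate.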
Pages: 165395-165406
Page count: 12