Interactive Multimedia Association-Adaptive Differential Pulse Code Modulation Codec With Gated Recurrent Unit Predictor

Cited by: 0
Authors
Sheferaw, Gebremichael Kibret [1]
Mwangi, Waweru [1]
Kimwele, Michael W. [1]
Mamuye, Adane [2]
Salau, Ayodeji Olalekan [3,4]
Affiliations
[1] Jomo Kenyatta Univ Agr & Technol, Sch Comp & Informat Technol, Nairobi, Kenya
[2] Addis Ababa Inst Technol AAiT, Sch Informat Technol & Engn, Addis Ababa, Ethiopia
[3] Afe Babalola Univ, Dept Elect Elect & Comp Engn, Ado Ekiti, Nigeria
[4] Saveetha Inst Med & Tech Sci, Saveetha Sch Engn, Chennai 602105, Tamil Nadu, India
Source
IEEE Access | 2024 / Vol. 12
Keywords
Speech coding; IMA-ADPCM; GRU predictor; predictive model; speech compression; NETWORK;
DOI
10.1109/ACCESS.2024.3493604
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
Speech coding is important for the efficient storage and transmission of audio signals. However, current Interactive Multimedia Association Adaptive Differential Pulse Code Modulation (IMA-ADPCM) speech coding techniques rely on a fixed predictor, which limits how well they encode dynamic, non-stationary speech signals. This limitation of the fixed predictor motivates the study. Our goal is to improve on the fixed predictor by integrating a Gated Recurrent Unit (GRU) predictor that can adapt to dynamic speech signals and predict them more accurately. We evaluated the performance of the baseline IMA-ADPCM encoding and of the IMA-ADPCM codec algorithm with the embedded GRU predictor. The proposed encoding system, based on a pre-trained GRU predictor, outperformed the baseline, reaching a maximum Signal-to-Noise Ratio (SNR) of 43.2 dB and Mean Opinion Score (MOS) values of 3.8 to 4.3 out of 5.0, which demonstrates a considerable improvement in audio quality. The main contribution of this study is a GRU predictor that is integrated into the IMA-ADPCM coding algorithm and developed using a dataset of IMA-ADPCM output speech samples and the actual PCM speech samples. By integrating the GRU predictor model in accordance with these data samples, the newly designed algorithm significantly improved the quality of the IMA-ADPCM speech codec.
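The idea in the abstract, a differential codec whose predictor can be swapped out and whose quality is reported as SNR, can be illustrated with a minimal sketch. This is not the authors' implementation: it uses a fixed uniform quantizer rather than the IMA-ADPCM adaptive step-size table, and the names `dpcm_roundtrip`, `previous_sample_predictor`, and `snr_db` are hypothetical. The `predictor` argument marks the place where a learned model such as a GRU could be substituted, assuming it maps the reconstructed history to the next predicted sample.

```python
import math
from typing import Callable, List, Sequence

# A predictor maps the already reconstructed samples to a guess for the next one.
Predictor = Callable[[Sequence[int]], int]


def previous_sample_predictor(history: Sequence[int]) -> int:
    """Fixed predictor: the next sample is assumed equal to the last one."""
    return history[-1] if history else 0


def dpcm_roundtrip(samples: Sequence[int], predictor: Predictor,
                   step: int = 64, levels: int = 16) -> List[int]:
    """Encode and immediately decode with closed-loop differential coding.

    Simplification (assumption): a uniform quantizer with a constant step,
    not the adaptive step-size table used by IMA-ADPCM.
    """
    half = levels // 2
    recon: List[int] = []
    for s in samples:
        pred = predictor(recon)  # prediction uses decoder-side history only
        # Quantize the prediction error to a 4-bit signed code when levels == 16.
        code = max(-half, min(half - 1, round((s - pred) / step)))
        recon.append(pred + code * step)  # what the decoder would rebuild
    return recon


def snr_db(reference: Sequence[float], decoded: Sequence[float]) -> float:
    """Signal-to-noise ratio in dB between a reference and its decoded version."""
    signal = sum(x * x for x in reference)
    noise = sum((x - y) ** 2 for x, y in zip(reference, decoded))
    if noise == 0:
        return math.inf
    return 10 * math.log10(signal / noise)


if __name__ == "__main__":
    # Synthetic slow sine wave standing in for a speech segment (assumption).
    pcm = [int(1000 * math.sin(2 * math.pi * n / 100)) for n in range(400)]
    fixed = dpcm_roundtrip(pcm, previous_sample_predictor)
    none = dpcm_roundtrip(pcm, lambda history: 0)
    print("previous-sample predictor SNR (dB):", snr_db(pcm, fixed))
    print("no prediction SNR (dB):", snr_db(pcm, none))
```

Because the quantizer sees only the prediction error, a predictor that tracks the signal more closely keeps the error inside the quantizer's range, which is the mechanism by which a better predictor can raise the SNR.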
Pages: 165395-165406
Number of pages: 12