Gated recurrent unit predictor model-based adaptive differential pulse code modulation speech decoder

Cited by: 2
Authors
Sheferaw, Gebremichael Kibret [1]
Mwangi, Waweru [1]
Kimwele, Michael [1]
Mamuye, Adane [2]
Affiliations
[1] Jomo Kenyatta Univ Agr & Technol, Sch Comp & Informat Technol, Nairobi, Kenya
[2] Addis Ababa Univ, Sch Informat Technol & Engn, Inst Technol, Addis Ababa, Ethiopia
Keywords
Speech coding; Gated recurrent unit; Nonlinear prediction; Waveform coding; Audio coding; Adaptive differential pulse code modulation; Speech compression; Neural networks
DOI
10.1186/s13636-023-00325-3
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Speech coding reduces the amount of data needed to represent a speech signal by exploiting the signal's statistical properties. Recently, neural network prediction models have gained attention in speech coding as a way to reconstruct nonlinear and nonstationary speech signals. This study proposes a novel approach to improving speech coding performance: an adaptive differential pulse code modulation (ADPCM) system built around a gated recurrent unit (GRU) predictor. The GRU predictor model is trained using a data set that combines actual speech samples from the DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus with the speech samples output by the ADPCM fixed predictor. Our contribution is twofold: an algorithm for training the GRU predictive model that improves its performance in speech coding prediction, and a new offline-trained predictive model for the speech decoder. The results indicate that the proposed system significantly improves the accuracy of speech prediction, demonstrating its potential for speech prediction applications. Overall, this work presents a unique application of the GRU predictive model with ADPCM decoding in speech signal compression and offers a promising direction for future research in this field.
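For readers unfamiliar with ADPCM, the prediction loop that the abstract refers to can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the first-order fixed predictor, its coefficient, the 4-bit uniform quantizer, and the step-size multipliers are all hypothetical choices made for the example. The predictor is passed in as a plain callable, so a trained GRU model of the kind the abstract proposes could in principle take its place at the decoder; that substitution is not shown here.

```python
import math


def make_fixed_predictor(coeff=0.9):
    """Hypothetical fixed first-order predictor: coeff * last reconstructed sample."""
    def predict(history):
        return coeff * history[-1] if history else 0.0
    return predict


def _adapt(step, code, levels, min_step=1e-4, max_step=1.0):
    # Illustrative step-size rule (assumed multipliers): grow the step when the
    # code lands on an outer level, shrink it when the code is small.
    factor = 1.5 if abs(code) >= levels // 2 else 0.9
    return min(max(step * factor, min_step), max_step)


def adpcm_encode(samples, predict, bits=4, step=0.02):
    """Quantize the prediction error; return codes and the encoder's own reconstruction."""
    levels = 2 ** (bits - 1)  # codes lie in [-levels, levels - 1]
    history, codes = [], []
    for x in samples:
        pred = predict(history)
        code = max(-levels, min(levels - 1, round((x - pred) / step)))
        codes.append(code)
        # Track what the decoder will reconstruct, so both sides stay in sync.
        history.append(pred + code * step)
        step = _adapt(step, code, levels)
    return codes, history


def adpcm_decode(codes, predict, bits=4, step=0.02):
    """Rebuild the signal from codes using the same predictor and step rule."""
    levels = 2 ** (bits - 1)
    history = []
    for code in codes:
        pred = predict(history)
        history.append(pred + code * step)
        step = _adapt(step, code, levels)
    return history


if __name__ == "__main__":
    # A 200 Hz test tone sampled at 8 kHz stands in for speech.
    signal = [0.5 * math.sin(2 * math.pi * 200 * n / 8000) for n in range(400)]
    predict = make_fixed_predictor()
    codes, _ = adpcm_encode(signal, predict)
    decoded = adpcm_decode(codes, predict)
    print("max abs error:", max(abs(a - b) for a, b in zip(signal, decoded)))
```

Only the integer codes cross the channel. The decoder rebuilds each sample from its own prediction plus the dequantized error, which is why the quality of the predictor bears directly on the reconstruction.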
Pages: 17