Speech coding techniques and challenges: a comprehensive literature survey

被引:3
|
作者
Nagaraja, B. G. [1 ]
Anees, Mohamed [1 ]
Thimmaraja Yadava, G. [2 ]
机构
[1] Vidyavardhaka Coll Engn, E&CE, Gokulam 3 Stage, Mysuru 570002, Karnataka, India
[2] Nitte Meenakshi Inst Technol, E&CE, Bengaluru 560064, Karnataka, India
关键词
Speech coding techniques; Metrics; Speech enhancement; Noisy environment; Speech database; FEATURE-EXTRACTION; SPEAKER RECOGNITION; MODELING TECHNIQUES; QUALITY; AUDIO;
D O I
10.1007/s11042-023-16665-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech coding is the process of compressing speech signals for transmission and storage in communication systems. In recent years, speech coding has become increasingly important due to the growing demand for low bitrate communication systems. This paper presents a comprehensive literature survey of speech coding techniques, their importance, and the challenges associated with their implementation. We also discuss the use of speech enhancement techniques in speech coding. The survey covers various speech coding techniques and their limitations in adverse conditions. We highlight the potential of machine learning-based methods in improving speech quality and intelligibility in speech coding systems. Further, metrics for evaluating the performance of speech coding algorithms are highlighted. The survey also discusses the key issues and challenges associated with speech coding, including the trade-off between speech quality and bit rate, and the impact of background noise on speech quality. Further it also covers popular speech databases used in coding research. Our findings provide valuable insights for researchers and practitioners working in speech coding and demonstrate the importance of speech enhancement techniques for improving speech quality and intelligibility in low bitrate communication systems.
引用
收藏
页码:29859 / 29879
页数:21
相关论文
共 50 条
  • [41] SURVEY OF DIGITAL SPEECH PROCESSING TECHNIQUES
    SCHAFER, RW
    IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1972, AU20 (01): : 28 - +
  • [42] Comprehensive Review of Various Speech Enhancement Techniques
    Gulati, Savy
    COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 536 - 540
  • [43] A Comprehensive Investigation into The Noise Reduction Techniques for Speech
    Tyagi, Suryakant
    Varkonyi-Koczy, Annamaria R.
    Szenasi, Sandor
    2023 IEEE 21ST WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS, SAMI, 2023, : 207 - 212
  • [44] Literature Survey of Arabic Speech Recognition
    Al-Anzi, Fawaz S.
    AbuZeina, Dia
    PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES AND ENGINEERING (ICCSE), 2018,
  • [45] Chinese dialect speech recognition: a comprehensive survey
    Qiang Li
    Qianyu Mai
    Mandou Wang
    Mingjuan Ma
    Artificial Intelligence Review, 57
  • [46] Comprehensive survey on haze removal techniques
    Singh, Dilbag
    Kumar, Vijay
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (08) : 9595 - 9620
  • [47] Trends in speech emotion recognition: a comprehensive survey
    Kaur, Kamaldeep
    Singh, Parminder
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29307 - 29351
  • [48] A Comprehensive Survey on Shadow Detection Techniques
    Panchal, Monali H.
    Gamit, Nikunj C.
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 2249 - 2253
  • [49] A Comprehensive Survey on Computer Forensics: State-of-the-Art, Tools, Techniques, Challenges, and Future Directions
    Javed, Abdul Rehman
    Ahmed, Waqas
    Alazab, Mamoun
    Jalil, Zunera
    Kifayat, Kashif
    Gadekallu, Thippa Reddy
    IEEE ACCESS, 2022, 10 : 11065 - 11089
  • [50] A Comprehensive Survey on Computer Forensics: State-of-the-Art, Tools, Techniques, Challenges, and Future Directions
    Javed, Abdul Rehman
    Ahmed, Waqas
    Alazab, Mamoun
    Jalil, Zunera
    Kifayat, Kashif
    Gadekallu, Thippa Reddy
    IEEE Access, 2022, 10 : 11065 - 11089