Speech coding techniques and challenges: a comprehensive literature survey

Cited by: 3
Authors
Nagaraja, B. G. [1]
Anees, Mohamed [1]
Thimmaraja Yadava, G. [2]
Affiliations
[1] Vidyavardhaka Coll Engn, E&CE, Gokulam 3 Stage, Mysuru 570002, Karnataka, India
[2] Nitte Meenakshi Inst Technol, E&CE, Bengaluru 560064, Karnataka, India
Keywords
Speech coding techniques; Metrics; Speech enhancement; Noisy environment; Speech database; FEATURE-EXTRACTION; SPEAKER RECOGNITION; MODELING TECHNIQUES; QUALITY; AUDIO
DOI
10.1007/s11042-023-16665-3
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Speech coding is the process of compressing speech signals for transmission and storage in communication systems. In recent years, it has become increasingly important because of the growing demand for low-bitrate communication systems. This paper presents a comprehensive literature survey of speech coding techniques, their importance, and the challenges associated with their implementation. We also discuss the use of speech enhancement techniques in speech coding. The survey covers various speech coding techniques and their limitations in adverse conditions, and highlights the potential of machine learning-based methods for improving speech quality and intelligibility in speech coding systems. Metrics for evaluating the performance of speech coding algorithms are also reviewed. The survey further discusses the key issues and challenges associated with speech coding, including the trade-off between speech quality and bit rate and the impact of background noise on speech quality, and it covers popular speech databases used in coding research. Our findings provide valuable insights for researchers and practitioners working in speech coding and demonstrate the importance of speech enhancement techniques for improving speech quality and intelligibility in low-bitrate communication systems.
Pages: 29859 - 29879
Page count: 21
Related papers
50 in total
  • [21] A comprehensive survey of multimodal fake news detection techniques: advances, challenges, and opportunities
    Shivani Tufchi
    Ashima Yadav
    Tanveer Ahmed
    International Journal of Multimedia Information Retrieval, 2023, 12
  • [22] A comprehensive survey of multimodal fake news detection techniques: advances, challenges, and opportunities
    Tufchi, Shivani
    Yadav, Ashima
    Ahmed, Tanveer
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2023, 12 (02)
  • [23] A comprehensive survey of image and video forgery techniques: variants, challenges, and future directions
    Syed Tufael Nabi
    Munish Kumar
    Paramjeet Singh
    Naveen Aggarwal
    Krishan Kumar
    Multimedia Systems, 2022, 28 : 939 - 992
  • [24] Comprehensive Survey on VLC in E-Healthcare: Channel Coding Schemes and Modulation Techniques
    Guana-Moya, Javier
    Canizares, Milton Roman
    Jativa, Pablo Palacios
    Sanchez, Ivan
    Ruminot, Dayana
    Lobos, Fernando Vergara
    APPLIED SCIENCES-BASEL, 2024, 14 (19):
  • [25] THE DEVELOPMENT OF A COMPREHENSIVE SYSTEM FOR CODING SIMULTANEOUS SPEECH
    ROGER, D
    BULL, P
    SMITH, S
    BULLETIN OF THE BRITISH PSYCHOLOGICAL SOCIETY, 1987, 40 : A21 - A22
  • [26] GNSS-Based Attitude Determination Techniques-A Comprehensive Literature Survey
    Raskaliyev, Almat
    Patel, Sarosh Hosi
    Sobh, Tarek M.
    Ibrayev, Aidos
    IEEE ACCESS, 2020, 8 : 24873 - 24886
  • [27] A Comprehensive Survey on Summarization Techniques
    Uppalapati P.J.
    Dabbiru M.
    Rao K.V.
    SN Computer Science, 4 (5)
  • [28] Deblurring Techniques - A Comprehensive Survey
    Sankaraiah, Y. Ravi
    Varadarajan, S.
    2017 IEEE INTERNATIONAL CONFERENCE ON POWER, CONTROL, SIGNALS AND INSTRUMENTATION ENGINEERING (ICPCSI), 2017, : 2032 - 2035
  • [29] Speech based detection of Alzheimer's disease: a survey of AI techniques, datasets and challenges
    Ding, Kewen
    Chetty, Madhu
    Hoshyar, Azadeh Noori
    Bhattacharya, Tanusri
    Klein, Britt
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (12)
  • [30] Speech Emotion Recognition: A Comprehensive Survey
    Mohammed Jawad Al-Dujaili
    Abbas Ebrahimi-Moghadam
    Wireless Personal Communications, 2023, 129 : 2525 - 2561