Speech coding techniques and challenges: a comprehensive literature survey

被引:3
|
作者
Nagaraja, B. G. [1 ]
Anees, Mohamed [1 ]
Thimmaraja Yadava, G. [2 ]
机构
[1] Vidyavardhaka Coll Engn, E&CE, Gokulam 3 Stage, Mysuru 570002, Karnataka, India
[2] Nitte Meenakshi Inst Technol, E&CE, Bengaluru 560064, Karnataka, India
关键词
Speech coding techniques; Metrics; Speech enhancement; Noisy environment; Speech database; FEATURE-EXTRACTION; SPEAKER RECOGNITION; MODELING TECHNIQUES; QUALITY; AUDIO;
D O I
10.1007/s11042-023-16665-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech coding is the process of compressing speech signals for transmission and storage in communication systems. In recent years, speech coding has become increasingly important due to the growing demand for low bitrate communication systems. This paper presents a comprehensive literature survey of speech coding techniques, their importance, and the challenges associated with their implementation. We also discuss the use of speech enhancement techniques in speech coding. The survey covers various speech coding techniques and their limitations in adverse conditions. We highlight the potential of machine learning-based methods in improving speech quality and intelligibility in speech coding systems. Further, metrics for evaluating the performance of speech coding algorithms are highlighted. The survey also discusses the key issues and challenges associated with speech coding, including the trade-off between speech quality and bit rate, and the impact of background noise on speech quality. Further it also covers popular speech databases used in coding research. Our findings provide valuable insights for researchers and practitioners working in speech coding and demonstrate the importance of speech enhancement techniques for improving speech quality and intelligibility in low bitrate communication systems.
引用
收藏
页码:29859 / 29879
页数:21
相关论文
共 50 条
  • [31] Speech Emotion Recognition: A Comprehensive Survey
    Al-Dujaili, Mohammed Jawad
    Ebrahimi-Moghadam, Abbas
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 129 (04) : 2525 - 2561
  • [32] Speech coding: Applications, challenges and new directions
    Kroon, P
    ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 1 - 4
  • [33] A comprehensive survey on Machine Learning techniques in opportunistic networks: Advances, challenges and future directions
    Gandhi, Jay
    Narmawala, Zunnun
    PERVASIVE AND MOBILE COMPUTING, 2024, 100
  • [34] FREQUENCY-DOMAIN TECHNIQUES FOR SPEECH CODING
    CROCHIERE, RE
    TRIBOLET, JM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 : S139 - S139
  • [35] FREQUENCY-DOMAIN TECHNIQUES FOR SPEECH CODING
    CROCHIERE, RE
    TRIBOLET, JM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 66 (06): : 1642 - 1646
  • [36] COMPREHENSIVE IMPROVEMENT IN LOW BIT RATE SPEECH CODING
    FAN, CX
    MA, HF
    DALLAS GLOBECOM 89, VOLS 1-3: COMMUNICATIONS TECHNOLOGY FOR THE 1990S AND BEYOND, 1989, : 1916 - 1920
  • [37] A Comprehensive Look at Coding Techniques on Riemannian Manifolds
    Faraki, Masoud
    Harandi, Mehrtash T.
    Porikli, Fatih
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) : 5701 - 5712
  • [38] A SURVEY OF SPEECH BANDWIDTH COMPRESSION TECHNIQUES
    CAMPANELLA, SJ
    IRE TRANSACTIONS ON AUDIO, 1958, 6 (05): : 104 - 116
  • [39] A Survey of Different Speech Synthesis Techniques
    Jalil, Madiha
    Butt, Faran Awais
    Malik, Ahmed
    2013 INTERNATIONAL CONFERENCE ON TECHNOLOGICAL ADVANCES IN ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING (TAEECE), 2013, : 204 - 207
  • [40] A Survey: Speech Recognition Approaches and Techniques
    Singh, Atma Prakash
    Nath, Ravindra
    Kumar, Santosh
    2018 5TH IEEE UTTAR PRADESH SECTION INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING (UPCON), 2018, : 563 - 566