A scalable speech coding scheme using compressive sensing and orthogonal mapping based vector quantization

被引:6
|
作者
Sankar, M. S. Arun [1 ]
Sathidevi, P. S. [1 ]
机构
[1] Natl Inst Technol Calicut, Dept Elect & Commun Engn, Kozhikode, Kerala, India
关键词
Electrical engineering; Speech processing; Wavelet; Speech coding; CELP; Compressive sensing; Speech compression;
D O I
10.1016/j.heliyon.2019.e01820
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A novel scalable speech coding scheme based on Compressive Sensing (CS), which can operate at bit rates from 3.275 to 7.275 kbps is designed and implemented in this paper. The CS based speech coding offers the benefit of combined compression and encryption with inherent de-noising and bit rate scalability. The non-stationary nature of speech signal causes the recovery process from CS measurements very complex due to the variation in sparsifying bases. In this work, the complexity of the recovery process is reduced by assigning a suitable basis to each frame of the speech signal based on its statistical properties. As the quality of the reconstructed speech depends on the sensing matrix used at the transmitter, a variant of Binary Permuted Block Diagonal (BPBD) matrix is also proposed here which offers a better performance than that of the commonly used Gaussian random matrix. To improve the coding efficiency, formant filter coefficients are quantized using the conventional Vector Quantization (VQ) and an orthogonal mapping based VQ is developed for the quantization of CS measurements. The proposed coding scheme offers the listening quality for reconstructed speech similar to that of Adaptive Multi rate - Narrowband (AMR-NB) codec at 6.7 kbps and Enhanced Voice Services (EVS) at 7.2 kbps. A separate de-noising block is not required in the proposed coding scheme due to the inherent de-noising property of CS. Scalability in bit rate is achieved in the proposed method by varying the number of random measurements and the number of levels for orthogonal mapping in the VQ stage of measurements.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Compressive Sensing Based Scalable Speech Coder with Dynamic Selection of Basis and Vector Quantization
    Sankar, M. S. Arun
    Sathidevi, P. S.
    [J]. 2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 1053 - 1058
  • [2] Subdata image encryption scheme based on compressive sensing and vector quantization
    Fan, Haiju
    Zhou, Kanglei
    Zhang, En
    Wen, Wenying
    Li, Ming
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (16): : 12771 - 12787
  • [3] Subdata image encryption scheme based on compressive sensing and vector quantization
    Haiju Fan
    Kanglei Zhou
    En Zhang
    Wenying Wen
    Ming Li
    [J]. Neural Computing and Applications, 2020, 32 : 12771 - 12787
  • [4] Scalable Low bit rate CELP Coder based on Compressive Sensing and Vector Quantization
    Sankar, Arun M. S.
    Sathidevi, P. S.
    [J]. 2016 IEEE ANNUAL INDIA CONFERENCE (INDICON), 2016,
  • [5] Scalable Video Coding Using Compressive Sensing
    Jiang, Hong
    Li, Chengbo
    Haimi-Cohen, Raziel
    Wilford, Paul A.
    Zhang, Yin
    [J]. BELL LABS TECHNICAL JOURNAL, 2012, 16 (04) : 149 - 169
  • [6] EFFICIENT LOSSLESS CODING SCHEME FOR VECTOR QUANTIZATION USING DYNAMIC INDEX MAPPING
    LEE, SJ
    YANG, KH
    KIM, CW
    LEE, CW
    [J]. ELECTRONICS LETTERS, 1995, 31 (17) : 1426 - 1427
  • [7] SPEECH CODING BASED UPON VECTOR QUANTIZATION
    BUZO, A
    GRAY, AH
    GRAY, RM
    MARKEL, JD
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (05): : 562 - 574
  • [8] Wavelet scalable speech coding using algebraic quantization
    De Meuleneire, M.
    Taddei, H.
    Pastor, D.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4753 - +
  • [9] Cross-Scale Vector Quantization for Scalable Neural Speech Coding
    Jiang, Xue
    Peng, Xiulian
    Xue, Huaying
    Zhang, Yuan
    Lu, Yan
    [J]. INTERSPEECH 2022, 2022, : 4222 - 4226
  • [10] VECTOR QUANTIZATION IN SPEECH CODING
    MAKHOUL, J
    ROUCOS, S
    GISH, H
    [J]. PROCEEDINGS OF THE IEEE, 1985, 73 (11) : 1551 - 1588