A scalable speech coding scheme using compressive sensing and orthogonal mapping based vector quantization

Cited by: 6

Authors:

Sankar, M. S. Arun [1]

Sathidevi, P. S. [1]

Affiliations:

[1] Natl Inst Technol Calicut, Dept Elect & Commun Engn, Kozhikode, Kerala, India

Source:

HELIYON | 2019 / Volume 5 / Issue 05

Keywords:

Electrical engineering; Speech processing; Wavelet; Speech coding; CELP; Compressive sensing; Speech compression;

DOI:

10.1016/j.heliyon.2019.e01820

Chinese Library Classification (CLC):

O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

A novel scalable speech coding scheme based on Compressive Sensing (CS), operating at bit rates from 3.275 to 7.275 kbps, is designed and implemented in this paper. CS-based speech coding offers combined compression and encryption, with inherent de-noising and bit rate scalability. Because speech is non-stationary, the sparsifying basis varies, which makes recovery from CS measurements very complex. In this work, the complexity of the recovery process is reduced by assigning a suitable basis to each frame of the speech signal based on its statistical properties. As the quality of the reconstructed speech depends on the sensing matrix used at the transmitter, a variant of the Binary Permuted Block Diagonal (BPBD) matrix is also proposed, which performs better than the commonly used Gaussian random matrix. To improve coding efficiency, the formant filter coefficients are quantized using conventional Vector Quantization (VQ), and an orthogonal mapping based VQ is developed for quantizing the CS measurements. The proposed coding scheme offers listening quality for reconstructed speech similar to that of the Adaptive Multi-Rate Narrowband (AMR-NB) codec at 6.7 kbps and Enhanced Voice Services (EVS) at 7.2 kbps. A separate de-noising block is not required, owing to the inherent de-noising property of CS. Bit rate scalability is achieved by varying the number of random measurements and the number of levels for orthogonal mapping in the VQ stage of the measurements.
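
The measurement and recovery steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' coder: it uses the Gaussian random sensing matrix that the abstract names as the common baseline (not the proposed BPBD variant), and it assumes a fixed DCT sparsifying basis and Orthogonal Matching Pursuit for recovery, neither of which is stated in the abstract. Quantization is omitted, and the names `dct_basis`, `omp`, and `cs_encode_decode` are hypothetical.

```python
import numpy as np

def dct_basis(n):
    """Orthonormal DCT-II synthesis basis Psi (columns are atoms): x = Psi @ s."""
    k = np.arange(n)[:, None]
    t = np.arange(n)[None, :]
    analysis = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * t + 1) * k / (2 * n))
    analysis[0, :] /= np.sqrt(2.0)
    # The analysis matrix is orthonormal, so its inverse is its transpose.
    return analysis.T

def omp(A, y, k):
    """Orthogonal Matching Pursuit: greedy k-sparse estimate of s in y = A @ s (k >= 1)."""
    residual = y.copy()
    support = []
    for _ in range(k):
        # Pick the column most correlated with the current residual.
        idx = int(np.argmax(np.abs(A.T @ residual)))
        if idx not in support:
            support.append(idx)
        # Re-fit all selected coefficients by least squares.
        coef, *_ = np.linalg.lstsq(A[:, support], y, rcond=None)
        residual = y - A[:, support] @ coef
    s = np.zeros(A.shape[1])
    s[support] = coef
    return s

def cs_encode_decode(frame, m, k, rng):
    """Take m random measurements of one frame, then reconstruct it.

    Assumption: the frame is roughly k-sparse in the DCT basis.
    """
    n = frame.size
    psi = dct_basis(n)
    phi = rng.standard_normal((m, n)) / np.sqrt(m)  # Gaussian sensing matrix
    y = phi @ frame                                 # the m CS measurements
    s_hat = omp(phi @ psi, y, k)                    # sparse coefficients
    return psi @ s_hat                              # reconstructed frame
```

In this sketch the number of measurements `m` is the knob that trades bit rate against reconstruction quality: fewer measurements mean fewer values to quantize and transmit, at the cost of a less reliable recovery.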

Pages: 13
