Stacked causal convolutional autoencoder based speech compression method

Cited by: 0
Authors
Bekiryazici, Tahir [1]
Aydemir, Gurkan [1]
Gurkan, Hakan [1]
Affiliations
[1] Bursa Tekn Univ, Elekt Elekt Muhendisligi Bolumu, Bursa, Turkiye
Keywords
Speech compression; residual vector quantization; convolutional autoencoder; deep learning
DOI
10.1109/SIU61531.2024.10600779
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This study proposes a speech compression method based on a one-dimensional convolutional autoencoder and residual vector quantization. The method supports several compression ratios at low bit rates. Its performance was evaluated with the Perceptual Evaluation of Speech Quality (PESQ) metric. In the experiments, the method achieves a PESQ score of 1.903 at 2.5 kbps and 2.24 at 5 kbps.
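As a rough illustration of the residual vector quantization step named in the abstract, the sketch below quantizes one latent vector in stages, each stage coding the residual left by the previous one. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names (rvq_encode, rvq_decode, bitrate_bps), the toy codebooks, and the frame rate are all hypothetical, and the record does not give the paper's codebook sizes, number of stages, or latent frame rate.

```python
import numpy as np


def rvq_encode(x, codebooks):
    """Return one codeword index per stage for vector x.

    codebooks is a list of arrays of shape (K, D); each stage quantizes
    the residual that the previous stages left behind.
    """
    residual = np.asarray(x, dtype=float).copy()
    indices = []
    for cb in codebooks:
        # Nearest codeword to the current residual (squared Euclidean distance).
        dists = np.sum((cb - residual) ** 2, axis=1)
        idx = int(np.argmin(dists))
        indices.append(idx)
        residual = residual - cb[idx]
    return indices


def rvq_decode(indices, codebooks):
    """Reconstruct the vector as the sum of the selected codewords."""
    return sum(cb[i] for cb, i in zip(codebooks, indices))


def bitrate_bps(frames_per_second, codebooks):
    """Bits per second if every frame sends one index per stage."""
    bits_per_frame = sum(np.log2(len(cb)) for cb in codebooks)
    return frames_per_second * bits_per_frame


# Toy, hand-picked codebooks (hypothetical values, not from the paper).
codebooks = [
    np.array([[0.0, 0.0], [1.0, 1.0]]),    # stage 1: coarse
    np.array([[0.0, 0.0], [0.5, -0.5]]),   # stage 2: refines the residual
]
x = np.array([1.4, 0.6])
idx = rvq_encode(x, codebooks)
print("indices:", idx)
print("reconstruction:", rvq_decode(idx, codebooks))
# Assumed frame rate of 100 latent frames per second, for illustration only.
print("bit rate (bps):", bitrate_bps(100, codebooks))
```

In a scheme like this, sending fewer stages lowers the bit rate at the cost of a coarser reconstruction, which is one common way a single quantizer can offer several compression ratios. Whether the paper varies the rate this way is not stated in the abstract.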
Pages: 4
Related papers
50 in total
  • [1] MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder
    Li, You-Jin
    Wang, Syu-Siang
    Tsao, Yu
    Su, Borching
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1245 - 1250
  • [2] ECG Compression method based on convolutional autoencoder and discrete wavelet transform
    Bekiryazici, Tahir
    Gurkan, Hakan
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [3] Face Recognition Based on Stacked Convolutional Autoencoder and Sparse Representation
    Chang, Liping
    Yang, Jianjun
    Li, Sheng
    Xu, Hong
    Liu, Kai
    Huang, Chaogeng
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [4] Loop closure detection for visual SLAM based on stacked convolutional autoencoder
    Zhang Y.-Z.
    Hu H.
    Qin C.
    Chu H.
    Wu Y.-X.
    Kongzhi yu Juece/Control and Decision, 2019, 34 (05): : 981 - 988
  • [5] HRRP target recognition method based on one-dimensional stacked pooling fusion convolutional autoencoder
    Zhang, Guoling
    Wu, Chongming
    Li, Rui
    Lai, Jie
    Xiang, Qian
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (12): : 3533 - 3541
  • [6] Intelligent fault diagnosis method of rolling bearing based on stacked denoising autoencoder and convolutional neural network
    Che Changchang
    Wang Huawei
    Ni Xiaomei
    Fu Qiang
    INDUSTRIAL LUBRICATION AND TRIBOLOGY, 2020, 72 (07) : 947 - 953
  • [7] Impact of Different Compression Rates for Hyperspectral Data Compression Based on a Convolutional Autoencoder
    Kuester, Jannick
    Gross, Wolfgang
    Heizmann, Michael M.
    Middelmann, Wolfgang
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXVII, 2021, 11862
  • [8] Denoising Convolutional Autoencoder Based Approach for Disordered Speech Recognition
    Chandrakala, S.
    Vishnika, Veni S.
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2024, 33 (01)
  • [9] Deep Convolutional AutoEncoder-based Lossy Image Compression
    Cheng, Zhengxue
    Sun, Heming
    Takeuchi, Masaru
    Katto, Jiro
    2018 PICTURE CODING SYMPOSIUM (PCS 2018), 2018, : 253 - 257
  • [10] HDR Image Compression with Convolutional Autoencoder
    Han, Fei
    Wang, Jin
    Xiong, Ruiqin
    Zhu, Qing
    Yin, Baocai
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 25 - 28