Analysis of Compressed Speech Signals in an Automatic Speaker Recognition System

被引:0
|
作者
Metzger, Richard A. [1 ]
Doherty, John F. [1 ]
Jenkins, David M. [2 ]
机构
[1] Penn State Univ, Dept Elect Engn, University Pk, PA 16802 USA
[2] Appl Res Lab, University Pk, PA USA
关键词
Speaker Recognition; Gaussian Mixture Models; Mel-Frequency Cepstrum Coefficients; Audio Compression;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper analyzes the effects popular audio compression algorithms have on the performance of a speaker recognition system. Popular audio compression algorithms were used to compress both clean and noisy speech before being passed to a speaker recognition system. The features extracted from each speaker were 19-dimensional Mel-Frequency Cepstrum Coefficients (MFCC) and the corresponding features were modeled using a 16 mixture Gaussian Mixture Model (GMM). Our experiments show that compression will have a negative effect on recognition rates if the compressed speech is clean. However, if small amounts of white Gaussian noise are added before the speech is compressed, recognition rates can be increased by as much as 7% with certain compression algorithms.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] SPARSIFICATION VIA COMPRESSED SENSING FOR AUTOMATIC SPEECH RECOGNITION
    Zhen, Kai
    Hieu Duy Nguyen
    Chang, Feng-Ju
    Mouchtaris, Athanasios
    Rastrow, Ariya
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6009 - 6013
  • [32] Holonic multi-agent system model for fuzzy automatic speech/speaker recognition
    Valencia-Jimenez, J. J.
    Fernandez-Caballero, Antonio
    AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, PROCEEDINGS, 2008, 4953 : 73 - 82
  • [33] An Automatic Real Time Speech-Speaker Recognition System: A Real Time Approach
    Kakade, Mandar Nitin
    Salunke, D. B.
    ICCCE 2019: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND CYBER-PHYSICAL ENGINEERING, 2020, 570 : 151 - 158
  • [34] Speaker-Independent Automatic Speech Recognition System for Mobile Phone Applications in Punjabi
    Mittal, Puneet
    Singh, Navdeep
    ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2018, 678 : 369 - 382
  • [35] Digital Speech Watermarking for Authenticity of Speaker in Speaker Recognition System
    Desai, Nihalkumar
    Tahilramani, Nikunj
    2016 INTERNATIONAL CONFERENCE ON MICRO-ELECTRONICS AND TELECOMMUNICATION ENGINEERING (ICMETE), 2016, : 105 - 109
  • [36] ON THE PATH TO THE AUTOMATIC RECOGNITION OF ACOUSTIC SPEECH SIGNALS
    UNTERBERGER
    ANGEWANDTE INFORMATIK, 1982, (09): : 445 - 450
  • [37] Improved automatic speech recognition system using sparse decomposition by basis pursuit with deep rectifier neural networks and compressed sensing recomposition of speech signals
    Gavrilescu, Mihai
    2014 10TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2014,
  • [38] Automatic Speech Recognition System Based on Wavelet Analysis
    Ziolko, Mariusz
    Galka, Jakub
    Ziolko, Bartosz
    Jadczyk, Tomasz
    Skurzok, Dawid
    Wicijowski, Jan
    2010 IEEE FOURTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2010), 2010, : 450 - 451
  • [39] AUTOMATIC SPEECH RECOGNITION SYSTEM
    RUSKE, G
    UMSCHAU IN WISSENSCHAFT UND TECHNIK, 1979, 79 (18) : 566 - 572
  • [40] Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition
    Sivasankaran, Sunit
    Vincent, Emmanuel
    Fohr, Dominique
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 346 - 350