Analysis of Compressed Speech Signals in an Automatic Speaker Recognition System

被引:0
|
作者
Metzger, Richard A. [1 ]
Doherty, John F. [1 ]
Jenkins, David M. [2 ]
机构
[1] Penn State Univ, Dept Elect Engn, University Pk, PA 16802 USA
[2] Appl Res Lab, University Pk, PA USA
关键词
Speaker Recognition; Gaussian Mixture Models; Mel-Frequency Cepstrum Coefficients; Audio Compression;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper analyzes the effects popular audio compression algorithms have on the performance of a speaker recognition system. Popular audio compression algorithms were used to compress both clean and noisy speech before being passed to a speaker recognition system. The features extracted from each speaker were 19-dimensional Mel-Frequency Cepstrum Coefficients (MFCC) and the corresponding features were modeled using a 16 mixture Gaussian Mixture Model (GMM). Our experiments show that compression will have a negative effect on recognition rates if the compressed speech is clean. However, if small amounts of white Gaussian noise are added before the speech is compressed, recognition rates can be increased by as much as 7% with certain compression algorithms.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] SPEECH RECOGNITION SYSTEM WITH AUTOMATIC SPEAKER-ADAPTION
    BROUWER, P
    FREQUENZ, 1978, 32 (07) : 204 - 207
  • [2] Improving Robustness to Compressed Speech in Speaker Recognition
    McLaren, Mitchell
    Abrash, Victor
    Graciarena, Martin
    Lei, Yun
    Pesan, Jan
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3665 - 3669
  • [3] ADAPTING TO THE SPEAKER IN AUTOMATIC SPEECH RECOGNITION
    TALBOT, M
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1987, 27 (04): : 449 - 457
  • [4] Performance analysis of compressed-domain automatic speaker recognition as a function of speech coding technique and bit rate
    Petracca, M.
    Servetti, A.
    De Martin, J. C.
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1393 - +
  • [5] Evaluation of a forensic automatic speaker recognition system with emotional speech recordings
    Essery, Robert
    Harrison, Philip
    Hughes, Vincent
    INTERSPEECH 2023, 2023, : 2568 - 2572
  • [6] An automatic speech recognition system with speaker-independent identification support
    Caranica, Alexandru
    Burileanu, Corneliu
    ADVANCED TOPICS IN OPTOELECTRONICS, MICROELECTRONICS, AND NANOTECHNOLOGIES VII, 2015, 9258
  • [7] Minimum of Information Divergence Criterion for Signals with Tuning to Speaker Voice in Automatic Speech Recognition
    Savchenko V.V.
    Radioelectronics and Communications Systems, 2020, 63 (01) : 42 - 54
  • [8] Automatic speaker recognition with crosslanguage speech material
    Kuenzel, Hermann J.
    INTERNATIONAL JOURNAL OF SPEECH LANGUAGE AND THE LAW, 2013, 20 (01) : 21 - 44
  • [9] SIMILARITY MEASURE FOR AUTOMATIC SPEECH AND SPEAKER RECOGNITION
    SCHROEDER, MR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1968, 43 (02): : 375 - +
  • [10] AN AUTOMATIC SPEAKER RECOGNITION SYSTEM
    Akrouf, Samir
    Mehamel, Abbas
    Benhamouda, Nacera
    Mostefai, Messaoud
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2009), VOLS 1 AND 2, 2009, : 719 - 727