Analysis of Compressed Speech Signals in an Automatic Speaker Recognition System

被引：0

作者：

Metzger, Richard A. ^{[1
]}

Doherty, John F. ^{[1
]}

Jenkins, David M. ^{[2
]}

机构：

[1] Penn State Univ, Dept Elect Engn, University Pk, PA 16802 USA

[2] Appl Res Lab, University Pk, PA USA

来源：

2015 49TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS) | 2015年

关键词：

Speaker Recognition; Gaussian Mixture Models; Mel-Frequency Cepstrum Coefficients; Audio Compression;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper analyzes the effects popular audio compression algorithms have on the performance of a speaker recognition system. Popular audio compression algorithms were used to compress both clean and noisy speech before being passed to a speaker recognition system. The features extracted from each speaker were 19-dimensional Mel-Frequency Cepstrum Coefficients (MFCC) and the corresponding features were modeled using a 16 mixture Gaussian Mixture Model (GMM). Our experiments show that compression will have a negative effect on recognition rates if the compressed speech is clean. However, if small amounts of white Gaussian noise are added before the speech is compressed, recognition rates can be increased by as much as 7% with certain compression algorithms.

引用

页数：5

共 50 条

[31] SPARSIFICATION VIA COMPRESSED SENSING FOR AUTOMATIC SPEECH RECOGNITION
Zhen, Kai
Hieu Duy Nguyen
Chang, Feng-Ju
Mouchtaris, Athanasios
Rastrow, Ariya
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6009 - 6013
[32] Holonic multi-agent system model for fuzzy automatic speech/speaker recognition
Valencia-Jimenez, J. J.
Fernandez-Caballero, Antonio
AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, PROCEEDINGS, 2008, 4953 : 73 - 82
[33] An Automatic Real Time Speech-Speaker Recognition System: A Real Time Approach
Kakade, Mandar Nitin
Salunke, D. B.
ICCCE 2019: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND CYBER-PHYSICAL ENGINEERING, 2020, 570 : 151 - 158
[34] Speaker-Independent Automatic Speech Recognition System for Mobile Phone Applications in Punjabi
Mittal, Puneet
Singh, Navdeep
ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2018, 678 : 369 - 382
[35] Digital Speech Watermarking for Authenticity of Speaker in Speaker Recognition System
Desai, Nihalkumar
Tahilramani, Nikunj
2016 INTERNATIONAL CONFERENCE ON MICRO-ELECTRONICS AND TELECOMMUNICATION ENGINEERING (ICMETE), 2016, : 105 - 109
[36] ON THE PATH TO THE AUTOMATIC RECOGNITION OF ACOUSTIC SPEECH SIGNALS
UNTERBERGER
ANGEWANDTE INFORMATIK, 1982, (09): : 445 - 450
[37] Improved automatic speech recognition system using sparse decomposition by basis pursuit with deep rectifier neural networks and compressed sensing recomposition of speech signals
Gavrilescu, Mihai
2014 10TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2014,
[38] Automatic Speech Recognition System Based on Wavelet Analysis
Ziolko, Mariusz
Galka, Jakub
Ziolko, Bartosz
Jadczyk, Tomasz
Skurzok, Dawid
Wicijowski, Jan
2010 IEEE FOURTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2010), 2010, : 450 - 451
[39] AUTOMATIC SPEECH RECOGNITION SYSTEM
RUSKE, G
UMSCHAU IN WISSENSCHAFT UND TECHNIK, 1979, 79 (18) : 566 - 572
[40] Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition
Sivasankaran, Sunit
Vincent, Emmanuel
Fohr, Dominique
28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 346 - 350

← 1 2 3 4 5 →