Using Approximate Entropy as a Speech Quality Measure for a Speaker Recognition System

被引:0
|
作者
Metzger, Richard A. [1 ]
Doherty, John F. [1 ]
Jenkins, David M. [2 ]
机构
[1] Penn State Univ, Dept Elect Engn, University Pk, PA 16802 USA
[2] Appl Res Lab, University Pk, PA 16802 USA
关键词
Approximate Entropy; Speaker Recognition; Voice Activity Detection; Speech Activity Detection; VOICE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we will show that Approximate Entropy (ApEn) can be used to detect high-quality speech frames in an otherwise distorted speech signal. By exploiting the property of quasi-periodicity in speech, ApEn is able to detect small aberrations in speech frames that would otherwise cause a decrease in the performance in an automatic speaker recognition (ASR) system. In addition, we obtain the statistics of ApEn values representative of clean speech and propose threshold bounds to obtain maximum recognition rates. When compared to other popular voice activity detector (VAD) algorithms, our simulation results showed that utilization of ApEn will outperform the other VADs in discerning clean speech from noisy speech. This ability to properly detect clean speech allows for a speaker recognition system to obtain a recognition rate close to 87%, which is close to the same performance of the system when noise is not present.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] SIMILARITY MEASURE FOR AUTOMATIC SPEECH AND SPEAKER RECOGNITION
    SCHROEDER, MR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1968, 43 (02): : 375 - +
  • [2] Approximate Entropy and Empirical Mode Decomposition for Improved Speaker Recognition
    Metzger, Richard A.
    Doherty, John F.
    Jenkins, David M.
    Hall, Donald L.
    [J]. ADVANCES IN DATA SCIENCE AND ADAPTIVE ANALYSIS, 2020, 12 (3-4)
  • [3] Speech Recognition Using Speaker Adaptation by System Parameter Transformation
    Hao, Ying
    Fang, Ditang
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 63 - 68
  • [4] Digital Speech Watermarking for Authenticity of Speaker in Speaker Recognition System
    Desai, Nihalkumar
    Tahilramani, Nikunj
    [J]. 2016 INTERNATIONAL CONFERENCE ON MICRO-ELECTRONICS AND TELECOMMUNICATION ENGINEERING (ICMETE), 2016, : 105 - 109
  • [5] APPROXIMATE ENTROPY AS A MEASURE OF SYSTEM-COMPLEXITY
    PINCUS, SM
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1991, 88 (06) : 2297 - 2301
  • [6] SPEAKER ADAPTATION IN A LIMITED SPEECH RECOGNITION SYSTEM
    MAKHOUL, J
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1971, C 20 (09) : 1057 - &
  • [7] A speaker-independent continuous speech recognition system using biomimetic pattern recognition
    Wang Shoujue
    Qin Hong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (03) : 460 - 462
  • [8] Facial Expression Recognition for Speaker Using Thermal Image Processing and Speech Recognition System
    Yoshitomi, Yasunari
    [J]. SELECTED TOPICS IN APPLIED COMPUTER SCIENCE, 2010, : 182 - +
  • [9] Continuous Speech Recognition and Identification of the Speaker System
    Guffanti, Diego
    Martinez, Danilo
    Paladines, Jose
    Sarmiento, Andrea
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY & SYSTEMS (ICITS 2018), 2018, 721 : 767 - 776
  • [10] From Speech Quality Measures to Speaker Recognition Performance
    Bello, Claudia
    Ribas, Dayana
    Calvo, Jose R.
    Ferrer, Carlos A.
    [J]. PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827 : 199 - 206