Spectral entropy and spectral shape based pre-quantization for real time speaker identification system

Cited by: 2
Authors
Sarkar G. [1]
Saha G. [1]
Affiliation
[1] Department of Electronics and Electrical Communication Engineering, IIT Kharagpur
Keywords
Kurtosis; Pre-quantization; Speaker identification; Spectral entropy
DOI
10.1007/s10772-010-9079-8
Abstract
Pre-processing is one of the vital steps in developing a robust and efficient recognition system. Better pre-processing not only aids data selection but also significantly reduces computational complexity. Further, an efficient frame selection technique can improve the overall performance of the system. Pre-quantization (PQ) is the technique of selecting fewer frames in the pre-processing stage to reduce the computational burden in the post-processing stages of speaker identification (SI). In this paper, we develop PQ techniques based on spectral entropy and spectral shape to pick suitable frames containing speaker-specific information, which varies from frame to frame depending on the spoken text and environmental conditions. The approach exploits the statistical properties of the distributions of speech frames at the pre-processing stage of speaker recognition. Our aim is not only to reduce the frame rate but also to keep identification accuracy reasonably high. We also analyze the robustness of the proposed techniques on noisy utterances. To establish the efficacy of the proposed methods, we use two databases, POLYCOST (telephone speech) and YOHO (microphone speech). © Springer Science+Business Media, LLC 2010.
Pages: 189-199
Page count: 10
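
The abstract describes selecting a subset of frames using spectral entropy and spectral shape (kurtosis appears among the keywords). The sketch below illustrates that general idea only; it is not the authors' algorithm. The frame length, hop size, Hann window, the "keep the lowest-entropy frames" rule, and every function and parameter name (`frame_signal`, `spectral_entropy`, `spectral_kurtosis`, `select_frames`, `keep_ratio`) are assumptions made for illustration, and the test signal is synthetic.

```python
import numpy as np

def frame_signal(x, frame_len=256, hop=128):
    """Split a 1-D signal into overlapping frames, one frame per row."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def _magnitude_spectrum(frames):
    """Magnitude spectrum of each Hann-windowed frame."""
    window = np.hanning(frames.shape[1])
    return np.abs(np.fft.rfft(frames * window, axis=1))

def spectral_entropy(frames, eps=1e-12):
    """Shannon entropy (bits) of each frame's normalized power spectrum."""
    power = _magnitude_spectrum(frames) ** 2
    pmf = power / (power.sum(axis=1, keepdims=True) + eps)
    return -(pmf * np.log2(pmf + eps)).sum(axis=1)

def spectral_kurtosis(frames, eps=1e-12):
    """Kurtosis of each frame's magnitude spectrum, a simple shape measure."""
    mag = _magnitude_spectrum(frames)
    centered = mag - mag.mean(axis=1, keepdims=True)
    var = (centered ** 2).mean(axis=1)
    return (centered ** 4).mean(axis=1) / (var ** 2 + eps)

def select_frames(frames, keep_ratio=0.5):
    """Assumed rule: keep the fraction of frames with the lowest entropy."""
    entropy = spectral_entropy(frames)
    n_keep = max(1, int(round(keep_ratio * len(frames))))
    return np.sort(np.argsort(entropy)[:n_keep])

# Synthetic demo: a 440 Hz tone followed by white noise, sampled at 8 kHz.
rng = np.random.default_rng(0)
t = np.arange(2048) / 8000.0
signal = np.concatenate([np.sin(2 * np.pi * 440.0 * t),
                         rng.standard_normal(2048)])
frames = frame_signal(signal)
kept = select_frames(frames, keep_ratio=0.4)
kurt = spectral_kurtosis(frames)
print("frames:", len(frames), "kept:", len(kept))
```

In this toy signal the tonal frames have a peaky spectrum (low entropy, high kurtosis) and the noise frames a flat one, so a threshold or ranking on either measure separates them. Real speech would need a tuned selection rule, and the kept frames would then go to feature extraction in place of the full frame sequence.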