Efficient speaker identification using spectral entropy

被引:0
|
作者
Fernando Luque-Suárez
Antonio Camarena-Ibarrola
Edgar Chávez
机构
[1] CICESE,
[2] Universidad Michoacana,undefined
来源
关键词
Speaker recognition; Speaker identification; Entropygrams;
D O I
暂无
中图分类号
学科分类号
摘要
In voice recognition, the two main problems are speech recognition (what was said), and speaker recognition (who was speaking). The usual method for speaker recognition is to postulate a model where the speaker identity corresponds to the parameters of the model, which estimation could be time-consuming when the number of candidate speakers is large. In this paper, we model the speaker as a high dimensional point cloud of entropy-based features, extracted from the speech signal. The method allows indexing, and hence it can manage large databases. We experimentally assessed the quality of the identification with a publicly available database formed by extracting audio from a collection of YouTube videos of 1,000 different speakers. With 20 second audio excerpts, we were able to identify a speaker with 97% accuracy when the recording environment is not controlled, and with 99% accuracy for controlled recording environments.
引用
收藏
页码:16803 / 16815
页数:12
相关论文
共 50 条
  • [1] Efficient speaker identification using spectral entropy
    Luque-Suarez, Fernando
    Camarena-Ibarrola, Antonio
    Chavez, Edgar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (12) : 16803 - 16815
  • [2] Speaker Identification through Spectral Entropy Analysis
    Camarena-Ibarrola, Antonio
    Luque, Fernando
    Chavez, Edgar
    2017 IEEE INTERNATIONAL AUTUMN MEETING ON POWER, ELECTRONICS AND COMPUTING (ROPEC), 2017,
  • [3] EFFICIENT SPEAKER IDENTIFICATION USING DISTRIBUTIONAL SPEAKER MODEL CLUSTERING
    Apsingekar, Vijendra Raj
    De Leon, Phillip L.
    2008 42ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-4, 2008, : 1260 - 1264
  • [4] Efficient speaker recognition using approximated cross entropy (ACE)
    Aronowitz, Hagai
    Burshtein, David
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 2033 - 2043
  • [5] Spectral entropy and spectral shape based pre-quantization for real time speaker identification system
    Sarkar G.
    Saha G.
    International Journal of Speech Technology, 2010, 13 (04) : 189 - 199
  • [6] Speaker Identification using Wavelet Shannon Entropy and Probabilistic Neural Network
    Lei, Lei
    She, Kun
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 566 - 571
  • [7] An expert system for speaker identification using adaptive wavelet sure entropy
    Avci, Derya
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 6295 - 6300
  • [8] Efficient Window for Monolingual and Crosslingual Speaker Identification using MFCC
    Nagaraja, B. G.
    Jayanna, H. S.
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2013,
  • [9] A modified speaker clustering method for efficient speaker identification
    Yan, JiaChang
    Wang, Lei
    2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 2, 2014,
  • [10] An application of fuzzy entropy clustering in speaker identification
    Tran, D
    Wagner, M
    PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : 215 - 218