Efficient speaker identification using spectral entropy

Cited by: 7
Authors
Luque-Suarez, Fernando [1]
Camarena-Ibarrola, Antonio [2]
Chavez, Edgar [1]
Affiliations
[1] CICESE, Ensenada, Baja California, Mexico
[2] Univ Michoacana, Morelia, Michoacan, Mexico
Keywords
Speaker recognition; Speaker identification; Entropygrams; Recognition
DOI
10.1007/s11042-018-7035-9
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Voice recognition comprises two main problems: speech recognition (what was said) and speaker recognition (who was speaking). The usual approach to speaker recognition is to postulate a model whose parameters correspond to the speaker identity; estimating these parameters can be time-consuming when the number of candidate speakers is large. In this paper, we model the speaker as a high-dimensional point cloud of entropy-based features extracted from the speech signal. The method allows indexing, and hence it can manage large databases. We experimentally assessed the quality of the identification on a publicly available database formed by extracting audio from a collection of YouTube videos of 1,000 different speakers. With 20-second audio excerpts, we were able to identify a speaker with 97% accuracy when the recording environment is not controlled, and with 99% accuracy for controlled recording environments.
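The abstract does not say how the entropy-based features are computed. As a rough illustration only, the sketch below computes a generic per-band spectral entropy for each frame of a signal, giving one feature vector (one point of a "point cloud") per frame. The function name, frame length, hop size, number of bands, and linear band spacing are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np

def spectral_entropy_frames(signal, frame_len=512, hop=256, n_bands=8):
    """Return an (n_frames, n_bands) array of per-band spectral entropies in bits.

    Hypothetical helper for illustration; not the authors' feature extractor.
    """
    signal = np.asarray(signal, dtype=float)
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    # Split the rFFT bins into n_bands contiguous bands (linear spacing is an assumption).
    edges = np.linspace(0, frame_len // 2 + 1, n_bands + 1).astype(int)
    out = np.zeros((n_frames, n_bands))
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len] * window
        power = np.abs(np.fft.rfft(frame)) ** 2
        for b in range(n_bands):
            band = power[edges[b]:edges[b + 1]]
            total = band.sum()
            if total <= 0:
                continue  # silent band: entropy stays at 0
            p = band / total          # normalize power to a probability mass function
            p = p[p > 0]              # skip zero bins to avoid log2(0)
            out[i, b] = -(p * np.log2(p)).sum()
    return out

# Usage: white noise spreads energy across a band (high entropy),
# while a pure tone concentrates it in a few bins (low entropy).
rng = np.random.default_rng(0)
noise = rng.standard_normal(16000)
t = np.arange(16000) / 16000.0
tone = np.sin(2 * np.pi * 1500.0 * t)  # 1500 Hz falls in the second band at 16 kHz
noise_feats = spectral_entropy_frames(noise)
tone_feats = spectral_entropy_frames(tone)
```

Each row of the returned array could then be stored in a nearest-neighbor index, which is one plausible reading of the indexing mentioned in the abstract; the paper itself should be consulted for the actual procedure.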
Pages: 16803 - 16815
Page count: 13
Related papers
50 in total
  • [1] Efficient speaker identification using spectral entropy
    Fernando Luque-Suárez
    Antonio Camarena-Ibarrola
    Edgar Chávez
    Multimedia Tools and Applications, 2019, 78 : 16803 - 16815
  • [2] Speaker Identification through Spectral Entropy Analysis
    Camarena-Ibarrola, Antonio
    Luque, Fernando
    Chavez, Edgar
    2017 IEEE INTERNATIONAL AUTUMN MEETING ON POWER, ELECTRONICS AND COMPUTING (ROPEC), 2017
  • [3] EFFICIENT SPEAKER IDENTIFICATION USING DISTRIBUTIONAL SPEAKER MODEL CLUSTERING
    Apsingekar, Vijendra Raj
    De Leon, Phillip L.
    2008 42ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-4, 2008 : 1260 - 1264
  • [4] Efficient speaker recognition using approximated cross entropy (ACE)
    Aronowitz, Hagai
    Burshtein, David
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07) : 2033 - 2043
  • [5] Spectral entropy and spectral shape based pre-quantization for real time speaker identification system
    Sarkar G.
    Saha G.
    International Journal of Speech Technology, 2010, 13 (04) : 189 - 199
  • [6] Speaker Identification using Wavelet Shannon Entropy and Probabilistic Neural Network
    Lei, Lei
    She, Kun
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016 : 566 - 571
  • [7] An expert system for speaker identification using adaptive wavelet sure entropy
    Avci, Derya
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 6295 - 6300
  • [8] Efficient Window for Monolingual and Crosslingual Speaker Identification using MFCC
    Nagaraja, B. G.
    Jayanna, H. S.
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2013
  • [9] A modified speaker clustering method for efficient speaker identification
    Yan, JiaChang
    Wang, Lei
    2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 2, 2014
  • [10] An application of fuzzy entropy clustering in speaker identification
    Tran, D
    Wagner, M
    PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000 : 215 - 218