A real-time text-independent speaker identification system

Cited by: 9
Authors
Cordella, LP [1]
Foggia, P [1]
Sansone, C [1]
Vento, M [1]
Affiliation
[1] Univ Naples Federico II, Dipartimento Informat & Sistemist, I-80125 Naples, Italy
DOI
10.1109/ICIAP.2003.1234121
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The paper presents a real-time speaker identification system based on the analysis of the audio track of a video stream. The system has been employed in the context of automatic video segmentation. It uses features evaluated in both the time and frequency domains; their combined use significantly improved the performance of the system. Experiments were carried out on a database extracted from over one hour of television news, including 10 speakers. The results confirm the effectiveness of the approach, showing an error rate of less than 1% when the time interval used for identifying a speaker is about 1.5 seconds.
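The abstract does not say which time-domain and frequency-domain features the system uses. As a rough illustration of what combining the two domains can look like, the sketch below computes two common time-domain descriptors (zero-crossing rate and short-time energy) and one frequency-domain descriptor (spectral centroid) for a single audio frame. The feature choice, the function names, the 8 kHz sample rate, and the 256-sample frame are all assumptions made for this example; none of them are taken from the paper.

```python
import cmath
import math


def zero_crossing_rate(frame):
    """Time domain: fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def short_time_energy(frame):
    """Time domain: mean squared amplitude of the frame."""
    return sum(x * x for x in frame) / len(frame)


def spectral_centroid(frame, sample_rate):
    """Frequency domain: magnitude-weighted mean frequency (naive DFT)."""
    n = len(frame)
    magnitudes = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(frame))
        magnitudes.append(abs(coeff))
    total = sum(magnitudes)
    if total == 0:
        return 0.0
    return sum(k * sample_rate / n * m for k, m in enumerate(magnitudes)) / total


def frame_features(frame, sample_rate):
    """Concatenate time- and frequency-domain descriptors into one vector."""
    return (
        zero_crossing_rate(frame),
        short_time_energy(frame),
        spectral_centroid(frame, sample_rate),
    )


# Hypothetical input: a 1 kHz sine sampled at 8 kHz, 256 samples long.
sample_rate = 8000
frame = [math.sin(2 * math.pi * 1000 * i / sample_rate) for i in range(256)]
zcr, energy, centroid = frame_features(frame, sample_rate)
```

In a setting like the one described, vectors of this kind would be computed over successive short frames and pooled across the identification interval (about 1.5 seconds in the reported experiments) before being passed to a classifier; the classifier itself is not described in the abstract and is left out here.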
Pages: 632-637
Page count: 6