A real-time text-independent speaker identification system

被引：9

作者：

Cordella, LP ^{[1
]}

Foggia, P ^{[1
]}

Sansone, C ^{[1
]}

Vento, M ^{[1
]}

机构：

[1] Univ Naples Federico II, Dipartimento Informat & Sistemist, I-80125 Naples, Italy

来源：

12TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS | 2003年

关键词：

D O I：

10.1109/ICIAP.2003.1234121

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The paper presents a real-time speaker identification system based on the analysis of the audio track of a video stream. The system has been employed in the context of automatic video segmentation. It uses features evaluated in both domains of time and frequency. Their combined use significantly improved the performance of the system. Experiments have been carried on a database extracted from over one hour of television news, including 10 speakers. The obtained results confirm the effectiveness of the approach, showing an error rate less then 1% when the time interval used for identifying a speaker is about 1.5 seconds.

引用

页码：632 / 637

页数：6

共 50 条

[31] Combining Dynamic Features with MFCC for Text-independent Speaker Identification
Chaudhari, Amol
Rahulkar, Amol
Dhonde, S. B.
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (ICIP), 2015, : 160 - 164
[32] Text-independent speaker identification based on support vector machines
He, Xin
Liu, Chongqing
Li, Jiegu
[J]. Jisuanji Gongcheng/Computer Engineering, 2000, 26 (06): : 61 - 63
[33] Text-independent speaker identification using robust statistics estimation
El Ayadi, Moataz
Hassan, Abdel-Karim S. O.
Abdel-Naby, Ahmed
Elgendy, Omar A.
[J]. SPEECH COMMUNICATION, 2017, 92 : 52 - 63
[34] Wavelet entropy and neural network for text-independent speaker identification
Daqrouq, Khaled
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2011, 24 (05) : 796 - 802
[35] Application of time-frequency principal component analysis to text-independent speaker identification
Magrin-Chagnolleau, I
Durou, G
Bimbot, F
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (06): : 371 - 378
[36] HCRF-UBM approach for text-independent speaker identification
Hong, Wei-Tyng
[J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2012, 64 (05) : 1120 - 1127
[37] Text-Independent Speaker Identification by Combining MFCC and MVA Features
Korba, Mohamed Cherif Amara
Bourouba, Houcine
Rafik, Djemili
[J]. 2018 INTERNATIONAL CONFERENCE ON SIGNAL, IMAGE, VISION AND THEIR APPLICATIONS (SIVA), 2018,
[38] A two-level classifier for text-independent speaker identification
Hadjitodorov, S
Boyanov, B
Dalakchieva, N
[J]. SPEECH COMMUNICATION, 1997, 21 (03) : 209 - 217
[39] Robust text-independent speaker identification over telephone channels
Murthy, HA
Beaufays, F
Heck, LP
Weintraub, M
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (05): : 554 - 568
[40] Text-independent speaker identification based on spectral weighting functions
Ma, JY
Gao, W
[J]. AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 267 - 272

← 1 2 3 4 5 →