Text-available speaker recognition system for forensic applications

被引：3

作者：

Yu, Chengzhu ^{[1
]}

Zhang, Chunlei ^{[1
]}

Kelly, Finnian ^{[1
]}

Sangwan, Abhijeet ^{[1
]}

Hansen, John H. L. ^{[1
]}

机构：

[1] Univ Texas Dallas, CRSS, Richardson, TX 75083 USA

来源：

17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年

基金：

美国国家科学基金会;

关键词：

speaker recognition; forensic speaker recognition; VERIFICATION; HMM;

D O I：

10.21437/Interspeech.2016-1520

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper examines a text-available speaker recognition approach targeting scenarios where the transcripts of test utterances are either available or obtainable through manual transcription. Forensic speaker recognition is one of such applications where the human supervision can be expected. In our study, we extend an existing Deep Neural Network (DNN) vector-based speaker recognition system to effectively incorporate text information associated with test utterances. We first show experimentally that speaker recognition performance drops significantly if the DNN output posteriors are directly replaced with their target senone, obtained from force alignment. The cause of such performance drops can be attributed to the fact that forced alignment selects only the single most probable senone as their output, which is not desirable in a current speaker recognition framework. To resolve this problem, we propose a posterior mapping approach where the relationship between forced aligned senonoes and its corresponding DNN posteriors are modeled. By replacing DNN output posteriors with senone mapped posteriors, a robust text-available speaker recognition system can be obtained in mismatched environments. Experiments using the proposed approach are performed on the Aurora-4 dataset.

引用

页码：1844 / 1847

页数：4

共 50 条

[1] Speaker recognition in forensic applications
Majewski, W
[J]. ACUSTICA, 1996, 82 : S230 - S230
[2] Automatic Speaker Recognition for Mobile Forensic Applications
Algabri, Mohammed
Mathkour, Hassan
Bencherif, Mohamed A.
Alsulaiman, Mansour
Mekhtiche, Mohamed A.
[J]. MOBILE INFORMATION SYSTEMS, 2017, 2017
[3] From Speaker Recognition to Forensic Speaker Recognition
Drygajlo, Andrzej
[J]. BIOMETRIC AUTHENTICATION (BIOMET 2014), 2014, 8897 : 93 - 104
[4] Design of a Text Independent Speaker Recognition System
Ozaydin, Selma
[J]. 2017 INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTING TECHNOLOGIES AND APPLICATIONS (ICECTA), 2017, : 55 - 59
[5] A Quality-Aware Forensic Speaker Recognition System
Pop, Gheorghe
Draghicescu, Dragos
Burileanu, Dragos
[J]. ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2014, 17 (02): : 134 - 149
[6] Special issue on speaker recognition and its commercial and forensic applications
André-Obrecht, R
[J]. SPEECH COMMUNICATION, 2000, 31 (2-3) : 87 - 88
[7] Speaker Recognition System for Security Applications
Selvan, Karthik
Joseph, Aju
Babu, Anish K. K.
[J]. 2013 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2013, : 26 - 30
[8] Forensic automatic speaker recognition
Drygajlo, Andrzej
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2007, 24 (02) : 132 - 135
[9] Text Independent Automatic Speaker Recognition System in Malayalam
Selvan, Karthik
Babu, Anish K. K.
[J]. PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2013,
[10] A Text-dependent Speaker-Recognition System
Ishac, Dany
Abche, Antoine
Karam, Elie
Nassar, Georges
Callens, Dorothee
[J]. 2017 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC), 2017, : 147 - 152

← 1 2 3 4 5 →