Text-available speaker recognition system for forensic applications

Cited by: 3
Authors
Yu, Chengzhu [1]
Zhang, Chunlei [1]
Kelly, Finnian [1]
Sangwan, Abhijeet [1]
Hansen, John H. L. [1]
Affiliation
[1] Univ Texas Dallas, CRSS, Richardson, TX 75083 USA
Funding
National Science Foundation (USA);
Keywords
speaker recognition; forensic speaker recognition; VERIFICATION; HMM;
DOI
10.21437/Interspeech.2016-1520
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
This paper examines a text-available speaker recognition approach targeting scenarios where the transcripts of test utterances are either available or obtainable through manual transcription. Forensic speaker recognition is one such application, where human supervision can be expected. In our study, we extend an existing Deep Neural Network (DNN) i-vector-based speaker recognition system to effectively incorporate text information associated with test utterances. We first show experimentally that speaker recognition performance drops significantly if the DNN output posteriors are directly replaced with their target senones, obtained from forced alignment. This performance drop can be attributed to the fact that forced alignment selects only the single most probable senone as its output, which is not desirable in the current speaker recognition framework. To resolve this problem, we propose a posterior mapping approach in which the relationship between forced-aligned senones and their corresponding DNN posteriors is modeled. By replacing DNN output posteriors with senone-mapped posteriors, a robust text-available speaker recognition system can be obtained in mismatched environments. Experiments using the proposed approach are performed on the Aurora-4 dataset.
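The abstract does not say how the relationship between forced-aligned senones and DNN posteriors is modeled. As an illustration only, the sketch below assumes the simplest possible model: for each senone, average the DNN posterior vectors of all training frames that forced alignment assigns to that senone, then look up that average at test time in place of a one-hot label. The function names `fit_posterior_map` and `map_alignment`, the fallback for unseen senones, and the toy data are assumptions of this sketch, not details from the paper.

```python
import numpy as np


def fit_posterior_map(posteriors, aligned_senones, num_senones):
    """Estimate one soft posterior vector per forced-aligned senone.

    posteriors:      (T, S) array, per-frame DNN senone posteriors.
    aligned_senones: (T,) int array, senone id per frame from forced alignment.
    Returns an (S, S) array; row s is the mean DNN posterior over all frames
    aligned to senone s (an assumed mapping model, not the paper's).
    """
    mapping = np.zeros((num_senones, num_senones), dtype=float)
    # Accumulate posterior rows per aligned senone (handles repeated ids).
    np.add.at(mapping, aligned_senones, posteriors)
    counts = np.bincount(aligned_senones, minlength=num_senones)
    seen = counts > 0
    mapping[seen] /= counts[seen, None]
    # Assumption: senones never seen in training fall back to a one-hot row.
    mapping[~seen] = np.eye(num_senones)[~seen]
    return mapping


def map_alignment(aligned_senones, mapping):
    """Replace each forced-aligned senone id with its mapped posterior row."""
    return mapping[aligned_senones]


# Toy example with 3 senones and 3 training frames (made-up numbers).
train_post = np.array([[0.8, 0.2, 0.0],
                       [0.6, 0.4, 0.0],
                       [0.1, 0.9, 0.0]])
train_align = np.array([0, 0, 1])
senone_map = fit_posterior_map(train_post, train_align, num_senones=3)

# Test utterance: the alignment comes from the available transcript.
test_align = np.array([1, 0])
mapped_post = map_alignment(test_align, senone_map)
print(mapped_post)
```

In a DNN i-vector pipeline, the mapped posteriors would stand in for the DNN outputs when accumulating sufficient statistics, so each frame still spreads its weight over several senones rather than a single hard label.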
Pages: 1844-1847
Page count: 4