Data strategies in forensic automatic speaker comparison

Cited by: 1
Authors
van der Vloed, David [1]
Affiliations
[1] Netherlands Forens Inst, Laan Ypenburg 6, NL-2497 GB The Hague, Netherlands
Keywords
Automatic speaker recognition; Forensic speaker comparison; Forensic voice comparison; Forensic casework; Representative data; RECOGNITION;
DOI
10.1016/j.forsciint.2023.111790
Chinese Library Classification
DF [Law]; D9 [Law]; R [Medicine and Health];
Discipline classification code
0301 ; 10 ;
Abstract
Automatic speaker recognition (ASR) is a method used in forensic speaker comparison (FSC) casework. It requires collections of audio data that are representative of the case audio, both to perform reference normalization and to train a score-to-LR (likelihood ratio) function. Each of those purposes requires audio from a certain minimum number of speakers to obtain relatively stable ASR performance. Although no hard cut-off can be set, for this work the minimum was chosen to be 30 speakers for each purpose, and thus 60 for both. A lack of representative data from that many speakers, and uncertainty about what exactly constitutes representative data, are major reasons for not employing ASR in FSC. An experiment was carried out simulating a situation in which a practitioner has only 30 speakers available. Several data strategies were tried to handle the lack of data: leaving out reference normalization; splitting the 30 speakers into two groups of 15 (ignoring the minimum of 30); and a leave-1-or-2-out strategy in which all 30 speakers are used for both reference normalization and calibration. These were compared to a baseline in which the practitioner does have the required 60 speakers. The leave-1-or-2-out strategy with 30 speakers performs on par with the baseline, and extending that strategy to the full 60 speakers even outperforms the baseline. This shows that a strategy that halves the data need is viable, lessening the data requirements for ASR in FSC and making the use of ASR possible in more cases.
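The abstract does not spell out how the leave 1 or 2 out strategy partitions the speakers, so the following Python sketch shows only one plausible reading and is not the author's implementation. It assumes that for a same-speaker comparison the one speaker involved is held out, that for a different-speaker comparison the two speakers involved are held out, and that the remaining speakers serve as the pool for both reference normalization and calibration. The function name `leave_out_splits` and the speaker labels are hypothetical.

```python
from itertools import combinations


def leave_out_splits(speakers):
    """Yield (trial, remaining) pairs for a leave-1-or-2-out scheme.

    Assumption (not confirmed by the abstract): the speakers taking part
    in a trial are excluded from the pool used for reference
    normalization and calibration of that trial.
    """
    speakers = list(speakers)
    # Same-speaker trials: one speaker is left out of the pool.
    for s in speakers:
        yield (s, s), [r for r in speakers if r != s]
    # Different-speaker trials: two speakers are left out of the pool.
    for a, b in combinations(speakers, 2):
        yield (a, b), [r for r in speakers if r not in (a, b)]


# Hypothetical labels for the 30 available speakers.
speakers = [f"spk{i:02d}" for i in range(30)]
splits = list(leave_out_splits(speakers))

same = [(t, pool) for t, pool in splits if t[0] == t[1]]
diff = [(t, pool) for t, pool in splits if t[0] != t[1]]
print(len(same), len(diff))
```

Under this reading every trial keeps 28 or 29 of the 30 speakers in its pool, which is how a single set of 30 could stand in for the two separate sets of 30 used in the baseline.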
Pages: 10