SpeechToText: An open-source software for automatic detection and transcription of voice recordings in digital forensics

Cited by: 6
Authors
Negra, Miguel [1, 2]
Domingues, Patricio [1, 2, 3]
Affiliations
[1] Polytech Inst Leiria, Sch Technol & Management, Leiria, Portugal
[2] Comp Sci & Commun Res Ctr, Leiria, Portugal
[3] Inst Telecomunicacoes, Aveiro, Portugal
Keywords
Voice recordings; Automatic speech recognition; Automatic speech transcription; Digital forensics; Android applications; RECOGNITION;
DOI
10.1016/j.fsidi.2021.301223
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Voice is the most natural way for humans to communicate with each other and, more recently, to interact with voice-controlled digital machines. Although text is predominant on digital platforms, voice and video are becoming increasingly important, with communication applications supporting voice messages and videos. This matters for digital forensic examinations, as voice content can hold evidence relevant to an investigation. In this paper, we present the open-source SpeechToText software, which relies on state-of-the-art Voice Activity Detection (VAD) and Automatic Speech Recognition (ASR) modules to detect voice content and then transcribe it to text. This allows voice content to be integrated into the regular flow of a digital forensic investigation, with transcribed audio indexed by text search engines. Although SpeechToText can be run independently, it also provides a Jython-based software module for the well-known Autopsy software. The paper also analyzes the availability, storage location and audio format of voice-recorded content in 14 popular Android applications featuring voice recordings. SpeechToText achieves 100% accuracy for detecting voice in unencrypted audio/video files, a word error rate (WER) of 27.2% when transcribing English voice messages by non-native speakers and a WER of 7.80% for the test-clean set of LibriSpeech. It achieves a real-time factor of 0.15 for the detection and transcription process on a mid-range laptop, meaning that 1 min of speech is processed in roughly 9 s. © 2021 Elsevier Ltd. All rights reserved.
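The two metrics quoted in the abstract can be illustrated with a minimal Python sketch. It assumes the conventional definitions: WER as the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words, and real-time factor as processing time divided by audio duration. The function names and the sample sentences are hypothetical, and this is not the evaluation code used by the SpeechToText authors.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Processing time divided by the duration of the processed audio."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


if __name__ == "__main__":
    # Hypothetical transcript pair: one substitution and one deletion.
    wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
    print(f"WER: {wer:.1%}")

    # The abstract's figure: about 9 s of processing for 60 s of speech.
    print(f"RTF: {real_time_factor(9.0, 60.0):.2f}")
```

Under these definitions, a real-time factor below 1 means processing runs faster than playback, which is consistent with the abstract's statement that a factor of 0.15 corresponds to roughly 9 s per minute of speech.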
Pages: 10