Noise-Robust Detection of Whispering in Telephone Calls Using Deep Neural Networks

被引:0
|
作者
Diment, Aleksandr [1 ]
Parviainen, Mikko [1 ]
Virtanen, Tuomas [1 ]
Zelov, Roman [2 ]
Glasman, Alex [2 ]
机构
[1] Tampere Univ Technol, Dept Signal Proc, Korkeakoulunkatu 1, Tampere 33720, Finland
[2] Behavox, Leve139,One Canada Sq, London E14 5AB, England
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Detection of whispered speech in the presence of high levels of background noise has applications in fraudulent behaviour recognition. For instance, it can serve as an indicator of possible insider trading. We propose a deep neural network (DNN) -based whispering detection system, which operates on both magnitude and phase features, including the group delay feature from all-pole models (APGD). We show that the APGD feature outperforms the conventional ones. Trained and evaluated on the collected diverse dataset of whispered and normal speech with emulated phone line distortions and significant amounts of added background noise, the proposed system performs with accuracies as high as 91.8%.
引用
收藏
页码:2310 / 2314
页数:5
相关论文
共 50 条
  • [1] Noise-Robust Deep Spiking Neural Networks with Temporal Information
    Park, Seongsik
    Lee, Dongjin
    Yoon, Sungroh
    [J]. 2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 373 - 378
  • [2] A Spectral Masking Approach to Noise-Robust Speech Recognition Using Deep Neural Networks
    Li, Bo
    Sim, Khe Chai
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (08) : 1296 - 1305
  • [3] Noise-Robust Hardware Implementation of Neural Networks
    Canals, Vincent
    Alomar, Miquel L.
    Morro, Antoni
    Oliver, Antoni
    Rossello, Josep L.
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [4] Training Noise-Robust Deep Neural Networks via Meta-Learning
    Wang, Zhen
    Hu, Guosheng
    Hu, Qinghua
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4523 - 4532
  • [5] EXPLOITING SYNCHRONY SPECTRA AND DEEP NEURAL NETWORKS FOR NOISE-ROBUST AUTOMATIC SPEECH RECOGNITION
    Ma, Ning
    Marxer, Ricard
    Barker, Jon
    Brown, Guy J.
    [J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 490 - 495
  • [6] A NOISE-ROBUST SIGNAL PROCESSING STRATEGY FOR COCHLEAR IMPLANTS USING NEURAL NETWORKS
    Zheng, Nengheng
    Shi, Yupeng
    Kang, Yuyong
    Meng, Qinglin
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 8343 - 8347
  • [7] Noise-robust registration of microscopic height data using convolutional neural networks
    Siemens, Stefan
    Kaestner, Markus
    Reithmeier, Eduard
    [J]. SPIE FUTURE SENSING TECHNOLOGIES 2023, 2023, 12327
  • [8] Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System using Deep Recurrent Neural Networks
    Valentini-Botinhao, Cassia
    Wang, Xin
    Takaki, Shinji
    Yamagishi, Junichi
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 352 - 356
  • [9] Deep Maxout Networks Applied to Noise-Robust Speech Recognition
    de-la-Calle-Silos, F.
    Gallardo-Antolin, A.
    Pelaez-Moreno, C.
    [J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2014, 2014, 8854 : 109 - 118
  • [10] An Efficient Noise-Robust Automatic Speech Recognition System using Artificial Neural Networks
    Gupta, Santosh
    Bhurchandi, Kishor M.
    Keskar, Avinash G.
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), VOL. 1, 2016, : 1873 - 1877