Low Frequency Ultrasonic Voice Activity Detection using Convolutional Neural Networks

被引:0
|
作者
McLoughlin, Ian [1 ,2 ]
Song, Yan [2 ]
机构
[1] Univ Kent, Sch Comp Sci, Rochester, Kent, England
[2] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
关键词
Voice activity detection; speech activity detection; ultrasonic speech; SaVAD;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Low frequency ultrasonic mouth state detection uses reflected audio chirps from the face in the region of the mouth to determine lip state, whether open, closed or partially open. The chirps are located in a frequency range just above the threshold of human hearing and are thus both inaudible as well as unaffected by interfering speech, yet can be produced and sensed using inexpensive equipment. To determine mouth open or closed state, and hence form a measure of voice activity detection, this recently invented technique relies upon the difference in the reflected chirp caused by resonances introduced by the open or partially open mouth cavity. Voice activity is then inferred from lip state through patterns of mouth movement, in a similar way to video-based lip-reading technologies. This paper introduces a new metric based on spectrogram features extracted from the reflected chirp, with a convolutional neural network classification back-end, that yields excellent performance without needing the periodic resetting of the template closed-mouth reflection required by the original technique.
引用
收藏
页码:2400 / 2404
页数:5
相关论文
共 50 条
  • [31] Exploring Channel Properties to Improve Singing Voice Detection with Convolutional Neural Networks
    Gui, Wenming
    Li, Yukun
    Zang, Xian
    Zhang, Jinglan
    APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [32] Deepfake detection using convolutional vision transformers and convolutional neural networks
    Soudy, Ahmed Hatem
    Sayed, Omnia
    Tag-Elser, Hala
    Ragab, Rewaa
    Mohsen, Sohaila
    Mostafa, Tarek
    Abohany, Amr A.
    Slim, Salwa O.
    Neural Computing and Applications, 2024, 36 (31) : 19759 - 19775
  • [33] A Model-based Voice Activity Detection Algorithm using probabilistic neural networks
    Farsinejad, M.
    Mohammadi, M.
    Nasersharif, B.
    Akbari, A.
    2008 14TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS, (APCC), VOLS 1 AND 2, 2008, : 942 - 945
  • [34] DENOISING DEEP NEURAL NETWORKS BASED VOICE ACTIVITY DETECTION
    Zhang, Xiao-Lei
    Wu, Ji
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 853 - 857
  • [35] Voice activity detection based on deep neural networks and Viterbi
    Bai, Liang
    Zhang, Zhen
    Hu, Jun
    2017 2ND INTERNATIONAL SEMINAR ON ADVANCES IN MATERIALS SCIENCE AND ENGINEERING, 2017, 231
  • [36] A Comparison of Boosted Deep Neural Networks for Voice Activity Detection
    Krishnakumar, Harshit
    Williamson, Donald S.
    2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [37] SPIKING NEURAL NETWORKS TRAINED WITH BACKPROPAGATION FOR LOW POWER NEUROMORPHIC IMPLEMENTATION OF VOICE ACTIVITY DETECTION
    Martinelli, Flavio
    Dellaferrera, Giorgia
    Mainar, Pablo
    Cernak, Milos
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8544 - 8548
  • [38] Detection of Cattle Using Drones and Convolutional Neural Networks
    Rivas, Alberto
    Chamoso, Pablo
    Gonzalez-Briones, Alfonso
    Manuel Corchado, Juan
    SENSORS, 2018, 18 (07)
  • [39] Melanoma Detection Using Regular Convolutional Neural Networks
    Abu Ali, Aya
    Al-Marzouqi, Hasan
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTING TECHNOLOGIES AND APPLICATIONS (ICECTA), 2017, : 363 - 367
  • [40] QR Code Detection Using Convolutional Neural Networks
    Chou, Tzu-Han
    Ho, Chuan-Sheng
    Kuo, Yan-Fu
    2015 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND INTELLIGENT SYSTEMS (ARIS), 2015,