Low Frequency Ultrasonic Voice Activity Detection using Convolutional Neural Networks

Cited by: 0
Authors
McLoughlin, Ian [1, 2]
Song, Yan [2 ]
Affiliations
[1] Univ Kent, Sch Comp Sci, Rochester, Kent, England
[2] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
Keywords
Voice activity detection; speech activity detection; ultrasonic speech; SaVAD
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Low frequency ultrasonic mouth state detection uses audio chirps reflected from the face in the region of the mouth to determine lip state: open, closed or partially open. The chirps are located in a frequency range just above the threshold of human hearing and are thus both inaudible and unaffected by interfering speech, yet can be produced and sensed using inexpensive equipment. To determine whether the mouth is open or closed, and hence form a measure of voice activity, this recently invented technique relies upon the difference in the reflected chirp caused by resonances introduced by the open or partially open mouth cavity. Voice activity is then inferred from lip state through patterns of mouth movement, in a similar way to video-based lip-reading technologies. This paper introduces a new metric based on spectrogram features extracted from the reflected chirp, with a convolutional neural network classification back-end, that yields excellent performance without needing the periodic resetting of the template closed-mouth reflection required by the original technique.
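As a rough illustration of the front end described in the abstract, the following minimal sketch generates a linear chirp just above the audible range and computes a magnitude spectrogram of it, the kind of time-frequency feature that could be passed to a classifier. Every number here is an assumption made for illustration and none comes from the paper: the 96 kHz sample rate, the 20 to 24 kHz sweep, the 10 ms duration, and the 128-sample Hann-windowed frames. The function names are hypothetical, the signal is the emitted chirp rather than a real reflection from the face, and the convolutional neural network back-end is omitted.

```python
import cmath
import math


def linear_chirp(f0, f1, duration, fs):
    """Samples of a linear chirp sweeping f0 -> f1 Hz over `duration` seconds."""
    n = int(round(duration * fs))
    k = (f1 - f0) / duration  # sweep rate in Hz per second
    return [
        math.sin(2 * math.pi * (f0 * t + 0.5 * k * t * t))
        for t in (i / fs for i in range(n))
    ]


def spectrogram(signal, frame_len, hop):
    """Magnitude spectrogram from a Hann-windowed naive DFT, one row per frame."""
    window = [
        0.5 - 0.5 * math.cos(2 * math.pi * i / (frame_len - 1))
        for i in range(frame_len)
    ]
    bins = frame_len // 2 + 1  # non-negative frequencies only
    rows = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        seg = [signal[start + i] * window[i] for i in range(frame_len)]
        rows.append([
            abs(sum(
                seg[n] * cmath.exp(-2j * math.pi * b * n / frame_len)
                for n in range(frame_len)
            ))
            for b in range(bins)
        ])
    return rows


def peak_frequencies(spec, frame_len, fs):
    """Frequency in Hz of the strongest bin in each spectrogram frame."""
    return [
        max(range(len(row)), key=row.__getitem__) * fs / frame_len
        for row in spec
    ]


# Assumed, illustrative parameters (not taken from the paper).
FS = 96_000               # sample rate in Hz
F0, F1 = 20_000, 24_000   # sweep limits in Hz, just above typical hearing range
FRAME_LEN, HOP = 128, 64  # analysis frame length and hop in samples

chirp = linear_chirp(F0, F1, 0.01, FS)
spec = spectrogram(chirp, FRAME_LEN, HOP)
peaks = peak_frequencies(spec, FRAME_LEN, FS)
```

In a real system the spectrogram would be computed from the microphone recording of the reflected chirp, and a fast Fourier transform routine would replace the naive DFT used here to keep the sketch dependency-free.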
Pages: 2400-2404
Number of pages: 5