Low Frequency Ultrasonic Voice Activity Detection using Convolutional Neural Networks

被引:0
|
作者
McLoughlin, Ian [1 ,2 ]
Song, Yan [2 ]
机构
[1] Univ Kent, Sch Comp Sci, Rochester, Kent, England
[2] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
关键词
Voice activity detection; speech activity detection; ultrasonic speech; SaVAD;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Low frequency ultrasonic mouth state detection uses reflected audio chirps from the face in the region of the mouth to determine lip state, whether open, closed or partially open. The chirps are located in a frequency range just above the threshold of human hearing and are thus both inaudible as well as unaffected by interfering speech, yet can be produced and sensed using inexpensive equipment. To determine mouth open or closed state, and hence form a measure of voice activity detection, this recently invented technique relies upon the difference in the reflected chirp caused by resonances introduced by the open or partially open mouth cavity. Voice activity is then inferred from lip state through patterns of mouth movement, in a similar way to video-based lip-reading technologies. This paper introduces a new metric based on spectrogram features extracted from the reflected chirp, with a convolutional neural network classification back-end, that yields excellent performance without needing the periodic resetting of the template closed-mouth reflection required by the original technique.
引用
收藏
页码:2400 / 2404
页数:5
相关论文
共 50 条
  • [1] Convolutional Neural Networks for Pathological Voice Detection
    Wu, Huiyi
    Soraghan, John
    Lowit, Anja
    Di Caterina, Gaetano
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 4784 - 4787
  • [2] Detection of Flaws in Concrete Using Ultrasonic Tomography and Convolutional Neural Networks
    Slonski, Marek
    Schabowicz, Krzysztof
    Krawczyk, Ewa
    MATERIALS, 2020, 13 (07)
  • [3] Singing Voice Detection Based on Convolutional Neural Networks
    Huang, Hong-Ming
    Chen, Woei-Kae
    Liu, Chien-Hung
    You, Shingchern D.
    2018 7TH IEEE INTERNATIONAL SYMPOSIUM ON NEXT-GENERATION ELECTRONICS (ISNE), 2018, : 223 - 226
  • [4] Radio Frequency Classification and Anomaly Detection using Convolutional Neural Networks
    Conn, Marvin A.
    Josyula, Darsana
    2019 IEEE RADAR CONFERENCE (RADARCONF), 2019,
  • [5] RECOGNITION OF SPOOFED VOICE USING CONVOLUTIONAL NEURAL NETWORKS
    Liang, Huixin
    Lin, Xiaodan
    Zhang, Qiong
    Kang, Xiangui
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 293 - 297
  • [6] Deep Neural Networks for Voice Activity Detection
    Mihalache, Serban
    Ivanov, Ioan-Alexandru
    Burileanu, Dragos
    2021 44TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2021, : 191 - 194
  • [7] RECURRENT NEURAL NETWORKS FOR VOICE ACTIVITY DETECTION
    Hughes, Thad
    Mierle, Keir
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7378 - 7382
  • [8] Joint Detection and Classification of Singing Voice Melody Using Convolutional Recurrent Neural Networks
    Kum, Sangeun
    Nam, Juhan
    APPLIED SCIENCES-BASEL, 2019, 9 (07):
  • [9] Radio Frequency Interference Detection for SMAP Radiometer Using Convolutional Neural Networks
    Alam, Ahmed Manavi
    Kurum, Mehmet
    Gurbuz, Ali C.
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 10099 - 10112
  • [10] Detection of Arrhythmia Using Convolutional Neural Networks
    Greeshma, Burla
    Sireesha, Moturi
    Rao, S. N. Thirumala
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON SUSTAINABLE EXPERT SYSTEMS (ICSES 2021), 2022, 351 : 21 - 30