A Voice Activity Detection Algorithm Using Sparse Non-negative Matrix Factorization-based Model Learning in Spectro-Temporal Domain

被引:0
|
作者
Mavaddati, S. [1 ]
机构
[1] Univ Mazandaran, Fac Engn & Technol, Babolsar, Iran
来源
INTERNATIONAL JOURNAL OF ENGINEERING | 2023年 / 36卷 / 08期
关键词
Voice Activity Detector; Spectro-temporal Domain; Spectro-temporal Sparse Structured Principal Component; Analysis; Sparse Non-negative Matrix Factorization; RECOGNITION; NOISE;
D O I
10.5829/ije.2023.36.08b.08
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Voice activity detectors are presented to extract silence/speech segments of the speech signal to eliminate different background noise signals. A novel voice activity detector is proposed in this paper using spectro-temporal features extracted from the auditory model of the speech signal. After extracting the scale, rate, and frequency features from this feature space, a sparse structured principal component analysis algorithm is used to consider the basic components of these features and reduce the dimension of learning data. Then these feature vectors are employed to learn the models by the sparse non-negative matrix factorization algorithm. The model learning procedure is performed to represent each feature vector with a proper sparse rate based on the selected atoms. Voice activity detection of the input frames is performed by computing the energy of the sparse representation for each input frame over the composite model. If the calculated energy exceeds a specified threshold, it indicates that the input frame has a structure similar to the atoms of the learned models and concludes that the observed frame has voice content. The results of the proposed detector were compared with other baseline methods and classifiers in this processing field. These results in the presence of stationary, non-stationary and periodic noises were investigated and they are shown that the proposed method based on model learning with spectro-temporal features can correctly detect the silence/speech activities.doi: 10.5829/ije.2023.36.08b.08
引用
收藏
页码:1478 / 1488
页数:11
相关论文
共 50 条
  • [41] Nonlinear Non-negative Matrix Factorization using Deep Learning
    Zhang, Hui
    Liu, Huaping
    Song, Rui
    Sun, Fuchun
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 477 - 482
  • [42] AN APPROACH TO DOUBLETALK DETECTION BASED ON NON-NEGATIVE MATRIX FACTORIZATION
    Cahill, Niall
    Lawlor, Robert
    ISSPIT: 8TH IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2008, : 497 - 501
  • [43] CRATER DETECTION BASED ON LOCAL NON-NEGATIVE MATRIX FACTORIZATION
    Li, Hui
    Yin, Jihao
    Gu, Zetong
    2014 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2014,
  • [44] Non-negative matrix factorization-based time-frequency feature extraction of voice signal for Parkinson?s disease prediction
    Karan, Biswajit
    Sahu, Sitanshu Sekhar
    Orozco-Arroyave, Juan Rafael
    Mahto, Kartik
    COMPUTER SPEECH AND LANGUAGE, 2021, 69
  • [45] Double feature selection algorithm based on low-rank sparse non-negative matrix factorization
    Ronghua Shang
    Jiuzheng Song
    Licheng Jiao
    Yangyang Li
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 1891 - 1908
  • [46] VOICE CONVERSION BASED ON NON-NEGATIVE MATRIX FACTORIZATION USING PHONEME-CATEGORIZED DICTIONARY
    Aihara, Ryo
    Nakashika, Toru
    Takiguchi, Tetsuya
    Ariki, Yasuo
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [47] MULTIMODAL VOICE CONVERSION USING NON-NEGATIVE MATRIX FACTORIZATION IN NOISY ENVIRONMENTS
    Masaka, Kenta
    Aihara, Ryo
    Takiguchi, Tetsuya
    Ariki, Yasuo
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [48] Significance of platelet adhesion-related genes in colon cancer based on non-negative matrix factorization-based clustering algorithm
    Chi, Xiao-jv
    Song, Yi-bei
    Liu, Deng-he
    Wei, Li-qiang
    An, Xin
    Feng, Zi-zhen
    Lan, Xiao-hua
    Lan, Dong
    Huang, Chao
    DIGITAL HEALTH, 2023, 9
  • [49] Double feature selection algorithm based on low-rank sparse non-negative matrix factorization
    Shang, Ronghua
    Song, Jiuzheng
    Jiao, Licheng
    Li, Yangyang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (08) : 1891 - 1908
  • [50] SALIENCY-BASED MODELING OF ACOUSTIC SCENES USING SPARSE NON-NEGATIVE MATRIX FACTORIZATION
    Cauchi, Benjamin
    Lagrange, Mathieu
    Misdariis, Nicolas
    Cont, Arshia
    2013 14TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES (WIAMIS), 2013,