A Voice Activity Detection Algorithm Using Sparse Non-negative Matrix Factorization-based Model Learning in Spectro-Temporal Domain

被引:0
|
作者
Mavaddati, S. [1 ]
机构
[1] Univ Mazandaran, Fac Engn & Technol, Babolsar, Iran
来源
INTERNATIONAL JOURNAL OF ENGINEERING | 2023年 / 36卷 / 08期
关键词
Voice Activity Detector; Spectro-temporal Domain; Spectro-temporal Sparse Structured Principal Component; Analysis; Sparse Non-negative Matrix Factorization; RECOGNITION; NOISE;
D O I
10.5829/ije.2023.36.08b.08
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Voice activity detectors are presented to extract silence/speech segments of the speech signal to eliminate different background noise signals. A novel voice activity detector is proposed in this paper using spectro-temporal features extracted from the auditory model of the speech signal. After extracting the scale, rate, and frequency features from this feature space, a sparse structured principal component analysis algorithm is used to consider the basic components of these features and reduce the dimension of learning data. Then these feature vectors are employed to learn the models by the sparse non-negative matrix factorization algorithm. The model learning procedure is performed to represent each feature vector with a proper sparse rate based on the selected atoms. Voice activity detection of the input frames is performed by computing the energy of the sparse representation for each input frame over the composite model. If the calculated energy exceeds a specified threshold, it indicates that the input frame has a structure similar to the atoms of the learned models and concludes that the observed frame has voice content. The results of the proposed detector were compared with other baseline methods and classifiers in this processing field. These results in the presence of stationary, non-stationary and periodic noises were investigated and they are shown that the proposed method based on model learning with spectro-temporal features can correctly detect the silence/speech activities.doi: 10.5829/ije.2023.36.08b.08
引用
收藏
页码:1478 / 1488
页数:11
相关论文
共 50 条
  • [21] A Generalized Deep Learning Clustering Algorithm Based on Non-Negative Matrix Factorization
    Wang, Dexian
    Li, Tianrui
    Deng, Ping
    Zhang, Fan
    Huang, Wei
    Zhang, Pengfei
    Liu, Jia
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (07)
  • [22] Web Behavior Analysis Using Sparse Non-Negative Matrix Factorization
    Demachi, Akihiro
    Matsushima, Shin
    Yamanishi, Kenji
    PROCEEDINGS OF 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, (DSAA 2016), 2016, : 574 - 583
  • [23] Study on characteristic dimension and sparse factor in Non-negative Matrix Factorization algorithm
    Hou Mo
    Yang Mao-yun
    Qiao Shu-yun
    Wang Gai-ge
    Gao Li-qun
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 2957 - 2961
  • [24] INmfCA Algorithm for Training of Nonparallel Voice Conversion Systems Based on Non-Negative Matrix Factorization
    Suda, Hitoshi
    Kotani, Gaku
    Saito, Daisuke
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (06) : 1196 - 1210
  • [25] Exemplar-based Emotional Voice Conversion Using Non-negative Matrix Factorization
    Aihara, Ryo
    Ueda, Reina
    Takiguchi, Tetsuya
    Ariki, Yasuo
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [26] Acoustic Event Classification using spectral band selection and Non-Negative Matrix Factorization-based features
    Ludena-Choez, Jimmy
    Gallardo-Antolin, Ascension
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 46 : 77 - 86
  • [27] Nonlinear hyperspectral unmixing based on sparse non-negative matrix factorization
    Li, Jing
    Li, Xiaorun
    Zhao, Liaoying
    JOURNAL OF APPLIED REMOTE SENSING, 2016, 10
  • [28] Image Denoising based on Sparse Representation and Non-Negative Matrix Factorization
    Farouk, R. M.
    Khalil, H. A.
    LIFE SCIENCE JOURNAL-ACTA ZHENGZHOU UNIVERSITY OVERSEAS EDITION, 2012, 9 (01): : 337 - 341
  • [29] Improved Non-Negative Matrix Factorization-Based Noise Reduction of Leakage Acoustic Signals
    Yu, Yongsheng
    Hu, Yongwen
    Wang, Yingming
    Cai, Zhuoran
    SENSORS, 2024, 24 (16)
  • [30] Voice Conversion based on Non-negative Matrix Factorization in Noisy Environments
    Fujii, Takao
    Aihara, Ryo
    Takashima, Ryoichi
    Takiguchi, Tetsuya
    Ariki, Yasuo
    2013 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2013, : 495 - 498