ROBUST FEATURE CLUSTERING FOR UNSUPERVISED SPEECH ACTIVITY DETECTION

被引:0
|
作者
Dubey, Harishchandra [1 ]
Sangwan, Abhijeet [1 ]
Hansen, John H. L. [1 ]
机构
[1] Univ Texas Dallas, Robust Speech Technol Lab, Ctr Robust Speech Syst, Richardson, TX 75080 USA
关键词
Clustering; Hartigan dip test; NIST OpenSAD; NIST OpenSAT; speech activity detection; zero-resource speech processing; unsupervised learning; SYSTEM;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In certain applications such as zero-resource speech processing or very-low resource speech-language systems, it might not be feasible to collect speech activity detection (SAD) annotations. However, the state-of-the-art supervised SAD techniques based on neural networks or other machine learning methods require annotated training data matched to the target domain. This paper establish a clustering approach for fully unsupervised SAD useful for cases where SAD annotations are not available. The proposed approach leverages Hartigan dip test in a recursive strategy for segmenting the feature space into prominent modes. Statistical dip is invariant to distortions that lends robustness to the proposed method. We evaluate the method on NIST OpenSAD 2015 and NIST OpenSAT 2017 public safety communications data. The results showed the superiority of proposed approach over the two-component GMM baseline.
引用
收藏
页码:2726 / 2730
页数:5
相关论文
共 50 条
  • [1] A robust unsupervised speaker clustering of speech utterances
    Zhang, SL
    Zhang, SW
    Xu, B
    [J]. PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 115 - 120
  • [2] Embedded Discriminant Analysis based Speech Activity Detection for Unsupervised Stress Speech Clustering
    Prasetio, Barlian Henryranu
    Tamura, Hiroki
    Tanno, Koichi
    [J]. 2020 JOINT 9TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2020 4TH INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2020,
  • [3] A robust unsupervised pattern discovery and clustering of speech signals
    Kumar, Kishore R.
    Birla, Lokendra
    Rao, Sreenivasa K.
    [J]. PATTERN RECOGNITION LETTERS, 2018, 116 : 254 - 261
  • [4] Manifold Regularized Robust Unsupervised Feature Selection for Image Clustering
    Shi, Yuqing
    Du, Shiqiang
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 11161 - 11165
  • [5] Unsupervised video anomaly detection using feature clustering
    Li, H.
    Achim, A.
    Bull, D.
    [J]. IET SIGNAL PROCESSING, 2012, 6 (05) : 521 - 533
  • [6] Unsupervised Feature Selection with Feature Clustering
    Cheung, Yiu-ming
    Jia, Hong
    [J]. 2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 9 - 15
  • [7] Robust speech recognition with on-line unsupervised acoustic feature compensation
    Buera, Luis
    Miguel, Antonio
    Lleida, Eduardo
    Saz, Oscar
    Ortega, Alfonso
    [J]. 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 105 - 110
  • [8] Recurrent Clustering for Unsupervised Feature Extraction with Application to Sequence Detection
    Young, Steven R.
    Arel, Itamar
    [J]. 2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 2, 2012, : 54 - 55
  • [9] Unsupervised anomaly detection model combining total attributes clustering and feature attributes clustering
    Liu W.-G.
    Zhang Z.-L.
    [J]. Tiedao Xuebao/Journal of the China Railway Society, 2010, 32 (05): : 59 - 64
  • [10] Noise Robust Speech Activity Detection
    Abdulla, Waleed H.
    Guan, Zhou
    Sou, Hou Chi
    [J]. 2009 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2009), 2009, : 473 - 477