ROBUST FEATURE CLUSTERING FOR UNSUPERVISED SPEECH ACTIVITY DETECTION

被引:0
|
作者
Dubey, Harishchandra [1 ]
Sangwan, Abhijeet [1 ]
Hansen, John H. L. [1 ]
机构
[1] Univ Texas Dallas, Robust Speech Technol Lab, Ctr Robust Speech Syst, Richardson, TX 75080 USA
关键词
Clustering; Hartigan dip test; NIST OpenSAD; NIST OpenSAT; speech activity detection; zero-resource speech processing; unsupervised learning; SYSTEM;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In certain applications such as zero-resource speech processing or very-low resource speech-language systems, it might not be feasible to collect speech activity detection (SAD) annotations. However, the state-of-the-art supervised SAD techniques based on neural networks or other machine learning methods require annotated training data matched to the target domain. This paper establish a clustering approach for fully unsupervised SAD useful for cases where SAD annotations are not available. The proposed approach leverages Hartigan dip test in a recursive strategy for segmenting the feature space into prominent modes. Statistical dip is invariant to distortions that lends robustness to the proposed method. We evaluate the method on NIST OpenSAD 2015 and NIST OpenSAT 2017 public safety communications data. The results showed the superiority of proposed approach over the two-component GMM baseline.
引用
收藏
页码:2726 / 2730
页数:5
相关论文
共 50 条
  • [21] Robust autoencoder feature selector for unsupervised feature selection
    Ling, Yunzhi
    Nie, Feiping
    Yu, Weizhong
    Ling, Yunhao
    Li, Xuelong
    [J]. INFORMATION SCIENCES, 2024, 660
  • [22] Unsupervised Robust Bayesian Feature Selection
    Sun, Jianyong
    Zhou, Aimin
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 558 - 564
  • [23] Robust unsupervised dimensionality reduction based on feature clustering for single-cell imaging data
    Peralta, Daniel
    Saeys, Yvan
    [J]. APPLIED SOFT COMPUTING, 2020, 93
  • [24] Unsupervised learning of overlapped speech model parameters for multichannel speech activity detection in meetings
    Laskowski, Kornel
    Schultz, Tanja
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 993 - 996
  • [25] An efficient unsupervised feature selection procedure through feature clustering
    Yan, Xuyang
    Nazmi, Shabnam
    Erol, Berat A.
    Homaifar, Abdollah
    Gebru, Biniam
    Tunstel, Edward
    [J]. PATTERN RECOGNITION LETTERS, 2020, 131 : 277 - 284
  • [26] Unsupervised topic clustering of switchboard speech messages
    Carlson, BA
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 315 - 318
  • [27] Unsupervised Clustering of Prosodic Patterns in Spontaneous Speech
    Beke, Andras
    Szaszak, Gyoergy
    [J]. TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 648 - 655
  • [28] Unsupervised Feature Selection with Joint Clustering Analysis
    An, Shuai
    Wang, Jun
    Wei, Jinmao
    Yang, Zhenglu
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1639 - 1648
  • [29] Unsupervised adaptive hashing based on feature clustering
    Yuan, Tongtong
    Deng, Weihong
    Hu, Jiani
    An, Zhanfu
    Tang, Yinan
    [J]. NEUROCOMPUTING, 2019, 323 : 373 - 382
  • [30] A unifying criterion for unsupervised clustering and feature selection
    Breaban, Mihaela
    Luchian, Henri
    [J]. PATTERN RECOGNITION, 2011, 44 (04) : 854 - 865