ROBUST FEATURE CLUSTERING FOR UNSUPERVISED SPEECH ACTIVITY DETECTION

被引:0
|
作者
Dubey, Harishchandra [1 ]
Sangwan, Abhijeet [1 ]
Hansen, John H. L. [1 ]
机构
[1] Univ Texas Dallas, Robust Speech Technol Lab, Ctr Robust Speech Syst, Richardson, TX 75080 USA
关键词
Clustering; Hartigan dip test; NIST OpenSAD; NIST OpenSAT; speech activity detection; zero-resource speech processing; unsupervised learning; SYSTEM;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In certain applications such as zero-resource speech processing or very-low resource speech-language systems, it might not be feasible to collect speech activity detection (SAD) annotations. However, the state-of-the-art supervised SAD techniques based on neural networks or other machine learning methods require annotated training data matched to the target domain. This paper establish a clustering approach for fully unsupervised SAD useful for cases where SAD annotations are not available. The proposed approach leverages Hartigan dip test in a recursive strategy for segmenting the feature space into prominent modes. Statistical dip is invariant to distortions that lends robustness to the proposed method. We evaluate the method on NIST OpenSAD 2015 and NIST OpenSAT 2017 public safety communications data. The results showed the superiority of proposed approach over the two-component GMM baseline.
引用
收藏
页码:2726 / 2730
页数:5
相关论文
共 50 条
  • [41] Feature clustering for robust frequency-domain classification of EEG activity
    Myrden, Andrew
    Chau, Tom
    [J]. JOURNAL OF NEUROSCIENCE METHODS, 2016, 262 : 77 - 84
  • [42] Improved Facial-Feature Detection for AVSP via Unsupervised Clustering and Discriminant Analysis
    Simon Lucey
    Sridha Sridharan
    Vinod Chandran
    [J]. EURASIP Journal on Advances in Signal Processing, 2003
  • [43] Unsupervised Feature Selection Based on Fuzzy Clustering for Fault Detection of the Tennessee Eastman Process
    Bedoya, C.
    Uribe, C.
    Isaza, C.
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2012, 2012, 7637 : 350 - 360
  • [44] Improved facial-feature detection for AVSP via unsupervised clustering and discriminant analysis
    Lucey, S
    Sridharan, S
    Chandran, V
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (03) : 264 - 275
  • [45] Adversarial Feature Learning and Unsupervised Clustering Based Speech Synthesis for Found Data With Acoustic and Textual Noise
    Yang, Shan
    Wang, Yuxuan
    Xie, Lei
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1730 - 1734
  • [46] Robust Voice Activity Detection Using Feature Combination
    Haghani, Sahar Khaksar
    Ahadi, Seyed Mohammad
    [J]. 2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,
  • [47] Unsupervised segmentation of meeting configurations and activities using speech activity detection
    Brdiczka, Oliver
    Vaufreydaz, Dominique
    Maisonnasse, Jerome
    Reignier, Patrick
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2006, 204 : 195 - +
  • [48] Unsupervised Adaptation of Deep Speech Activity Detection Models to Unseen Domains
    Gimeno, Pablo
    Ribas, Dayana
    Ortega, Alfonso
    Miguel, Antonio
    Lleida, Eduardo
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (04):
  • [49] Unsupervised robust clustering for image database categorization
    Le Saux, B
    Boujemaa, N
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL I, PROCEEDINGS, 2002, : 259 - 262
  • [50] Improving Unsupervised Image Clustering With Robust Learning
    Park, Sungwon
    Han, Sungwon
    Kim, Sundong
    Kim, Danu
    Park, Sungkyu
    Hong, Seunghoon
    Cha, Meeyoung
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12273 - 12282