SPEECH OVERLAP DETECTION USING CONVOLUTIVE NON-NEGATIVE SPARSE CODING: NEW IMPROVEMENTS AND INSIGHTS

被引:0
|
作者
Geiger, Juergen T. [1 ]
Vipperla, Ravichander [2 ]
Evans, Nicholas [2 ]
Schuller, Bjoern [1 ]
Rigoll, Gerhard [1 ]
机构
[1] Tech Univ Munich, Inst Human Machine Commun, D-8000 Munich, Germany
[2] EURECOM, Multimedia Commun Dept, Sophia Antipolis, France
关键词
speech overlap detection; convolutive non-negative sparse coding; speaker diarization;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents recent advances in the application of convolutive non-negative sparse coding (CNSC) to the problem of overlap detection in the context of conference meetings and speaker diarization. CNSC is used to project a mixed speaker signal onto separate speaker bases and hence to detect intervals of competing speech. We present new energy ratio and total energy features which give signicant improvements over our previous work. The system is assessed using a subset of the AMI meeting corpus. We report results which are comparable to the state of the art which support the potential of a new approach to overlap detection. An analysis of system performance highlights the importance of further work to addresses weaknesses in detecting particularly short segments of overlapping speech.
引用
收藏
页码:340 / 344
页数:5
相关论文
共 50 条
  • [21] Non-negative Local Sparse Coding for Subspace Clustering
    Hosseini, Babak
    Hammer, Barbara
    ADVANCES IN INTELLIGENT DATA ANALYSIS XVII, IDA 2018, 2018, 11191 : 137 - 150
  • [22] Non-Negative Sparse Coding with Regularizer for Image Classification
    Mukherjee, Lopamudra
    Hall, Alex
    2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 852 - 859
  • [23] A DIAGONALIZED NEWTON ALGORITHM FOR NON-NEGATIVE SPARSE CODING
    Van Hamme, Hugo
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7299 - 7303
  • [24] Modeling receptive fields with non-negative sparse coding
    Hoyer, PO
    NEUROCOMPUTING, 2003, 52-4 : 547 - 552
  • [25] NON-NEGATIVE SPARSE CODING FOR HUMAN ACTION RECOGNITION
    Amiri, S. Mohsen
    Nasiopoulos, Panos
    Leung, Victor C. M.
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1421 - 1424
  • [26] Non-Negative Kernel Sparse Coding for Image Classification
    Zhang, Yungang
    Xu, Tianwei
    Ma, Jieming
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: IMAGE AND VIDEO DATA ENGINEERING, ISCIDE 2015, PT I, 2015, 9242 : 531 - 540
  • [27] Face recognition using localized features based on non-negative sparse coding
    Bhavin J. Shastri
    Martin D. Levine
    Machine Vision and Applications, 2007, 18 : 107 - 122
  • [28] Noise removal using a novel non-negative sparse coding shrinkage technique
    Shang, L
    Huang, DS
    Zheng, CH
    Sun, ZL
    NEUROCOMPUTING, 2006, 69 (7-9) : 874 - 877
  • [29] Face recognition using localized features based on non-negative sparse coding
    Shastri, Bhavin J.
    Levine, Martin D.
    MACHINE VISION AND APPLICATIONS, 2007, 18 (02) : 107 - 122
  • [30] Improvement in monaural speech separation using sparse non-negative tucker decomposition
    Varshney, Yash Vardhan
    Upadhyaya, Prashant
    Abbasi, Zia Ahmad
    Abidi, Musiur Raza
    Farooq, Omar
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (04) : 837 - 849