SPEECH ACTIVITY DETECTION: AN ECONOMICS APPROACH

被引:0
|
作者
Tsai, T. J. [1 ]
Morgan, Nelson [2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Int Comp Sci Inst, Berkeley, CA 94720 USA
关键词
speech activity detection; feature specialization; PHASE-LOCKED LOOP; DISTORTED SIGNALS; KALMAN FILTER; ALGORITHM;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes an approach to frame-level speech activity detection based on the extended metaphor of an economics marketplace. As in a real marketplace, the simulated marketplace encourages features to specialize. Features that might not have impressive average performance across the entire data set might nonetheless perform very well on a subset of the data, and the marketplace capitalizes on this specialization by consulting the features only when their expertise is relevant. On an experimental data set, we show that the framework is able to effectively utilize the expertise of a set of voicing-related features. For the 50% of the data that fell within these features' realm of expertise, we observe an 83% reduction in false alarm errors and 19% reduction in miss detect errors compared to a baseline HMM-GMM system with MFCCs. Even when we consult these features for the entire data set, thus including the other 50% of data outside their realm of expertise, we still observe a 20% total reduction in equal error rate compared to the baseline system. Analysis of the marketplace transactions also yields useful insight into how the errors are distributed across the data and which types of features are most useful.
引用
收藏
页码:6842 / 6846
页数:5
相关论文
共 50 条
  • [1] SPEECH ACTIVITY DETECTION: AN ECONOMICS APPROACH
    Tsai, T. J.
    Morgan, Nelson
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6842 - 6846
  • [2] A New Pitch Based Approach for Speech Activity Detection
    Punnoose, A. K.
    [J]. PROCEEDINGS OF 2019 5TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTING AND CONTROL (ISPCC 2K19), 2019, : 319 - 322
  • [3] A unified approach to speech enhancement and voice activity detection
    Kasap, Ceyhan
    Arslan, Mustafa Levent
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2013, 21 (02) : 527 - 547
  • [4] A neural network approach for speech activity detection for Apollo corpus
    Pannala, Vishala
    Yegnanarayana, B.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2021, 65
  • [5] A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement
    Zhang, Yan
    Tang, Zhen-min
    Li, Yan-ping
    Luo, Yang
    [J]. SCIENTIFIC WORLD JOURNAL, 2014,
  • [6] Robust Approach to Speech Detection
    Liu, Ruolan
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-CHINA (ICCE-CHINA), 2016,
  • [7] A Novel Approach to EEG Speech Activity Detection with Visual Stimuli and Mobile BCI
    Kocturova, Marianna
    Juhar, Jozef
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (02): : 1 - 12
  • [8] A Federated Approach for Hate Speech Detection
    Gala, Jay
    Gandhi, Deep
    Mehta, Jash
    Talat, Zeerak
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 3248 - 3259
  • [9] Noise Robust Speech Activity Detection
    Abdulla, Waleed H.
    Guan, Zhou
    Sou, Hou Chi
    [J]. 2009 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2009), 2009, : 473 - 477
  • [10] Speech Activity Detection using Accelerometer
    Matic, Aleksandar
    Osmani, Venet
    Mayora, Oscar
    [J]. 2012 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2012, : 2112 - 2115