Usable speech detection using a context dependent Gaussian Mixture Model classifier

被引:0
|
作者
Yantorno, RE [1 ]
Smolenski, BY [1 ]
Iyer, AN [1 ]
Shah, JK [1 ]
机构
[1] Temple Univ, ECE Dept, Philadelphia, PA 19122 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech that is corrupted by nonstationary interference, but contains segments that are still usable for applications such as speaker identification or speech recognition, is referred to as "usable" speech. A common example of nonstationary interference occurs when there is more than one person talking at the same time, which is known as co-channel speech. In general the above speech processing applications do not work in co-channel environments; however, they can work on the extracted usable segments. Unfortunately, currently available usable speech measures only detect about 75% of the total available usable speech. The,first reason for this high error stems from the fact that no single feature can accurately identify all the usable speech characteristics. This situation can be resolved by using a Gaussian Mixture Model (GMM) based classifier to combine several usable speech features. A second source of error stems from the fact that the current usable speech measures treat each frame of co-channel data independently of the decisions made on adjacent frames. This situation can be resolved when a Hidden Markov Model (HMM) is used to incorporate any context dependent information in adjacent frames. Using this approach we were able to obtain 84% detection of usable speech with a 16% false alarm rate.
引用
收藏
页码:620 / 623
页数:4
相关论文
共 50 条
  • [1] Classification of stressed speech using Gaussian mixture model
    Patro, H
    Raja, GS
    Dandapat, S
    [J]. INDICON 2005 Proceedings, 2005, : 342 - 346
  • [2] EVENT DETECTION IN SHORT DURATION AUDIO USING GAUSSIAN MIXTURE MODEL AND RANDOM FOREST CLASSIFIER
    Kumar, Anurag
    Hegde, Rajesh M.
    Singh, Rita
    Raj, Bhiksha
    [J]. 2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [3] Detection of sEMG Muscle Activation Intervals Using Gaussian Mixture Model and Ant Colony Classifier
    Naseem, Amal
    Jabloun, Meryem
    Buttelli, Olivier
    Ravier, Phillippe
    [J]. 2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1713 - 1717
  • [4] A Gaussian mixture model classifier using supervised and unsupervised learning.
    Goodman, GL
    McMichael, DW
    [J]. ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 565 - 566
  • [5] Subspace Based Speech Enhancement Using Gaussian Mixture Model
    Kundu, Achintya
    Chatterjee, Saikat
    Sreenivas, T. V.
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 395 - 398
  • [6] DETECTION OF STOP LANDMARKS USING GAUSSIAN MIXTURE MODELING OF SPEECH SPECTRUM
    Jayan, A. R.
    Pandey, P. C.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4681 - 4684
  • [7] Detection of Speed Bumps Using Gaussian Mixture Model
    Srimongkon, Suchada
    Chiracharit, Werapon
    [J]. 2017 14TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2017, : 628 - 631
  • [8] HUMAN SKIN DETECTION USING GAUSSIAN MIXTURE MODEL
    Oancea, Romana
    Demeter, Stefan
    Kifor, Stefania
    [J]. 15TH INTERNATIONAL CONFERENCE THE KNOWLEDGE-BASED ORGANIZATION: APPLIED TECHNICAL SCIENCES AND ADVANCED MILITARY TECHNOLOGIES, CONFERENCE PROCEEDINGS 6, 2009, 6 : 113 - 118
  • [9] An Efficient Gaussian Mixture Model Classifier for Outdoor Surveillance Using Seismic Signals
    Aruchamy, Srinivasan
    Chakraborty, Anisom
    Das, Manisha
    Vadali, Siva Ram Krishna
    Ray, Ranjit
    Nandy, Sambhunath
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [10] A Multi-class Object Classifier Using Boosted Gaussian Mixture Model
    Lee, Wono
    Lee, Minho
    [J]. NEURAL INFORMATION PROCESSING: THEORY AND ALGORITHMS, PT I, 2010, 6443 : 430 - 437