Robust Voice Activity Detection Using Selectively Energy Features

被引:0
|
作者
Wakasugi, Junichiro [1 ]
Hayasaka, Noboru [2 ]
Iiguni, Youji [1 ]
机构
[1] Osaka Univ, Grad Sch Engn Sci, Osaka, Japan
[2] Osaka Electrocommun Univ, Dept Informat Engn, Osaka, Japan
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a robust voice activity detection algorithm that can switch the calculation method automatically depending on the noise in order to adapt various noise. We use entropy as an indicator for judging whether the noise is narrow-band or wide-band. Under narrow-band noise condition, spectral product is the suitable calculation method, on the other hand, under wide-band noise condition, using spectral summation is the suitable one. The proposed method decides the type of noise by entropy, then uses the suitable calculation method depending on the noise. We evaluated the proposed method compared with other conventional methods by ROC curves and the number of correct-segments. As the result of the experiments, the proposed method can detect the speech-segments more correctly than the other methods and shows the better performance in frame-level. The experimental result shows the proposed method can switch the calculation method appropriately depending on the noise.
引用
收藏
页码:359 / 362
页数:4
相关论文
共 50 条
  • [1] Robust Voice Activity Detection Using Feature Combination
    Haghani, Sahar Khaksar
    Ahadi, Seyed Mohammad
    2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,
  • [2] Robust voice activity detection using group delay functions
    Krishnan, Sree Hari P.
    Padmanabhan, R.
    Murthy, Heina A.
    2006 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS 1-6, 2006, : 1704 - +
  • [3] Robust Voice Activity Detection Using Gammatone Filtering and Entropy
    Ong, W. Q.
    Tan, A. W. C.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND SCIENCES (ICORAS 2016), 2016,
  • [4] On Noise Robust Voice Activity Detection
    Dekens, Tomas
    Verhelst, Werner
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2660 - 2663
  • [5] IMPROVED VOICE ACTIVITY DETECTION USING STATIC HARMONIC FEATURES
    Fukuda, Takashi
    Ichikawa, Osamu
    Nishimura, Masafumi
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4482 - 4485
  • [6] Noise Robust Voice Activity Detection Using Features Extracted From the Time-Domain Autocorrelation Function
    Ghaemmaghami, Houman
    Baker, Brendan
    Vogt, Robbie
    Sridharan, Sridha
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 3118 - 3121
  • [7] Robust voice activity detection using perceptual wavelet-packet transform and Teager energy operator
    Chen, Shi-Huang
    Wu, Hsin-Te
    Chang, Yukon
    Truong, T. K.
    PATTERN RECOGNITION LETTERS, 2007, 28 (11) : 1327 - 1332
  • [8] ROBUST VOICE ACTIVITY DETECTION BASED ON PITCH AND SUB-BAND ENERGY
    Zhang, Zhihao
    Lin, Jinlong
    SIGMAP 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2009, : 44 - 48
  • [9] Robust Voice Activity Detection Using the Spectral Peaks of Vowel Sounds
    Yoo, In-Chul
    Yook, Dongsuk
    ETRI JOURNAL, 2009, 31 (04) : 451 - 453
  • [10] A Fusion Model for Robust Voice Activity Detection
    Wang, Guan-Bo
    Zhang, Wei-Qiang
    2019 IEEE 19TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2019), 2019,