Detection of spectral transition for speech perception based on time-frequency analysis

Authors
Zhao, Q [1]
Gao, QL [1]
Chi, HS [1]
Affiliations
[1] Peking Univ, Nat Lab Machine Percept, Ctr Informat Sci, Beijing 100871, Peoples R China
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Current speech and speaker recognition systems rely largely on the voiced parts of an utterance, although a great deal of information for speech perception is contained in the nonstationary consonants and transitions. How to model and characterize the dynamic spectral features that describe these transitions remains an open question. This paper investigates the modeling and detection of spectral transitions based on time-frequency analysis. Linear and nonlinear models of the transitions are proposed using linear and quadratic frequency-modulated signals. Two strategies for detecting spectral transitions are then presented: the Radon-Wigner transform (RWT) and the Radon-Ambiguity transform (RAT). Both simulated data and real speech data from the TIMIT database are used to test the detection procedure.
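As a rough illustration of the idea in the abstract, the sketch below models a transition as a linear frequency-modulated (chirp) signal and estimates its parameters with a dechirp-and-FFT search. For a linear chirp, integrating the Wigner distribution along a line in the time-frequency plane is commonly computed this way, so the search stands in for an RWT-style detector. It is not the authors' implementation; the function names, sampling rate, chirp parameters, and noise level are all assumptions chosen for the example.

```python
import numpy as np

def make_lfm(n, fs, f0, rate):
    """Complex linear FM signal: start frequency f0 (Hz), sweep rate (Hz/s)."""
    t = np.arange(n) / fs
    return np.exp(2j * np.pi * (f0 * t + 0.5 * rate * t**2))

def dechirp_search(x, fs, rates):
    """Grid search over candidate sweep rates (hypothetical helper).

    For each candidate rate the quadratic phase is removed; if the rate
    matches, the signal collapses to a pure tone and the spectrum peaks.
    Returns the (rate, start frequency) pair with the largest peak.
    """
    n = len(x)
    t = np.arange(n) / fs
    freqs = np.fft.fftfreq(n, d=1.0 / fs)
    best_power, best_rate, best_f0 = -1.0, None, None
    for r in rates:
        dechirped = x * np.exp(-1j * np.pi * r * t**2)
        power = np.abs(np.fft.fft(dechirped)) ** 2
        k = int(np.argmax(power))
        if power[k] > best_power:
            best_power, best_rate, best_f0 = power[k], float(r), float(freqs[k])
    return best_rate, best_f0

# Assumed test signal: 1024 samples at 8 kHz, sweeping up from 500 Hz
# at 4000 Hz/s, with additive complex Gaussian noise (fixed seed).
fs, n = 8000.0, 1024
rng = np.random.default_rng(0)
noise = 0.5 * (rng.standard_normal(n) + 1j * rng.standard_normal(n))
x = make_lfm(n, fs, f0=500.0, rate=4000.0) + noise

rates = np.linspace(0.0, 8000.0, 81)  # candidate sweep rates, 100 Hz/s apart
rate_hat, f0_hat = dechirp_search(x, fs, rates)
print(rate_hat, f0_hat)
```

The resolution of the estimate is limited by the spacing of the candidate rates and by the FFT bin width (fs / n). A quadratic FM model, as mentioned in the abstract, would need an additional search dimension for the cubic phase term.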
Pages: 522-526
Page count: 3