Log-Spectral Linear Regression Based on Voicing Cut-Off Frequency for Robust Speech Recognition

Cited by: 0
Authors
Lu, Yong [1]
Zhou, Lin [2]
Affiliations
[1] Hohai Univ, Coll Comp & Informat Engn, Nanjing, Jiangsu, Peoples R China
[2] Southeast Univ, Sch Informat Sci & Engn, Nanjing, Jiangsu, Peoples R China
Source
2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1 | 2015
Funding
National Natural Science Foundation of China;
Keywords
voicing cut-off frequency; log-spectral linear regression; robust speech recognition; model adaptation; ADAPTATION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper proposes a maximum-likelihood log-spectral linear regression algorithm based on the voicing cut-off frequency for robust speech recognition. The algorithm converts the pre-trained acoustic model to the log-spectral domain with the inverse discrete cosine transform and ignores the high-frequency part of the training mean and variance. The testing mean and variance are then obtained by log-spectral linear regression, with the regression parameters estimated from a small amount of adaptation data using the expectation-maximization algorithm under the maximum likelihood criterion. Experimental results show that the proposed algorithm yields more accurate testing acoustic models and outperforms the traditional linear regression method.
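The model-domain steps named in the abstract (inverse DCT to the log-spectral domain, dropping the channels above the cut-off, then a linear regression on the mean and variance) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the orthonormal DCT-II convention, the use of a channel index `cutoff_channel` to stand in for the voicing cut-off frequency, and the affine form `A @ mu + b` are all assumptions made for illustration, and the EM estimation of `A` and `b` is not shown.

```python
import numpy as np

def dct_matrix(num_ceps, num_channels):
    """Orthonormal DCT-II matrix of shape (num_ceps, num_channels) (assumed convention)."""
    n = np.arange(num_channels)
    k = np.arange(num_ceps)[:, None]
    C = np.sqrt(2.0 / num_channels) * np.cos(np.pi * k * (2 * n + 1) / (2 * num_channels))
    C[0, :] /= np.sqrt(2.0)
    return C

def to_log_spectral(mean_cep, var_cep, C):
    """Map a diagonal-covariance cepstral Gaussian to the log-spectral domain."""
    C_inv = np.linalg.pinv(C)                      # inverse DCT, (num_channels, num_ceps)
    mean_log = C_inv @ mean_cep
    cov_log = C_inv @ np.diag(var_cep) @ C_inv.T   # full covariance after the transform
    return mean_log, cov_log

def truncate_and_adapt(mean_log, cov_log, cutoff_channel, A, b):
    """Keep channels below the cut-off, then apply a hypothetical affine regression."""
    mu = mean_log[:cutoff_channel]
    cov = cov_log[:cutoff_channel, :cutoff_channel]
    return A @ mu + b, A @ cov @ A.T

# Toy values, chosen only to exercise the functions.
C = dct_matrix(13, 24)
mean_cep = np.linspace(1.0, 0.1, 13)
var_cep = np.full(13, 0.5)
mean_log, cov_log = to_log_spectral(mean_cep, var_cep, C)

K = 16                                             # assumed cut-off channel index
A = 0.9 * np.eye(K)                                # placeholder regression parameters
b = np.full(K, 0.1)
mean_test, cov_test = truncate_and_adapt(mean_log, cov_log, K, A, b)
```

In the paper the regression parameters are estimated from adaptation data with EM; here they are fixed placeholders so the data flow through the three steps is visible.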
Pages: 542-545
Number of pages: 4
Related papers
50 in total
  • [21] Robust speech/non-speech detection based on LDA-derived parameter and voicing parameter for speech recognition in noisy environments
    Martin, A
    Mauuary, L
    SPEECH COMMUNICATION, 2006, 48 (02) : 191 - 206
  • [22] Cepstral Distance and Log Energy Based Silence Feature Normalization for Robust Speech Recognition
    Shen, Guanghu
    Chung, Hyun-Yeol
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2010, 29 (04) : 278 - 285
  • [23] Model Adaptation Algorithm Based on Central Subband Regression for Robust Speech Recognition
    Lu, Yong
    Zhou, Lin
    2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 2, 2014
  • [24] Noisy Constrained Maximum-Likelihood Linear Regression for Noise-Robust Speech Recognition
    Kim, D. K.
    Gales, M. J. F.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02) : 315 - 325
  • [25] MHz cut-off frequency and permeability mechanism of iron-based soft magnetic composites
    Jin, Xiao-Wei
    Li, Tong
    Shi, Hui-Gang
    Xue, De-Sheng
    CHINESE PHYSICS B, 2024, 33 (09)
  • [26] Adaptive Training with Noisy Constrained Maximum Likelihood Linear Regression for Noise Robust Speech Recognition
    Kim, D. K.
    Gales, M. J. F.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009 : 2367 - 2370
  • [27] MHz cut-off frequency and permeability mechanism of iron-based soft magnetic composites
    金校伟
    李通
    史慧刚
    薛德胜
    Chinese Physics B, 2024, 33 (09) : 564 - 568
  • [28] The cut-off frequency - a key concept in the heat flow measurements based on the thermoelastic photoacoustic response
    Markushev, D. K.
    Brankovic, N. Lj
    Galovic, S. P.
    Djordjevic, K. Lj
    Aleksic, S. M.
    Pantic, D. S.
    Markushev, D. D.
    MEASUREMENT, 2025, 248
  • [29] Regularized Feature-Based Maximum Likelihood Linear Regression for Speech Recognition
    Omar, Mohamed Kamal
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007 : 2456 - 2459
  • [30] A log-based non-convex relaxation regularized regression for robust face recognition
    Liu, Ruonan
    Xu, Yitian
    INFORMATION SCIENCES, 2024, 667