Log-Spectral Linear Regression Based on Voicing Cut-Off Frequency for Robust Speech Recognition

Cited by: 0
Authors
Lu, Yong [1]
Zhou, Lin [2]
Affiliations
[1] Hohai Univ, Coll Comp & Informat Engn, Nanjing, Jiangsu, Peoples R China
[2] Southeast Univ, Sch Informat Sci & Engn, Nanjing, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
voicing cut-off frequency; log-spectral linear regression; robust speech recognition; model adaptation; ADAPTATION;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper proposes a maximum likelihood log-spectral linear regression algorithm based on the voicing cut-off frequency for robust speech recognition. The algorithm converts the pre-trained acoustic model to the log-spectral domain via the inverse discrete cosine transform and discards the high-frequency part of the training mean and variance. The testing mean and variance are then obtained by log-spectral linear regression, with the regression parameters estimated from a small amount of adaptation data using the expectation-maximization algorithm under the maximum likelihood criterion. Experimental results show that the proposed algorithm yields more accurate testing acoustic models and outperforms the traditional linear regression method.
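The first two steps named in the abstract (mapping a cepstral-domain Gaussian to the log-spectral domain with the inverse DCT, then keeping only the low-frequency part) can be sketched as follows. This is a minimal illustration and not the authors' implementation: it assumes an untruncated orthonormal DCT-II, diagonal covariances, and a per-bin scale-and-bias form for the regression. The names `dct_matrix`, `cepstral_to_log_spectral`, `keep_low_band`, `adapt_mean`, and the parameter `n_keep` (the number of bins below the voicing cut-off frequency) are hypothetical, and the EM estimation of the regression parameters is omitted.

```python
import math


def dct_matrix(n):
    """Orthonormal DCT-II matrix of size n x n (row k is the k-th cosine basis)."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n)])
    return rows


def cepstral_to_log_spectral(mean_c, var_c):
    """Map a diagonal Gaussian from the cepstral to the log-spectral domain.

    Assumption: the DCT matrix C is square and orthonormal, so the inverse
    transform is C^T. Then mean_l = C^T mean_c, and the diagonal of
    C^T diag(var_c) C gives the log-spectral variances.
    """
    n = len(mean_c)
    C = dct_matrix(n)
    mean_l = [sum(C[k][i] * mean_c[k] for k in range(n)) for i in range(n)]
    var_l = [sum(C[k][i] ** 2 * var_c[k] for k in range(n)) for i in range(n)]
    return mean_l, var_l


def keep_low_band(values, n_keep):
    """Keep the first n_keep bins (assumed ordered from low to high frequency)."""
    return values[:n_keep]


def adapt_mean(mean_l, scale, bias):
    """Per-bin linear regression of the mean: scale * mean + bias (assumed form)."""
    return [a * m + b for a, m, b in zip(scale, mean_l, bias)]
```

With an untruncated orthonormal DCT the round trip is exact, so a log-spectral vector transformed to the cepstral domain and passed through `cepstral_to_log_spectral` is recovered unchanged. In practice cepstral features are usually truncated, in which case the inverse is only approximate.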
Pages: 542 - 545
Number of pages: 4
Related papers
50 in total
  • [1] Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies
    Hermus, Kris
    Girin, Laurent
    Van Hamme, Hugo
    Irhimeh, Sufian
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4473 - +
  • [2] Estimation of the voicing cut-off frequency contour based on a cumulative harmonicity score
    Hermus, Kris
    Van Hamme, Hugo
    Irhimeh, Sufian
    IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (11) : 820 - 823
  • [3] A straightforward method for calculating the voicing cut-off frequency for streaming HNM TTS
    Louw, J. A.
    Proceedings of the 2015 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech), 2015, : 252 - 257
  • [4] Segmented Regression Based on Cut-off Polynomials
    Kanka, Milos
    STATISTIKA-STATISTICS AND ECONOMY JOURNAL, 2016, 96 (02) : 60 - 72
  • [5] Structured Log Linear Models for Noise Robust Speech Recognition
    Zhang, Shi-Xiong
    Ragni, Anton
    Gales, Mark John Francis
    IEEE SIGNAL PROCESSING LETTERS, 2010, 17 (11) : 945 - 948
  • [6] Variable Selection Linear Regression for Robust Speech Recognition
    Tsao, Yu
    Hu, Ting-Yao
    Sakti, Sakriani
    Nakamura, Satoshi
    Lee, Lin-shan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (06) : 1477 - 1487
  • [7] An improved EEMD method based on cut-off frequency
    Huang, Jie
    Zhang, Mei-Jun
    Chai, Kai
    Chen, Hao
    Zhendong yu Chongji/Journal of Vibration and Shock, 2015, 34 (08) : 101 - 105
  • [8] Robust speech emotion recognition using log frequency power ratio
    Hyun, Kyung-Hak
    Kim, Eun-Ho
    Kwak, Yoon-Keun
    2006 SICE-ICASE INTERNATIONAL JOINT CONFERENCE, VOLS 1-13, 2006, : 229 - +
  • [9] Cut-off frequency of magnetostrictive materials based on permeability spectra
    Meng, Hao
    Zhang, Tianli
    Jiang, Chengbao
    JOURNAL OF MAGNETISM AND MAGNETIC MATERIALS, 2012, 324 (12) : 1933 - 1937
  • [10] NEW ROBUST CUT-OFF VALUES IN DETERMINING BAD LEVERAGE POINTS IN THE LOGISTIC REGRESSION MODEL
    Gundogan Asik, Ebru
    Altin Yavuz, Arzu
    Kucuk, Zafer
    JOURNAL OF MEHMET AKIF ERSOY UNIVERSITY ECONOMICS AND ADMINISTRATIVE SCIENCES FACULTY, 2021, 8 (02) : 630 - 650