Non-linear techniques for robust speech recognition

被引:0
|
作者
Ge, Yubo [1 ]
Niu, Jing [1 ]
Ge, Lingnan [2 ]
Shirai, Katsuhiko [2 ]
机构
[1] Tsinghua Univ, Dept Math Sci, Beijing 100084, Peoples R China
[2] Waseda Univ, Sch Sci & Engn, Tokyo 1698555, Japan
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An acoustic speech digital signal can be regarded as a random process repeatedly alternating stationary segments with non-stationary ones. However, the current features used in the mostly recognition system are drawn based linear model theory and are hardly to describe non-stationary character. Consequently, some syllables can not be distinguished in speech parameter space with dimensions as high as 50. This paper tries to develop several features to describe non-stationary measure, a trend degree and the character of continuously exchange between stationary pieces and non-stationary pieces with the help of statistical theory and non-linear random model, a type of doubly random time series model. Our experiments have shown the proposed feature to increase the recognition accurate and the ability of adaptation and self-organisation of the system.
引用
收藏
页码:134 / +
页数:3
相关论文
共 50 条
  • [1] Non-linear transformations of the feature space for robust speech recognition
    de la Torre, A
    Segura, JC
    Benítez, C
    Peinado, AM
    Rubio, AJ
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 401 - 404
  • [2] NON-LINEAR MAPPING FOR MUTLI-CHANNEL SPEECH SEPARATION AND ROBUST OVERLAPPING SPEECH RECOGNITION
    Li, Weifeng
    Dines, John
    Magimai-Doss, Mathew
    Bourlard, Herve
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3921 - 3924
  • [3] Beyond Linear Transforms: Efficient Non-linear Dynamic Adaptation for Noise Robust Speech Recognition
    Rennie, Steven J.
    Dognin, Pierre L.
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1305 - 1308
  • [4] Non-linear feature extraction for robust speech recognition in stationary and non-stationary noise
    Zhu, QF
    Alwan, A
    [J]. COMPUTER SPEECH AND LANGUAGE, 2003, 17 (04): : 381 - 402
  • [5] Feature extraction using non-linear transformation for robust speech recognition on the AURORA database
    Sharma, S
    Ellis, D
    Kajarekar, S
    Jain, P
    Hermansky, H
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1117 - 1120
  • [6] Propagation of statistical information through non-linear feature extractions for robust speech recognition
    Astudillo, R. F.
    Kolossa, D.
    Orglmeister, R.
    [J]. BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, 2007, 954 : 245 - 252
  • [7] Non-linear independent component analysis for speech recognition
    Omar, MK
    Hasegawa-Johnson, M
    [J]. CCCT 2003, VOL6, PROCEEDINGS: COMPUTER, COMMUNICATION AND CONTROL TECHNOLOGIES: III, 2003, : 204 - 209
  • [8] Recognition of Emotion Using Non-Linear Dynamics of Speech
    Harimi, Ali
    Shalizadi, Ali
    Ahmadyfard, Alireza
    [J]. 2014 7th International Symposium on Telecommunications (IST), 2014, : 446 - 451
  • [9] Maximum-likelihood stochastic matching approach to non-linear equalization for robust speech recognition
    Surendran, AC
    Lee, CH
    Rahim, M
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1836 - 1839
  • [10] NON-LINEAR NOISE COMPENSATION FOR ROBUST SPEECH RECOGNITION USING GAUSS-NEWTON METHOD
    Zhao, Yong
    Juang, Biing-Hwang
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4796 - 4799