HIGH-DIMENSIONAL LINEAR REPRESENTATIONS FOR ROBUST SPEECH RECOGNITION

被引:0
|
作者
Ager, Matthew [1 ]
Cvetkovic, Zoran [2 ]
Sollich, Peter [1 ]
机构
[1] Kings Coll London, Dept Math, London WC2R 2LS, England
[2] Kings Coll London, Dept Elect Engn, London WC2R 2LS, England
关键词
acoustic waveforms; phoneme; classification; robust; speech recognition; PHONETIC CLASSIFICATION; CONFUSIONS; MACHINES;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Phoneme classification is investigated in linear feature domains with the aim of improving the robustness to additive noise. Linear feature domains allow for exact noise adaptation and so should result in more accurate classification than representations involving nonlinear processing and dimensionality reduction. We develop a generative framework for phoneme classification using linear features. We first show results for a representation consisting of concatenated frames from the centre of the phoneme, each containing f frames. As no single f is optimal for all phonemes, we further average over models with a range of values of f. Next we improve results by including information from the entire phoneme. In the presence of additive noise, classification in this framework performs better than an analogous PLP classifier, adapted to noise using cepstral mean and variance normalisation, below 18dB SNR.
引用
收藏
页码:75 / 79
页数:5
相关论文
共 50 条
  • [21] Robust PCA for high-dimensional data
    Hubert, M
    Rousseeuw, PJ
    Verboven, S
    [J]. DEVELOPMENTS IN ROBUST STATISTICS, 2003, : 169 - 179
  • [22] On robust regression with high-dimensional predictors
    El Karoui, Noureddine
    Bean, Derek
    Bickel, Peter J.
    Lim, Chinghway
    Yu, Bin
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (36) : 14557 - 14562
  • [23] Robust transfer learning for high-dimensional quantile regression model with linear constraints
    Cao, Longjie
    Song, Yunquan
    [J]. APPLIED INTELLIGENCE, 2024, 54 (02) : 1263 - 1274
  • [24] General foundations of high-dimensional model representations
    Rabitz, H
    Alis, ÖF
    [J]. JOURNAL OF MATHEMATICAL CHEMISTRY, 1999, 25 (2-3) : 197 - 233
  • [25] Two representations of a high-dimensional perceptual space
    Victor, Jonathan D.
    Rizvi, Syed M.
    Conte, Mary M.
    [J]. VISION RESEARCH, 2017, 137 : 1 - 23
  • [26] Element similarity in high-dimensional materials representations
    Onwuli, Anthony
    Hegde, Ashish V.
    Nguyen, Kevin V. T.
    Butler, Keith T.
    Walsh, Aron
    [J]. DIGITAL DISCOVERY, 2023, 2 (05): : 1558 - 1564
  • [27] LEARNING NOISE-INVARIANT REPRESENTATIONS FOR ROBUST SPEECH RECOGNITION
    Liang, Davis
    Huang, Zhiheng
    Lipton, Zachary C.
    [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 56 - 63
  • [28] Feature extraction based on auditory representations for robust speech recognition
    Kim, DS
    Lee, SY
    Kil, RM
    Zhu, XL
    [J]. ELECTRONICS LETTERS, 1997, 33 (01) : 15 - 16
  • [29] Boosting for high-dimensional linear models
    Buhlmann, Peter
    [J]. ANNALS OF STATISTICS, 2006, 34 (02): : 559 - 583
  • [30] High-Dimensional Sparse Linear Bandits
    Hao, Botao
    Lattimore, Tor
    Wang, Mengdi
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33