On the Combination of Auditory and Modulation Frequency Channels for ASR applications

Cited by: 0
Authors
Valente, Fabio [1]
Hermansky, Hynek [1]
Affiliation
[1] IDIAP Res Inst, Martigny, Switzerland
Keywords
Modulation spectrum; Neural Network; LVCSR
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper investigates the combination of evidence coming from different frequency channels obtained by filtering the speech signal at different auditory and modulation frequencies. In our previous work [1], we showed that the combination of classifiers trained on different ranges of modulation frequencies is more effective if performed in a sequential (hierarchical) fashion. In this work we verify that the combination of classifiers trained on different ranges of auditory frequencies is more effective if performed in a parallel fashion. Furthermore, we propose an architecture based on neural networks for combining evidence coming from different auditory-modulation frequency sub-bands that takes advantage of these findings. This reduces the final WER by 6.2% absolute (from 45.8% to 39.6%) with respect to the single-classifier approach in an LVCSR task.
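The two combination schemes contrasted in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record gives no details of the networks, features, or channel ordering, so `toy_classifier` (a single linear layer followed by a softmax), all weight values, and the two-channel, two-class setup are hypothetical stand-ins chosen only to show the difference in topology.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of scores.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def toy_classifier(features, weights):
    # Hypothetical stand-in for a trained neural network:
    # one linear layer followed by a softmax over classes.
    scores = [sum(w * x for w, x in zip(row, features)) for row in weights]
    return softmax(scores)

def combine_parallel(channel_features, channel_weights, merger_weights):
    # Parallel scheme: every channel is classified independently,
    # then a merger classifier sees all channel posteriors at once.
    posteriors = []
    for feats, w in zip(channel_features, channel_weights):
        posteriors.extend(toy_classifier(feats, w))
    return toy_classifier(posteriors, merger_weights)

def combine_hierarchical(channel_features, stage_weights):
    # Sequential scheme: each stage sees its own channel features
    # concatenated with the posteriors produced by the previous stage.
    posteriors = []
    for feats, w in zip(channel_features, stage_weights):
        posteriors = toy_classifier(list(feats) + posteriors, w)
    return posteriors

# Assumed toy data: two channels, two features each, two classes.
channels = [[1.0, 0.0], [0.5, 0.5]]
w_channel = [[1.0, -1.0], [-1.0, 1.0]]
w_merger = [[1.0, 0.0, 1.0, 0.0], [0.0, 1.0, 0.0, 1.0]]
w_stage2 = [[1.0, -1.0, 1.0, 0.0], [-1.0, 1.0, 0.0, 1.0]]

parallel = combine_parallel(channels, [w_channel, w_channel], w_merger)
hierarchical = combine_hierarchical(channels, [w_channel, w_stage2])
print(parallel, hierarchical)
```

In the parallel scheme no channel sees another channel's output before the merger, whereas in the hierarchical scheme the order of the channels matters because each stage conditions on the previous one. The abstract reports that the former suits auditory-frequency channels and the latter suits modulation-frequency channels.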
Pages: 2242 / +
Page count: 2