On the Combination of Auditory and Modulation Frequency Channels for ASR applications

被引:0
|
作者
Valente, Fabio [1 ]
Hermansky, Hynek [1 ]
机构
[1] IDIAP Res Inst, Martigny, Switzerland
关键词
Modulation spectrum; Neural Network; LVCSR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the combination of evidence coming from different frequency channels obtained filtering the speech signal at different auditory and modulation frequencies. In our previous work [1], we showed that combination of classifiers trained on different ranges of modulation frequencies is more effective if performed in sequential (hierarchical) fashion. In this work we verity that combination of classifiers trained on different ranges of auditory frequencies is more effective if performed in parallel fashion. Furthermore we propose an architecture based on neural networks for combining evidence coming from different auditory-modulation frequency sub-bands that takes advantages of previous findings. This reduces the final WER by 6.2% (from 45.8% to 39.6%) w.r.t the single classifier approach in a LVCSR task.
引用
收藏
页码:2242 / +
页数:2
相关论文
共 50 条
  • [41] BILATERAL COLLICULAR INTERACTION: MODULATION OF AUDITORY SIGNAL PROCESSING IN FREQUENCY DOMAIN
    Cheng, L.
    Mei, H. -X.
    Tang, J.
    Fu, Z-Y
    Jen, P. H. -S.
    Chen, Q. -C.
    NEUROSCIENCE, 2013, 235 : 27 - 39
  • [42] Encoding of frequency-modulation (FM) rates in human auditory cortex
    Okamoto, Hidehiko
    Kakigi, Ryusuke
    SCIENTIFIC REPORTS, 2015, 5
  • [43] Neural mechanism of corticofugal modulation of frequency processing in bat auditory system
    Kashimori, Yoshiki
    Hirooka, Seiichi
    Fujita, Kazuhisa
    NEUROSCIENCE RESEARCH, 2007, 58 : S99 - S99
  • [44] Encoding of frequency-modulation (FM) rates in human auditory cortex
    Hidehiko Okamoto
    Ryusuke Kakigi
    Scientific Reports, 5
  • [45] HUMAN AUDITORY ASSESSMENT OF MODULATION FREQUENCY OF AN AMPLITUDE-MODULATED TONE
    ISHCHENKO, SM
    SOVIET PHYSICS ACOUSTICS-USSR, 1977, 23 (01): : 35 - 38
  • [46] INDEPENDENT STEREOSCOPIC CHANNELS FOR THE SPATIAL-FREQUENCY OF DISPARITY MODULATION
    GANZ, L
    SCHUMER, RA
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1979, : 174 - 174
  • [47] Separable developmental trajectories for the abilities to detect auditory amplitude and frequency modulation
    Banai, Karen
    Sabin, Andrew T.
    Wright, Beverly A.
    HEARING RESEARCH, 2011, 280 (1-2) : 219 - 227
  • [48] Auditory processing of real and illusory changes in frequency modulation (FM) phase
    Carlyon, RP
    Micheyl, C
    Deeks, JM
    Moore, BCJ
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (06): : 3629 - 3639
  • [49] Frequency-specific modulation of population-level frequency tuning in human auditory cortex
    Hidehiko Okamoto
    Henning Stracke
    Pienie Zwitserlood
    Larry E Roberts
    Christo Pantev
    BMC Neuroscience, 10
  • [50] Frequency-specific modulation of population-level frequency tuning in human auditory cortex
    Okamoto, Hidehiko
    Stracke, Henning
    Zwitserlood, Pienie
    Roberts, Larry E.
    Pantev, Christo
    BMC NEUROSCIENCE, 2009, 10