Speech Recognition with Phonological Features: Some issues to attend

被引:0
|
作者
Stouten, Frederik [1 ]
Martens, Jean-Pierre [1 ]
机构
[1] Univ Ghent, ELIS, B-9000 Ghent, Belgium
关键词
speech recognition; phonological features; decorrelation; relevancy;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is often argued that acoustic-phonetic or articulatory features could be beneficial to automatic speech recognition because they provide a convenient interface between the acoustic and the linguistic level. Former research has shown that a combination of acoustic and articulatory information can lead to improved ASR. However there exists no purely articulatory driven ASR system that outperforms state-of-the-art systems driven by acoustic features. In this paper we propose a novel method for improving ASR on the basis of articulatory features. It is designed to take account of (1) the correlations between articulatory features and (2) the fact that not all articulatory features are relevant for the description of a certain phonetic unit. We also investigate to what extend an acoustic and an articulatory feature driven system make different errors.
引用
收藏
页码:357 / 360
页数:4
相关论文
共 50 条
  • [21] Distributed Neural Representations of Phonological Features during Speech Perception
    Arsenault, Jessica S.
    Buchsbaum, Bradley R.
    JOURNAL OF NEUROSCIENCE, 2015, 35 (02): : 634 - 642
  • [22] Automated Intelligibility Assessment of Pathological Speech Using Phonological Features
    Catherine Middag
    Jean-Pierre Martens
    Gwen Van Nuffelen
    Marc De Bodt
    EURASIP Journal on Advances in Signal Processing, 2009
  • [23] Combined spectral and speech features for pig speech recognition
    Wu, Xuan
    Zhou, Silong
    Chen, Mingwei
    Zhao, Yihang
    Wang, Yifei
    Zhao, Xianmeng
    Li, Danyang
    Pu, Haibo
    PLOS ONE, 2022, 17 (12):
  • [24] Topological invariants as speech features for automatic speech recognition
    Kacur, Juraj
    Chudy, Vladimir
    INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2014, 7 (04) : 235 - 244
  • [25] LISTEN, ATTEND AND SPELL: A NEURAL NETWORK FOR LARGE VOCABULARY CONVERSATIONAL SPEECH RECOGNITION
    Chan, William
    Jaitly, Navdeep
    Quoc Le
    Vinyals, Oriol
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 4960 - 4964
  • [26] On the Correlation and Transferability of Features between Automatic Speech Recognition and Speech Emotion Recognition
    Fayek, Haytham M.
    Lech, Margaret
    Cavedon, Lawrence
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3618 - 3622
  • [27] Articulatory Features for "Meeting" Speech Recognition
    Metze, Florian
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 581 - 584
  • [28] Selecting good speech features for recognition
    Lee, Y
    Hwang, KW
    ETRI JOURNAL, 1996, 18 (01) : 29 - 40
  • [29] New Features for Emotional Speech Recognition
    Palo, Hemanta Kumar
    Mohanty, Mihir Narayan
    Chandra, Mahesh
    2015 IEEE POWER, COMMUNICATION AND INFORMATION TECHNOLOGY CONFERENCE (PCITC-2015), 2015, : 424 - 429
  • [30] Sparse Representation Features for Speech Recognition
    Sainath, Tara N.
    Ramabhadran, Bhuvana
    Nahamoo, David
    Kanevsky, Dimitri
    Sethy, Abhinav
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2254 - 2257