Speech Recognition with Phonological Features: Some issues to attend

Cited by: 0
Authors
Stouten, Frederik [1 ]
Martens, Jean-Pierre [1 ]
Affiliations
[1] Univ Ghent, ELIS, B-9000 Ghent, Belgium
Keywords
speech recognition; phonological features; decorrelation; relevancy
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
It is often argued that acoustic-phonetic or articulatory features could be beneficial to automatic speech recognition (ASR) because they provide a convenient interface between the acoustic and the linguistic level. Previous research has shown that a combination of acoustic and articulatory information can lead to improved ASR. However, there exists no purely articulatory-driven ASR system that outperforms state-of-the-art systems driven by acoustic features. In this paper we propose a novel method for improving ASR on the basis of articulatory features. It is designed to take account of (1) the correlations between articulatory features and (2) the fact that not all articulatory features are relevant for the description of a certain phonetic unit. We also investigate to what extent an acoustic and an articulatory feature-driven system make different errors.
Pages: 357-360
Number of pages: 4