Speech Recognition with Phonological Features: Some issues to attend

Cited by: 0
Authors
Stouten, Frederik [1]
Martens, Jean-Pierre [1]
Affiliations
[1] Univ Ghent, ELIS, B-9000 Ghent, Belgium
Keywords
speech recognition; phonological features; decorrelation; relevancy
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
It is often argued that acoustic-phonetic or articulatory features could be beneficial to automatic speech recognition (ASR) because they provide a convenient interface between the acoustic and the linguistic level. Previous research has shown that a combination of acoustic and articulatory information can lead to improved ASR. However, no purely articulatory-driven ASR system exists that outperforms state-of-the-art systems driven by acoustic features. In this paper we propose a novel method for improving ASR on the basis of articulatory features. It is designed to take account of (1) the correlations between articulatory features and (2) the fact that not all articulatory features are relevant for the description of a certain phonetic unit. We also investigate to what extent an acoustic and an articulatory feature-driven system make different errors.
Pages: 357-360
Number of pages: 4