A NEW NEURAL-NETWORK FOR ARTICULATORY SPEECH RECOGNITION AND ITS APPLICATION TO VOWEL IDENTIFICATION

被引:7
|
作者
ZACKS, J [1 ]
THOMAS, TR [1 ]
机构
[1] LOS ALAMOS NATL LAB,LOS ALAMOS,NM 87544
来源
COMPUTER SPEECH AND LANGUAGE | 1994年 / 8卷 / 03期
关键词
D O I
10.1006/csla.1994.1009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A system for automatic speech recognition (ASR) based on a new neural network design and a theory of articulatory phonology is presented. This system operates in two stages. In the first, speech acoustics are mapped by a neural network onto the movements of the tongue and lips that produced those acoustics (the neural networks are trained on X-ray microbeam recordings of actual articulatory movements); in the second stage, gestures are recovered from those movements. The neural network is built around a new objective function, Correlational + Scaling Error (COSE). When compared to a traditional neural network system, the COSE system trains faster, produces output which better represents the shape of the articulatory movements, and yields higher recognition rates for vowel gestures. After training on two speakers, recognition rates up to 96% for tokens from the training set and 87% for tokens spoken by a novel speaker were achieved.
引用
收藏
页码:189 / 209
页数:21
相关论文
共 50 条
  • [1] INTELLIGENT JUDGE NEURAL-NETWORK FOR SPEECH RECOGNITION
    KIM, DS
    LEE, SY
    [J]. NEURAL PROCESSING LETTERS, 1994, 1 (01) : 17 - 20
  • [2] A FUZZY NEURAL-NETWORK AND ITS APPLICATION TO PATTERN-RECOGNITION
    KWAN, HK
    CAI, YL
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1994, 2 (03) : 185 - 193
  • [3] Morphological normalization of vowel images for articulatory speech recognition
    Wei, Jianguo
    Zhang, Jingshu
    Ji, Yan
    Fang, Qiang
    Lu, Wenhuan
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 41 : 352 - 360
  • [4] Fuzzy-rough neural network and its application to vowel recognition
    College of Electrical and Information Engineering, Hu'nan University, Changsha 410082, China
    不详
    [J]. Kongzhi yu Juece Control Decis, 2006, 2 (221-224):
  • [5] Dynamic adaptive fuzzy neural-network identification and its application
    Pei, Z
    Qin, K
    Xu, Y
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4974 - 4979
  • [6] A NOVEL FEATURE RECOGNITION NEURAL-NETWORK AND ITS APPLICATION TO CHARACTER-RECOGNITION
    HUSSAIN, B
    KABUKA, MR
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1994, 16 (01) : 98 - 106
  • [7] Articulatory feature extraction for speech recognition using neural network
    Huda, Mohammad Nurul
    Hasan, Mohammad Mahedi
    Hassan, Foyzul
    Kotwal, Mohammed Rokibul Alam
    Muhammad, Ghulam
    Rahman, Chowdhury Mofizur
    [J]. International Review on Computers and Software, 2011, 6 (01) : 25 - 31
  • [8] Effect of Articulatory Δ and ΔΔ Parameters on Multilayer Neural Network based Speech Recognition
    Banik, Manoj
    Kotwal, Mohammed Rokibul Alam
    Hassan, Foyzul
    Islam, Gazi Md. Moshfiqul
    Rahman, Sharif Mohammad Musfiqur
    Hasan, Mohammad Mahedi
    Muhammad, Ghulam
    Huda, Mohammad Nurul
    [J]. PROCEEDINGS OF THE 2010 IEEE ASIA PACIFIC CONFERENCE ON CIRCUIT AND SYSTEM (APCCAS), 2010, : 624 - 627
  • [9] NEURAL-NETWORK APPLICATION TO WELDING DEFECT IDENTIFICATION
    ONDA, H
    NISHINAGA, Y
    ONO, K
    [J]. FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 1993, 29 (03): : 271 - 277
  • [10] A new fuzzy clustering neural network and its application to speech signal system identification
    Liu, Yu-Hong
    Liu, Qiao
    Ren, Qiang
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2005, 18 (05): : 522 - 527