The potential role of speech production models in automatic speech recognition

被引:27
|
作者
Rose, RC
Schroeter, J
Sondhi, MM
机构
[1] AT and T Bell Laboratories, Murray Hill
来源
关键词
D O I
10.1121/1.414679
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates the issues that are associated with applying speech production models to automatic speech recognition (ASR). Here the applicability of articulatory representations to ASR is considered independently of the role of articulatory representations in speech perception. While the question of whether it is necessary or even possible for human listeners to recover the state of the articulators during the process of perceiving speech is an important one, it is not considered here. Hence, the authors refrain from posing completely new paradigms for ASR which more closely parallel the relationship between speech production and human speech understanding. Instead, work aimed at integrating speech production models into existing ASR formalisms is described. (C) 1996 Acoustical Society of America.
引用
收藏
页码:1699 / 1709
页数:11
相关论文
共 50 条
  • [1] Critique: The potential role of speech production models in automatic speech recognition
    Moore, RK
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 99 (03): : 1710 - 1713
  • [2] Speech production parameters for automatic speech recognition
    McGowan, RS
    Faber, A
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 101 (01): : 28 - 28
  • [3] Speech production knowledge in automatic speech recognition
    King, Simon
    Frankel, Joe
    Livescu, Karen
    McDermott, Erik
    Richmond, Korin
    Wester, Mirjam
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (02): : 723 - 742
  • [4] Production models as a structural basis for automatic speech recognition
    Deng, L
    Ramsay, G
    Sun, D
    [J]. SPEECH COMMUNICATION, 1997, 22 (2-3) : 93 - 111
  • [5] Graphical models and automatic speech recognition
    Bilmes, JA
    [J]. MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 191 - 245
  • [6] On the predictive connectionist models for automatic speech recognition
    Petek, B
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3442 - 3445
  • [7] A Survey of Multilingual Models for Automatic Speech Recognition
    Yadav, Hemant
    Sitaram, Sunayana
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5071 - 5079
  • [8] AUTOMATIC SPEECH RECOGNITION USING PSYCHOACOUSTIC MODELS
    ZWICKER, E
    TERHARDT, E
    PAULUS, E
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (02): : 487 - 498
  • [9] Canonical State Models for Automatic Speech Recognition
    Gales, M. J. F.
    Yu, K.
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 58 - 61
  • [10] GEOGRAPHIC LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION
    Xiao, Xiaoqiang
    Chen, Hong
    Zylak, Mark
    Sosa, Daniela
    Desu, Suma
    Krishnamoorthy, Mahesh
    Liu, Daben
    Paulik, Matthias
    Zhang, Yuchen
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6124 - 6128