Improvements in recognition of conversational telephone speech

被引:6
|
作者
Peskin, B [1 ]
Newman, M [1 ]
McAllaster, D [1 ]
Nagesha, V [1 ]
Richards, H [1 ]
Wegmann, S [1 ]
Hunt, M [1 ]
Gillick, L [1 ]
机构
[1] Dragon Syst Inc, Newton, MA 02460 USA
关键词
D O I
10.1109/ICASSP.1999.758060
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes recent changes in Dragon's speech recognition system which have markedly improved performance on conversational telephone speech. Key changes include: the conversion to modified PLP-based cepstra from mel-cepstra; the replacement of our usual IMELDA transformation by a new transform using "semi-tied covariance"; a new multi-pass adaptation protocol; probabilities on alternate pronunciations in the lexicon; the addition of word-boundary tags in our acoustic models and the redistribution of model parameters to build fewer output distributions but with more mixture components per model.
引用
收藏
页码:53 / 56
页数:4
相关论文
共 50 条
  • [41] ACOUSTIC PROCESSOR IN A CONVERSATIONAL SPEECH RECOGNITION SYSTEM
    NAKATSU, R
    KOHDA, M
    [J]. REVIEW OF THE ELECTRICAL COMMUNICATIONS LABORATORIES, 1978, 26 (11-1): : 1486 - 1504
  • [42] Toward Human Parity in Conversational Speech Recognition
    Xiong, Wayne
    Droppo, Jasha
    Huang, Xuedong
    Seide, Frank
    Seltzer, Michael L.
    Stolcke, Andreas
    Yu, Dong
    Zweig, Geoffrey
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (12) : 2410 - 2423
  • [43] ROLE ANNOTATED SPEECH RECOGNITION FOR CONVERSATIONAL INTERACTIONS
    Flemotomos, Nikolaos
    Chen, Zhuohao
    Atkins, David C.
    Narayanan, Shrikanth
    [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 1036 - 1043
  • [44] Attention Shift Decoding for Conversational Speech Recognition
    Kumaran, Raghunandan
    Bilmes, Jeff
    Kirchhoff, Katrin
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2908 - 2911
  • [45] Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system
    Gales, MJF
    Jia, B
    Liu, X
    Sim, KC
    Woodland, P
    Yu, K
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 841 - 844
  • [46] Robust speech detection method for telephone speech recognition system
    Kuroiwa, S
    Naito, M
    Yamamoto, S
    Higuchi, N
    [J]. SPEECH COMMUNICATION, 1999, 27 (02) : 135 - 148
  • [47] Pronunciation change in conversational speech and its implications for automatic speech recognition
    Saraçlar, M
    Khudanpur, S
    [J]. COMPUTER SPEECH AND LANGUAGE, 2004, 18 (04): : 375 - 395
  • [48] Estimation of channel bias for telephone speech recognition
    Chien, JT
    Wang, HC
    Lee, LM
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1840 - 1843
  • [49] Multilingual phone recognition of spontaneous telephone speech
    Corredor-Ardoy, C
    Lamel, L
    Adda-Decker, M
    Gauvain, JL
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 413 - 416
  • [50] Deconvolution of telephone line effects for speech recognition
    Mokbel, C
    Jouvet, D
    Monne, J
    [J]. SPEECH COMMUNICATION, 1996, 19 (03) : 185 - 196