Fertility channel model for post-correction of continuous speech recognition

被引:0
|
作者
Ringger, EK
Allen, JF
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We have implemented a post-processor called SPEECHPP to comet word-level errors committed by an arbitrary speech recognizer. Ap plying a noisy-channelmodel, SPEECHPP uses a Viterbi beam-search that employs language and channel models. Previous work demonstrated that a simple word-for-word channel model was sufficient to yield substantial increases in word accuracy. This paper demonstrates that some improvements in word accuracy result from augmenting the channel model with an account of word fertility in the channel. This work further demonstrates that a modem continuous speech recognizer can be used in ''black-box'' fashion for robustly recognizing speech for which the recognizer was not originally trained. This work also demonstrates that in the case where the recognizer can be tuned to the new task, environment or speaker, the post-processor can also contribute to performance improvements.
引用
收藏
页码:897 / 900
页数:4
相关论文
共 50 条
  • [1] Tensor Completion Post-Correction
    Fanaee-T, Hadi
    ADVANCES IN INTELLIGENT DATA ANALYSIS XX, IDA 2022, 2022, 13205 : 89 - 101
  • [2] Error correction via a post-processor for continuous speech recognition
    Ringger, EK
    Allen, JF
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 427 - 430
  • [3] An improved noisy channel model for speech recognition error correction
    Li, Baoxiang
    Liu, Gang
    Guo, Jun
    Lu, Yueming
    International Journal of Advancements in Computing Technology, 2012, 4 (12) : 110 - 118
  • [4] Optical character recognition with neural networks and post-correction with finite state methods
    Drobac, Senka
    Linden, Krister
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2020, 23 (04) : 279 - 295
  • [5] Optical character recognition with neural networks and post-correction with finite state methods
    Senka Drobac
    Krister Lindén
    International Journal on Document Analysis and Recognition (IJDAR), 2020, 23 : 279 - 295
  • [6] SIMPLE POST-CORRECTION FOR NONRESTORING DIVISION
    BASU, D
    JAYASHREE, T
    IEEE TRANSACTIONS ON COMPUTERS, 1975, 24 (10) : 1019 - 1020
  • [7] Optimizing dynamic post-correction of AD converters
    Lundin, H
    Skoglund, M
    Händel, P
    IMTC/O3: PROCEEDINGS OF THE 20TH IEEE INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, VOLS 1 AND 2, 2003, : 1206 - 1211
  • [8] A time continuous model for speech recognition
    Euler, S
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 889 - 892
  • [9] Crossmodal Error Correction of Continuous Handwriting Recognition by Speech
    Ao, Xiang
    Wang, Xugang
    Tian, Feng
    Dai, Guozhong
    Wang, Hongan
    2007 INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2007, : 243 - 250
  • [10] Toward the optimized crowdsourcing strategy for OCR post-correction
    Suissa, Omri
    Elmalech, Avshalom
    Zhitomirsky-Geffet, Maayan
    ASLIB JOURNAL OF INFORMATION MANAGEMENT, 2019, 72 (02) : 179 - 197