Lattice-based lightly-supervised acoustic model training

Cited by: 3
Authors
Fainberg, Joachim [1]
Klejch, Ondrej [1]
Renals, Steve [1]
Bell, Peter [1]
Affiliations
[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh, Midlothian, Scotland
Source
Keywords
Automatic speech recognition; lightly supervised training; LF-MMI; broadcast media; TRANSCRIPTION; SELECTION;
DOI
10.21437/Interspeech.2019-2533
CLC classification number
R36 [Pathology]; R76 [Otorhinolaryngology];
Subject classification code
100104 ; 100213 ;
Abstract
In the broadcast domain, there is an abundance of related text data and partial transcriptions, such as closed captions and subtitles. This text data can be used for lightly supervised training, in which text matching the audio is selected using an existing speech recognition model. Current approaches to light supervision typically filter the data based on matching error rates between the transcriptions and biased decoding hypotheses. In contrast, semi-supervised training does not require matching text data, instead generating a hypothesis using a background language model. State-of-the-art semi-supervised training uses lattice-based supervision with the lattice-free MMI (LF-MMI) objective function. We propose a technique to combine inaccurate transcriptions with the lattices generated for semi-supervised training, thus preserving uncertainty in the lattice where appropriate. We demonstrate that this combined approach reduces the expected error rates over the lattices, and reduces the word error rate (WER) on a broadcast task.
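The conventional filtering step that the abstract contrasts with (selecting data by the matching error rate between a transcription and a biased decoding hypothesis) can be illustrated with a minimal sketch. This is not the paper's lattice-based method. The function names, the lowercased whitespace tokenisation, and the 0.2 threshold are all assumptions made for illustration.

```python
from typing import List, Tuple


def word_edit_distance(ref: List[str], hyp: List[str]) -> int:
    """Word-level Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete a reference word
                            curr[j - 1] + 1,      # insert a hypothesis word
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def matching_error_rate(caption: str, hypothesis: str) -> float:
    """Edit distance normalised by caption length (hypothetical helper)."""
    ref = caption.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        # Empty caption: perfect match only if the hypothesis is empty too.
        return 0.0 if not hyp else 1.0
    return word_edit_distance(ref, hyp) / len(ref)


def select_segments(pairs: List[Tuple[str, str]],
                    max_mer: float = 0.2) -> List[Tuple[str, str]]:
    """Keep (caption, hypothesis) pairs whose matching error rate is low.

    The 0.2 default is an illustrative assumption, not a value from the paper.
    """
    return [(c, h) for c, h in pairs if matching_error_rate(c, h) <= max_mer]
```

A hard threshold like this discards every segment whose caption disagrees too much with the hypothesis, which is the loss of information that the proposed combination with lattices is meant to avoid.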
Pages: 1596-1600
Number of pages: 5