Lattice-based lightly-supervised acoustic model training

Cited by: 3
Authors
Fainberg, Joachim [1]
Klejch, Ondrej [1]
Renals, Steve [1]
Bell, Peter [1]
Affiliations
[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh, Midlothian, Scotland
Source
Keywords
Automatic speech recognition; lightly supervised training; LF-MMI; broadcast media; TRANSCRIPTION; SELECTION;
DOI
10.21437/Interspeech.2019-2533
Chinese Library Classification
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification codes
100104 ; 100213 ;
Abstract
In the broadcast domain there is an abundance of related text data and partial transcriptions, such as closed captions and subtitles. This text data can be used for lightly supervised training, in which text matching the audio is selected using an existing speech recognition model. Current approaches to light supervision typically filter the data based on matching error rates between the transcriptions and biased decoding hypotheses. In contrast, semi-supervised training does not require matching text data, instead generating a hypothesis using a background language model. State-of-the-art semi-supervised training uses lattice-based supervision with the lattice-free MMI (LF-MMI) objective function. We propose a technique to combine inaccurate transcriptions with the lattices generated for semi-supervised training, thus preserving uncertainty in the lattice where appropriate. We demonstrate that this combined approach reduces the expected error rates over the lattices, and reduces the word error rate (WER) on a broadcast task.
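The conventional filtering step that the abstract contrasts against can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the lowercase whitespace tokenisation, and the 0.2 threshold are all assumptions made for illustration. Each segment pairs a caption-style transcript with a biased decoding hypothesis, and the segment is kept only if the word-level error rate between the two is low enough.

```python
def word_errors(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def matching_error_rate(transcript, hypothesis):
    """Error rate of the hypothesis measured against the transcript."""
    ref = transcript.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        return 0.0 if not hyp else 1.0
    return word_errors(ref, hyp) / len(ref)


def select_segments(segments, max_rate=0.2):
    """Keep (transcript, hypothesis) pairs at or below max_rate (hypothetical threshold)."""
    return [s for s in segments if matching_error_rate(*s) <= max_rate]
```

A hard threshold like this discards every segment whose transcript disagrees with the hypothesis too often, which is the loss of data the lattice-based combination described in the abstract aims to avoid.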
Pages: 1596 - 1600
Page count: 5
Related papers
50 in total
  • [31] Comparison of Lattice-Free and Lattice-Based Sequence Discriminative Training Criteria for LVCSR
    Michel, Wilfried
    Schlueter, Ralf
    Ney, Hermann
    INTERSPEECH 2019, 2019, : 1601 - 1605
  • [32] Lattice-based Cryptography
    Mohsen, Ayman Wagih
    Bahaa-Eldin, Ayman M.
    Sobh, Mohamed Ali
    2017 12TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2017, : 462 - 467
  • [33] Lattice-based sums
    El-Zekey, Moataz
    Medina, Jesus
    Mesiar, Radko
    INFORMATION SCIENCES, 2013, 223 : 270 - 284
  • [34] A lattice-based model of the kinetics of twin boundary motion
    Abeyaratne, R
    Vedantam, S
    JOURNAL OF THE MECHANICS AND PHYSICS OF SOLIDS, 2003, 51 (09) : 1675 - 1700
  • [35] Lattice-Based IBE with Equality Test in Standard Model
    Dung Hoang Duong
    Le, Huy Quoc
    Roy, Partha Sarathi
    Susilo, Willy
    PROVABLE SECURITY, PROVSEC 2019, 2019, 11821 : 19 - 40
  • [36] Lattice-based signcryption with equality test in standard model
    Le, Huy Quoc
    Duong, Dung Hoang
    Roy, Partha Sarathi
    Susilo, Willy
    Fukushima, Kazuhide
    Kiyomoto, Shinsaku
    COMPUTER STANDARDS & INTERFACES, 2021, 76 (76)
  • [37] Lattice-based signcryption
    Li, Fagen
    Bin Muhaya, Fahad T.
    Khan, Muhammad Khurram
    Takagi, Tsuyoshi
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2013, 25 (14) : 2112 - 2122
  • [38] Unsupervised Lattice-based Acoustic Model Adaptation for Speaker-Dependent Conversational Telephone Speech Transcription
    Thambiratnam, K.
    Seide, E.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1567 - 1570
  • [39] Lattice-based linearly homomorphic signatures in the standard model
    Chen, Wenbin
    Lei, Hao
    Qi, Ke
    THEORETICAL COMPUTER SCIENCE, 2016, 634 : 47 - 54
  • [40] A Lattice-based Access Control Model for Social Networks
    Zhang, Yingjun
    Chen, Kai
    Liu, Yuling
    Lian, Yifeng
    2016 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY PROCEEDINGS - CYBERC 2016, 2016, : 54 - 61