O-1: Self-training with Oracle and 1-best Hypothesis

Cited by: 0
Authors
Baskar, Murali Karthick [1 ]
Rosenberg, Andrew [1 ]
Ramabhadran, Bhuvana [1 ]
Audhkhasi, Kartik [1 ]
Affiliations
[1] Google Inc, Mountain View, CA 94043 USA
Source
Keywords
Self-training; EMBR; O-1; ASR; speech recognition; discriminative training
DOI
10.21437/Interspeech.2023-2166
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
We introduce O-1, a new self-training objective that reduces training bias and unifies training and evaluation metrics for speech recognition. O-1 is a faster variant of Expected Minimum Bayes Risk (EMBR) that boosts the oracle hypothesis and can accommodate both supervised and unsupervised data. We demonstrate the effectiveness of our approach in terms of recognition performance on the publicly available SpeechStew datasets and a large-scale in-house dataset. On SpeechStew, the O-1 objective closes the gap between actual and oracle performance by 80% relative, whereas EMBR bridges the gap by 43% relative. O-1 achieves 13% to 25% relative improvement over EMBR on the various datasets that SpeechStew comprises, and a 12% relative gap reduction with respect to the oracle WER over EMBR training on the in-house dataset. Overall, O-1 yields a 9% relative improvement in WER over EMBR, which speaks to the scalability of the proposed objective for large-scale datasets.
Pages: 77-81
Page count: 5
Related papers
50 in total
  • [21] Crystal structure of cyclo-bis(mu 4-2,2-diallylmalonato-kappa(6) O-1, O-3 : O-3 : O-1 ', O-3 ': O-1 ') tetrakis(triphenylphosphane-kappa P) tetrasilver(I)
    Frenzel, Peter
    Jakob, Alexander
    Schaarschmidt, Dieter
    Rueffer, Tobias
    Lang, Heinrich
    ACTA CRYSTALLOGRAPHICA SECTION E-CRYSTALLOGRAPHIC COMMUNICATIONS, 2014, 70 : 174 - +
  • [22] PHASE AND T-STRUCTURE OF EXCHANGE AMPLITUDES - APPLICATION TO O-1/2+-]O-1/2+, O-3/2+ REACTIONS
    SHTOKHAMER, R
    BERLAD, G
    EILAM, G
    NUCLEAR PHYSICS B, 1971, B 29 (01) : 1 - +
  • [23] THE EFFECTS OF O-1(2) AND PEROXIDES ON BACTERIAL-DNA
    HOMMA, S
    HORIUCHI, H
    WAKAYAMA, Y
    TAKAGI, M
    YANO, K
    JOURNAL OF RADIATION RESEARCH, 1984, 25 (01) : 62 - 62
  • [24] THE EFFECT OF O-1(2) ON ESCHERICHIA-COLI PLASMIDS
    TAKAGI, M
    HORIUCHI, H
    HOMMA, S
    YANO, K
    JOURNAL OF RADIATION RESEARCH, 1982, 23 (01) : 75 - 76
  • [25] LIFE SUPPORTING 1ST AID (LSFA) AND INFANT CPR (ICPR) SELF-TRAINING IN CHILDREN
    BIRCHER, N
    SAFAR, P
    CRITICAL CARE MEDICINE, 1983, 11 (03) : 251 - 251
  • [26] Conditional use of Word Lattices, Confusion Networks and 1-best string hypotheses in a Sequential Interpretation Strategy
    Minescu, Bogdan
    Damnati, Geraldine
    Bechet, Frederic
    De Mori, Renato
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1945 - +
  • [27] O-1(+)-]O+ ELECTRIC MONOPOLE TRANSITION IN CD-112
    GIANNATIEMPO, A
    LIBERATI, G
    SONA, P
    ZEITSCHRIFT FUR PHYSIK A-HADRONS AND NUCLEI, 1979, 290 (04) : 411 - 414
  • [28] O-1 QUINTET AND TRIPLET TERMS BELOW THE IONIZATION LIMIT
    ERIKSSON, KBS
    ISBERG, HBS
    ARKIV FOR FYSIK, 1963, 24 (06) : 549 - +
  • [29] (mu(2)-2-Methoxyethanol-kappa(3) O-1 : O-1,O-3)(2-methoxyethanol-kappa O-1) tris(mu(2)- 3,4,5,6-tetrafluoro-o-phenylene-kappa(2) C-1 :C-2) trimercury(II)
    Castaneda, Raul
    Draguta, Sergiu
    Yakovenko, Andrey
    Fonari, Marina
    Timofeeva, Tatiana
    ACTA CRYSTALLOGRAPHICA SECTION E-CRYSTALLOGRAPHIC COMMUNICATIONS, 2014, 70 : M164 - +
  • [30] Electronic substituent effects in quenching of O-1(2) by diaryl tellurides
    Serguievski, P
    Detty, MR
    ORGANOMETALLICS, 1997, 16 (20) : 4386 - 4391