O-1: Self-training with Oracle and 1-best Hypothesis

Authors
Baskar, Murali Karthick [1]
Rosenberg, Andrew [1]
Ramabhadran, Bhuvana [1]
Audhkhasi, Kartik [1]
Affiliations
[1] Google Inc, Mountain View, CA 94043 USA
Source
Keywords
Self-training; EMBR; O-1; ASR; speech recognition; discriminative training
DOI
10.21437/Interspeech.2023-2166
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
We introduce O-1, a new self-training objective to reduce training bias and unify training and evaluation metrics for speech recognition. O-1 is a faster variant of Expected Minimum Bayes Risk (EMBR) that boosts the oracle hypothesis and can accommodate both supervised and unsupervised data. We demonstrate the effectiveness of our approach in terms of recognition performance on the publicly available SpeechStew datasets and a large-scale in-house dataset. On SpeechStew, the O-1 objective closes the gap between the actual and oracle performance by 80% relative, compared to EMBR, which bridges the gap by 43% relative. O-1 achieves 13% to 25% relative improvement over EMBR on the various datasets that SpeechStew comprises, and a 12% relative gap reduction with respect to the oracle WER over EMBR training on the in-house dataset. Overall, O-1 results in a 9% relative improvement in WER over EMBR, thereby speaking to the scalability of the proposed objective for large-scale datasets.
Pages: 77-81
Number of pages: 5