Word-level Speech Recognition with a Letter to Word Encoder

Cited by: 0
Authors
Collobert, Ronan [1]
Hannun, Awni [1]
Synnaeve, Gabriel [1]
Affiliations
[1] Facebook AI Res, Menlo Pk, CA USA
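Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract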
We propose a direct-to-word sequence model which uses a word network to learn word embeddings from letters. The word network can be integrated seamlessly with arbitrary sequence models including Connectionist Temporal Classification and encoder-decoder models with attention. We show our direct-to-word model can achieve word error rate gains over sub-word level models for speech recognition. We also show that our direct-to-word approach retains the ability to predict words not seen at training time without any retraining. Finally, we demonstrate that a word-level model can use a larger stride than a sub-word level model while maintaining accuracy. This makes the model more efficient both for training and inference.
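The letter-to-word idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The random letter vectors, the pair-mixing step, the max-pooling, and the names `word_embedding` and `word_posteriors` are all assumptions made for illustration. The sketch only shows why a model that builds word embeddings from spellings can score a word absent from the training lexicon without retraining.

```python
import math
import random

DIM = 8                                   # embedding size (arbitrary, for illustration)
SYMBOLS = "abcdefghijklmnopqrstuvwxyz'|"  # letters, apostrophe, "|" as end-of-word pad

# Assumption: random vectors stand in for letter embeddings learned in training.
_rng = random.Random(0)
LETTER_VECS = {s: [_rng.uniform(-1.0, 1.0) for _ in range(DIM)] for s in SYMBOLS}


def word_embedding(word):
    """Build a fixed-size word vector from a non-empty spelling.

    Each adjacent letter pair is mixed into one feature vector (a crude
    width-2 convolution), then features are max-pooled over positions.
    """
    padded = word.lower() + "|"
    features = [
        [a + 0.5 * b for a, b in zip(LETTER_VECS[x], LETTER_VECS[y])]
        for x, y in zip(padded, padded[1:])
    ]
    return [max(column) for column in zip(*features)]


def word_posteriors(hidden, lexicon):
    """Softmax over dot products between a hidden state and each word vector."""
    scores = [sum(h * e for h, e in zip(hidden, word_embedding(w))) for w in lexicon]
    peak = max(scores)                    # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return {w: x / total for w, x in zip(lexicon, exps)}


# Stand-in for one output frame of a sequence model (hypothetical values).
hidden = [_rng.uniform(-1.0, 1.0) for _ in range(DIM)]
train_lexicon = ["cat", "dog", "god"]
# An unseen word only needs its spelling; no parameters change.
test_lexicon = train_lexicon + ["catalog"]
print(word_posteriors(hidden, test_lexicon))
```

Because the output layer is computed from spellings rather than stored as one row per word, extending the lexicon amounts to calling `word_embedding` on the new entries.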
Pages: 11