Word-level Speech Recognition with a Letter to Word Encoder

被引:0
|
作者
Collobert, Ronan [1 ]
Hannun, Awni [1 ]
Synnaeve, Gabriel [1 ]
机构
[1] Facebook AI Res, Menlo Pk, CA USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a direct-to-word sequence model which uses a word network to learn word embeddings from letters. The word network can be integrated seamlessly with arbitrary sequence models including Connectionist Temporal Classification and encoder-decoder models with attention. We show our direct-to-word model can achieve word error rate gains over sub-word level models for speech recognition. We also show that our direct-to-word approach retains the ability to predict words not seen at training time without any retraining. Finally, we demonstrate that a word-level model can use a larger stride than a sub-word level model while maintaining accuracy. This makes the model more efficient both for training and inference.
引用
下载
收藏
页数:11
相关论文
共 50 条
  • [1] Word-level Speech Recognition with a Letter to Word Encoder
    Collobert, Ronan
    Hannun, Awni
    Synnaeve, Gabriel
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [2] Affinity Maturation of Homophones in Word-Level Speech Recognition
    Ghosh, P.
    Chingtham, T. S.
    Ghose, M. K.
    RECENT DEVELOPMENTS IN MACHINE LEARNING AND DATA ANALYTICS, 2019, 740 : 137 - 142
  • [3] WORD-LEVEL TONE MODELING FOR MANDARIN SPEECH RECOGNITION
    Lei, Xin
    Ostendorf, Mari
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 665 - +
  • [4] HOLISM REVISITED - EVIDENCE FOR PARALLEL INDEPENDENT WORD-LEVEL AND LETTER-LEVEL PROCESSORS DURING WORD RECOGNITION
    ALLEN, PA
    EMERSON, PL
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1991, 17 (02) : 489 - 511
  • [5] WORD-LEVEL RECOGNITION OF CURSIVE SCRIPT
    FARAG, RFH
    IEEE TRANSACTIONS ON COMPUTERS, 1979, 28 (02) : 172 - 175
  • [6] EFFECTS OF WORD-LEVEL AND SENTENCE-LEVEL CONTEXTS UPON WORD RECOGNITION
    COLOMBO, L
    WILLIAMS, J
    MEMORY & COGNITION, 1990, 18 (02) : 153 - 163
  • [7] An analytical handwritten word recognition system with word-level discriminant training
    Tay, YH
    Lallican, PM
    Khalid, M
    Knerr, S
    Viard-Gaudin, C
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 726 - 730
  • [8] Word-Level Speech Dataset Creation for Sourashtra and Recognition System Using Kaldi
    Vancha, Punitha
    Nagarajan, Harshitha
    Inakollu, Vishnu Sai
    Gupta, Deepa
    Vekkot, Susmitha
    2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [9] Preserving Word-Level Emphasis in Speech-to-Speech Translation
    Quoc Truong Do
    Toda, Tomoki
    Neubig, Graham
    Sakti, Sakriani
    Nakamura, Satoshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (03) : 544 - 556
  • [10] Modeling word-level rate-of-speech variation in large vocabulary conversational speech recognition
    Zheng, J
    Franco, H
    Stolcke, A
    SPEECH COMMUNICATION, 2003, 41 (2-3) : 273 - 285