Learning to Create and Reuse Words in Open-Vocabulary Neural Language Modeling

Cited by: 8
Authors
Kawakami, Kazuya [1]
Dyer, Chris [2]
Blunsom, Phil [1, 2]
Affiliations
[1] Univ Oxford, Dept Comp Sci, Oxford, England
[2] DeepMind, London, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
DOI
10.18653/v1/P17-1137
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Fixed-vocabulary language models fail to account for one of the most characteristic statistical facts of natural language: the frequent creation and reuse of new word types. Although character-level language models offer a partial solution in that they can create word types not attested in the training corpus, they do not capture the "bursty" distribution of such words. In this paper, we augment a hierarchical LSTM language model that generates sequences of word tokens character by character with a caching mechanism that learns to reuse previously generated words. To validate our model we construct a new open-vocabulary language modeling corpus (the Multilingual Wikipedia Corpus; MWC) from comparable Wikipedia articles in 7 typologically diverse languages and demonstrate the effectiveness of our model across this range of languages.
Pages: 1492-1502
Page count: 11