Locality-Sensitive Hashing for Long Context Neural Machine Translation

被引:0
|
作者
Petrick, Frithjof [1 ]
Rosendahl, Jan [1 ]
Herold, Christian [1 ]
Ney, Hermann [1 ]
机构
[1] Rhein Westfal TH Aachen, Comp Sci Dept, Human Language Technol & Pattern Recognit Grp, D-52056 Aachen, Germany
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
After its introduction, the Transformer architecture (Vaswani et al., 2017) quickly became the gold standard for the task of neural machine translation. A major advantage of the Transformer compared to previous architectures is the faster training speed achieved by complete parallelization across timesteps due to the use of attention over recurrent layers. However, this also leads to one of the biggest problems of the Transformer, namely the quadratic time and memory complexity with respect to the input length. In this work we adapt the locality-sensitive hashing approach of Kitaev et al. (2020) to self-attention in the Transformer, we extended it to cross-attention and apply this memory efficient framework to sentence- and document-level machine translation. Our experiments show that the LSH attention scheme for sentence-level comes at the cost of slightly reduced translation quality. For document-level NMT we are able to include much bigger context sizes than what is possible with the baseline Transformer. However, more context does neither improve translation quality nor improve scores on targeted test suites.
引用
收藏
页码:32 / 42
页数:11
相关论文
共 50 条
  • [21] A Machine Learning approach for anomaly detection on the Internet of Things based on Locality-Sensitive Hashing
    Hernandez-Jaimes, Mireya Lucia
    Martinez-Cruz, Alfonso
    Ramirez-Gutierrez, Kelseyalejandra
    [J]. INTEGRATION-THE VLSI JOURNAL, 2024, 96
  • [22] Cross-media retrieval based on locality-sensitive hashing and neural network algorithms
    [J]. 2018, National University of Defense Technology (40):
  • [23] A Scalable ECG Identification System Based on Locality-Sensitive Hashing
    Chu, Hui-Yu
    Lin, Tzu-Yun
    Lee, Song-Hong
    Chiu, Jui-Kun
    Nien, Cing-Ping
    Wu, Shun-Chi
    [J]. 2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [24] Similar Pair Identification using Locality-Sensitive Hashing Technique
    Lee, Kyung Mi
    Lee, Keon Myung
    [J]. 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 2117 - 2119
  • [25] Digital Watermarks for Videos Based on a Locality-Sensitive Hashing Algorithm
    Sun, Yajuan
    Srivastava, Gautam
    [J]. MOBILE NETWORKS & APPLICATIONS, 2023, 28 (05): : 1724 - 1737
  • [26] Frequent-Itemset Mining Using Locality-Sensitive Hashing
    Bera, Debajyoti
    Pratap, Rameshwar
    [J]. COMPUTING AND COMBINATORICS, COCOON 2016, 2016, 9797 : 143 - 155
  • [27] Fast Access for Star Catalog Based on Locality-Sensitive Hashing
    Zhu, Hailong
    Liang, Bin
    Zhang, Tao
    [J]. Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2018, 36 (05): : 988 - 994
  • [28] Locality-Sensitive Hashing for Finding Nearest Neighbors in Probability Distributions
    Tang, Yi-Kun
    Mao, Xian-Ling
    Hao, Yi-Jing
    Xu, Cheng
    Huang, Heyan
    [J]. SOCIAL MEDIA PROCESSING, SMP 2017, 2017, 774 : 3 - 15
  • [29] An improved method of locality-sensitive hashing for scalable instance matching
    Mehmet Aydar
    Serkan Ayvaz
    [J]. Knowledge and Information Systems, 2019, 58 : 275 - 294
  • [30] On the Problem of p1-1 in Locality-Sensitive Hashing
    Ahle, Thomas Dybdahl
    [J]. SIMILARITY SEARCH AND APPLICATIONS, SISAP 2020, 2020, 12440 : 85 - 93