On Approximately Searching for Similar Word Embeddings

Cited by: 0
Authors
Sugawara, Kohei [1]
Kobayashi, Hayato [1]
Iwasaki, Masajiro [1]
Affiliations
[1] Yahoo Japan Corp, Chiyoda Ku, 1-3 Kioicho, Tokyo 1028282, Japan
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We discuss approximate similarity search for word embeddings, an operation that approximately finds the embeddings closest to a given vector. We compared several metric-based search algorithms that use hash-, tree-, and graph-based indexing from several perspectives. Our experimental results showed that graph-based indexing performs robustly, and they also yielded practical findings, e.g., that vector normalization enables efficient search with cosine similarity.
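The remark about vector normalization can be illustrated with a small sketch. For unit-length vectors a and b, the squared Euclidean distance equals 2 - 2·cos(a, b), so ranking neighbors by Euclidean distance after normalization gives the same order as ranking by cosine similarity. The code below is a brute-force illustration under that assumption, not the indexing methods compared in the paper; the function names and the toy vectors are hypothetical.

```python
import math


def normalize(v):
    """Scale v to unit Euclidean length (assumes v is not the zero vector)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def euclidean(a, b):
    """Euclidean distance between two vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def rank_by_cosine(query, vectors):
    """Indices of vectors, most similar first, by cosine similarity."""
    return sorted(range(len(vectors)), key=lambda i: -cosine(query, vectors[i]))


def rank_by_euclidean_normalized(query, vectors):
    """Indices of vectors, nearest first, by Euclidean distance after
    normalizing the query and every vector to unit length."""
    q = normalize(query)
    unit = [normalize(v) for v in vectors]
    return sorted(range(len(unit)), key=lambda i: euclidean(q, unit[i]))


# Hypothetical toy "embeddings" in two dimensions.
embeddings = [[1.0, 0.0], [3.0, 3.0], [0.0, 2.0], [-1.0, 0.5]]
query = [2.0, 1.0]

cosine_order = rank_by_cosine(query, embeddings)
euclidean_order = rank_by_euclidean_normalized(query, embeddings)
print(cosine_order, euclidean_order)
```

Both rankings agree on this toy data, which is why a Euclidean (metric-based) index built over normalized vectors can answer cosine-similarity queries.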
Pages: 2265-2275
Number of pages: 11