On Approximately Searching for Similar Word Embeddings

Cited by: 0
Authors
Sugawara, Kohei [1]
Kobayashi, Hayato [1]
Iwasaki, Masajiro [1]
Affiliations
[1] Yahoo Japan Corp, Chiyoda Ku, 1-3 Kioicho, Tokyo 1028282, Japan
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We discuss approximate similarity search for word embeddings, an operation that approximately finds the embeddings close to a given vector. We compared several metric-based search algorithms using hash-, tree-, and graph-based indexing from different aspects. Our experimental results showed that graph-based indexing exhibits robust performance, and they additionally provided useful information, e.g., that vector normalization achieves an efficient search with cosine similarity.
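The remark about vector normalization can be illustrated with a small sketch. It is not the authors' method or any of the indexes they compared: it is an exact brute-force search on random vectors, and every name in it (`normalize`, `top_k_by_cosine`, `top_k_by_euclidean_normalized`) is hypothetical. It relies only on the identity that, for unit-length vectors u and v, the squared Euclidean distance equals 2 - 2 cos(u, v), so a metric-based (Euclidean) search over normalized vectors should return the same neighbors as a cosine-similarity search.

```python
import math
import random


def normalize(v):
    """Scale v to unit Euclidean length (assumes v is non-zero)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)


def euclidean(u, v):
    """Euclidean distance between two vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def top_k_by_cosine(query, vectors, k):
    """Indices of the k vectors with the highest cosine similarity."""
    order = sorted(range(len(vectors)),
                   key=lambda i: -cosine(query, vectors[i]))
    return order[:k]


def top_k_by_euclidean_normalized(query, vectors, k):
    """Indices of the k nearest vectors by Euclidean distance,
    computed after normalizing the query and every vector."""
    q = normalize(query)
    unit = [normalize(v) for v in vectors]
    order = sorted(range(len(unit)), key=lambda i: euclidean(q, unit[i]))
    return order[:k]


# Random stand-ins for word embeddings (illustrative data only).
rng = random.Random(0)
vectors = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(200)]
query = [rng.gauss(0, 1) for _ in range(8)]

cos_top = top_k_by_cosine(query, vectors, 5)
euc_top = top_k_by_euclidean_normalized(query, vectors, 5)
print(cos_top == euc_top)
```

In practice the normalized vectors would be handed to a hash-, tree-, or graph-based index rather than sorted exhaustively; the sketch only shows why normalizing first lets a Euclidean index serve cosine-similarity queries.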
Pages: 2265-2275
Number of pages: 11
Related papers
50 items in total
  • [21] Compositional Demographic Word Embeddings
    Welch, Charles
    Kummerfeld, Jonathan K.
    Perez-Rosas, Veronica
    Mihalcea, Rada
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4076 - 4089
  • [22] Eigenwords: Spectral Word Embeddings
    Dhillon, Paramveer S.
    Foster, Dean P.
    Ungar, Lyle H.
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2015, 16 : 3035 - 3078
  • [23] Word Embeddings for Speech Recognition
    Bengio, Samy
    Heigold, Georg
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1053 - 1057
  • [24] Linguistic Information in Word Embeddings
    Basirat, Ali
    Tang, Marc
    [J]. AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2018, 2019, 11352 : 492 - 513
  • [25] Evaluation of Croatian Word Embeddings
    Svoboda, Lukas
    Beliga, Slobodan
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1512 - 1518
  • [26] Adaptive Compression of Word Embeddings
    Kim, Yeachan
    Kim, Kang-Min
    Lee, SangKeun
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3950 - 3959
  • [27] Exploring Numeracy in Word Embeddings
    Naik, Aakanksha
    Ravichander, Abhilasha
    Rose, Carolyn
    Hovy, Eduard
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3374 - 3380
  • [28] Word Embeddings with Limited Memory
    Ling, Shaoshi
    Song, Yangqiu
    Roth, Dan
    [J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 387 - 392
  • [29] Ontology Matching with Word Embeddings
    Zhang, Yuanzhe
    Wang, Xuepeng
    Lai, Siwei
    He, Shizhu
    Liu, Kang
    Zhao, Jun
    Lv, Xueqiang
    [J]. CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2014, 2014, 8801 : 34 - 45
  • [30] Word Embeddings as Statistical Estimators
    Dey, Neil
    Singer, Matthew
    Williams, Jonathan P.
    Sengupta, Srijan
    [J]. SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 2024,