A graph-based cache for large-scale similarity search engines

被引:0
|
作者
Veronica Gil-Costa
Mauricio Marin
Carolina Bonacic
Roberto Solar
机构
[1] Universidad Nacional de San Luis,DIINF
[2] CONICET,CITIAPS
[3] CeBiB,undefined
[4] Centre for Biotechnology and Bioengineering,undefined
[5] Universidad de Santiago de Chile,undefined
[6] Universidad de Santiago de Chile,undefined
来源
The Journal of Supercomputing | 2018年 / 74卷
关键词
Approximate similarity search; Metric space cache; Distributed large-scale search engines;
D O I
暂无
中图分类号
学科分类号
摘要
Large-scale similarity search engines are complex systems devised to process unstructured data like images and videos. These systems are deployed on clusters of distributed processors communicated through high-speed networks. To process a new query, a distance function is evaluated between the query and the objects stored in the database. This process relays on a metric space index distributed among the processors. In this paper, we propose a cache-based strategy devised to reduce the number of computations required to retrieve the top-k object results for user queries by using pre-computed information. Our proposal executes an approximate similarity search algorithm, which takes advantage of the links between objects stored in the cache memory. Those links form a graph of similarity among pre-computed queries. Compared to the previous methods in the literature, the proposed approach reduces the number of distance evaluations up to 60%.
引用
收藏
页码:2006 / 2034
页数:28
相关论文
共 50 条
  • [1] A graph-based cache for large-scale similarity search engines
    Gil-Costa, Veronica
    Marin, Mauricio
    Bonacic, Carolina
    Solar, Roberto
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (05): : 2006 - 2034
  • [2] EGM: Enhanced Graph-based Model for Large-scale Video Advertisement Search
    Yu, Tan
    Liu, Jie
    Yang, Yi
    Li, Yi
    Fei, Hongliang
    Li, Ping
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4443 - 4451
  • [3] Accelerating Large-Scale Graph-Based Nearest Neighbor Search on a Computational Storage Platform
    Kim, Ji-Hoon
    Park, Yeo-Reum
    Do, Jaeyoung
    Ji, Soo-Young
    Kim, Joo-Young
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (01) : 278 - 290
  • [4] High Quality Graph-Based Similarity Search
    Yu, Weiren
    McCann, Julie A.
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 83 - 92
  • [5] An Intersection Cache Based on Frequent Itemset Mining in Large Scale Search Engines
    Zhou, Wanwan
    Li, Ruixuan
    Dong, Xinhua
    Xu, Zhiyong
    Xiao, Weijun
    2015 THIRD IEEE WORKSHOP ON HOT TOPICS IN WEB SYSTEMS AND TECHNOLOGIES (HOTWEB), 2015, : 19 - 24
  • [6] Grid graph-based large-scale point clouds registration
    Han, Yi
    Zhang, Guangyun
    Zhang, Rongting
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (01) : 2448 - 2466
  • [7] Graph-based visual analysis for large-scale hydrological modeling
    Leonard, Lorne
    MacEachren, Alan M.
    Madduri, Kamesh
    INFORMATION VISUALIZATION, 2017, 16 (03) : 205 - 216
  • [8] A large-scale graph search algorithms Based LRU
    Xiao Li
    Xiao Jing-Zhong
    2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SCIENCE AND APPLICATION (FCSA 2011), VOL 3, 2011, : 72 - 75
  • [9] An Enhanced Graph-based Infrastructure for Software Search Engines
    Schumacher, Marcus
    Atkinson, Colin
    12TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2015), 2015, : 386 - 390
  • [10] Graph-Based Deep Decomposition for Overlapping Large-Scale Optimization Problems
    Zhang, Xin
    Ding, Bo-Wen
    Xu, Xin-Xin
    Li, Jian-Yu
    Zhan, Zhi-Hui
    Qian, Pengjiang
    Fang, Wei
    Lai, Kuei-Kuei
    Zhang, Jun
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (04): : 2374 - 2386