Large-Scale Visual Search with Binary Distributed Graph at Alibaba

被引:7
|
作者
Zhao, Kang [1 ]
Pan, Pan [1 ]
Zheng, Yun [1 ]
Zhang, Yanhao [1 ]
Wang, Changxu [1 ]
Zhang, Yingya [1 ]
Xu, Yinghui [1 ]
Jin, Rong [1 ]
机构
[1] Alibaba Grp, Machine Intelligence Technol Lab, Hangzhou, Peoples R China
关键词
Visual Search; Binary Codes; Distributed Algorithm; Graph Construction; PRODUCT QUANTIZATION; NEIGHBOR; ALGORITHM;
D O I
10.1145/3357384.3357834
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Graph-based approximate nearest neighbor search has attracted more and more attentions due to its online search advantages. Numbers of methods studying the enhancement of speed and recall have been put forward. However, few of them focus on the efficiency and scale of offline graph-construction. For a deployed visual search system with several billions of online images in total, building a billion-scale offline graph in hours is essential, which is almost unachievable by most existing methods. In this paper, we propose a novel algorithm called Binary Distributed Graph to solve this problem. Specifically, we combine binary codes with graph structure to speedup both offline and online procedures, and achieve comparable performance with the ones that use real-value features, by recalling and reranking more binary candidates. Furthermore, the graph-construction is optimized to completely distributed implementation, which significantly accelerates the offline process and gets rid of the limitation of single machine, such as memory and storage. Experimental comparisons on Alibaba Commodity Data Set (more than three billion images) show that the proposed method outperforms the state-of-the-art with respect to the online/offline trade-off.
引用
收藏
页码:2567 / 2575
页数:9
相关论文
共 50 条
  • [1] Large-scale Multi-modal Search and QA at Alibaba
    Jin, Rong
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 8 - 8
  • [2] Discrete Multi-graph Hashing for Large-Scale Visual Search
    Xiang, Lingyun
    Shen, Xiaobo
    Qin, Jiaohua
    Hao, Wei
    [J]. NEURAL PROCESSING LETTERS, 2019, 49 (03) : 1055 - 1069
  • [3] Discrete Multi-graph Hashing for Large-Scale Visual Search
    Lingyun Xiang
    Xiaobo Shen
    Jiaohua Qin
    Wei Hao
    [J]. Neural Processing Letters, 2019, 49 : 1055 - 1069
  • [4] Large-Scale Patch Recommendation at Alibaba
    Zhang, Xindong
    Zhu, Chenguang
    Li, Yi
    Guo, Jianmei
    Liu, Lihua
    Gu, Haobo
    [J]. 2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2020), 2020, : 252 - 253
  • [5] Distributed large-scale graph processing on FPGAs
    Sahebi, Amin
    Barbone, Marco
    Procaccini, Marco
    Luk, Wayne
    Gaydadjiev, Georgi
    Giorgi, Roberto
    [J]. JOURNAL OF BIG DATA, 2023, 10 (01)
  • [6] On the Distributed Complexity of Large-Scale Graph Computations
    Pandurangan, Gopal
    Robinson, Peter
    Scquizzato, Michele
    [J]. ACM TRANSACTIONS ON PARALLEL COMPUTING, 2021, 8 (02)
  • [7] Large-Scale Graph Neural Architecture Search
    Guan, Chaoyu
    Wang, Xin
    Chen, Hong
    Zhang, Ziwei
    Zhu, Wenwu
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [8] On the Distributed Complexity of Large-Scale Graph Computations
    Pandurangan, Gopal
    Robinson, Peter
    Scquizzato, Michele
    [J]. SPAA'18: PROCEEDINGS OF THE 30TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2018, : 405 - 414
  • [9] A Distributed Algorithm for Large-Scale Graph Partitioning
    Rahimian, Fatemeh
    Payberah, Amir H.
    Girdzijauskas, Sarunas
    Jelasity, Mark
    Haridi, Seif
    [J]. ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2015, 10 (02)
  • [10] Distributed large-scale graph processing on FPGAs
    Amin Sahebi
    Marco Barbone
    Marco Procaccini
    Wayne Luk
    Georgi Gaydadjiev
    Roberto Giorgi
    [J]. Journal of Big Data, 10