A meta-learning configuration framework for graph-based similarity search indexes

Cited by: 2
Authors
Oyamada, Rafael S. [1]
Shimomura, Larissa C. [2]
Barbon Jr, Sylvio [3]
Kaster, Daniel S. [4]
Affiliations
[1] Univ Milan, Milan, Italy
[2] Eindhoven Univ Technol, Eindhoven, Netherlands
[3] Univ Trieste, Trieste, Italy
[4] Univ Londrina, Londrina, Parana, Brazil
Keywords
Similarity searching; Graph-based indexes; Parameter recommending; Meta-learning; ALGORITHM
DOI
10.1016/j.is.2022.102123
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Similarity searches retrieve elements in a dataset with similar characteristics to the input query element. Recent works show that graph-based methods have outperformed others in the literature, such as tree-based and hash-based methods. However, graphs are highly parameter-sensitive for indexing and searching, which usually demands extra time for finding a suitable trade-off for specific user requirements. Current approaches to select parameters rely on observing published experimental results or Grid Search procedures. While the former has no guarantees that good settings for a dataset will also perform well on a different one, the latter is computationally expensive and limited to a small range of values. In this work, we propose a meta-learning-based recommender framework capable of providing a suitable graph configuration according to the characteristics of the input dataset. We present two instantiations of the framework: a global instantiation that uses the whole meta-database to train meta-models and a dataset-similarity-based instantiation that relies on clustering to generate meta-models tailored to datasets with similar characteristics. We also developed generic and tuned versions of the instantiations. The generic versions can satisfy user requirements orders of magnitude faster than the traditional Grid Search. The tuned versions provide more accurate predictions at a higher cost. Our results show that the tuned methods outperform the Grid Search for most cases, providing recommendations close to the optimal one and being a suitable alternative, particularly for more challenging datasets. (c) 2022 Elsevier Ltd. All rights reserved.
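The recommendation idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the meta-database contents, the two-value meta-feature vectors, the single tuned parameter (neighbors per vertex), and the nearest-neighbor meta-model are all assumptions chosen for illustration, and every number is made up. The sketch predicts recall and query time for each candidate configuration from the most similar dataset already in the meta-database, then returns the fastest configuration predicted to meet the user's recall requirement.

```python
from math import dist

# Hypothetical meta-database: one record per (dataset, graph configuration) run.
# Record layout: (dataset meta-features, neighbors per vertex,
#                 measured recall, measured query time in ms).
# All values are synthetic and exist only to make the sketch runnable.
META_DB = [
    ((0.2, 0.1), 10, 0.80, 1.0),
    ((0.2, 0.1), 50, 0.95, 3.0),
    ((0.2, 0.1), 100, 0.99, 6.0),
    ((0.9, 0.8), 10, 0.60, 1.5),
    ((0.9, 0.8), 50, 0.85, 4.0),
    ((0.9, 0.8), 100, 0.96, 8.0),
]


def predict(meta_features, n_neighbors, k=1):
    """Predict (recall, query time) for one configuration.

    Averages the k meta-database records that used the same configuration
    and whose datasets are closest in meta-feature space (a k-NN stand-in
    for the meta-models, assumed here for simplicity).
    """
    candidates = [r for r in META_DB if r[1] == n_neighbors]
    nearest = sorted(candidates, key=lambda r: dist(r[0], meta_features))[:k]
    recall = sum(r[2] for r in nearest) / len(nearest)
    time_ms = sum(r[3] for r in nearest) / len(nearest)
    return recall, time_ms


def recommend(meta_features, min_recall, k=1):
    """Return the fastest configuration predicted to reach min_recall, or None."""
    feasible = []
    for config in sorted({r[1] for r in META_DB}):
        recall, time_ms = predict(meta_features, config, k)
        if recall >= min_recall:
            feasible.append((time_ms, config))
    return min(feasible)[1] if feasible else None


if __name__ == "__main__":
    # A new dataset whose meta-features resemble the first dataset above.
    print(recommend((0.25, 0.15), min_recall=0.9))
```

Unlike a Grid Search, this lookup builds no index for the new dataset; it only needs the dataset's meta-features. The cost is that the recommendation is only as good as the meta-database coverage.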
Pages: 28