Impact of the Initialization in Tree-Based Fast Similarity Search Techniques

被引:0
|
作者
Serrano, Aureo [1 ]
Mico, Luisa [1 ]
Oncina, Jose [1 ]
机构
[1] Univ Alicante, Dept Lenguajes & Sistemas Informat, E-03080 Alicante, Spain
来源
关键词
METRIC-SPACES; QUERIES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many fast similarity search techniques relies on the use of pivots (specially selected points in the data set). Using these points, specific structures (indexes) are built speeding up the search when queering. Usually, pivot selection techniques are incremental, being the first one randomly chosen. This article explores several techniques to choose the first pivot in a tree-based fast similarity search technique. We provide experimental results showing that an adequate choice of this pivot leads to significant reductions in distance computations and time complexity. Moreover, most pivot tree-based indexes emphasizes in building balanced trees. We provide experimentally and theoretical support that very unbalanced trees can be a better choice than balanced ones.
引用
收藏
页码:163 / 176
页数:14
相关论文
共 50 条
  • [1] Impact of the Initialization in Tree-Based Fast Similarity Search Techniques
    Serrano, Aureo
    Mico, Luisa
    Oncina, Jose
    SIMILARITY-BASED PATTERN RECOGNITION: FIRST INTERNATIONAL WORKSHOP, SIMBAD 2011, 2011, 7005 : 163 - 176
  • [2] A Fast Tree-Based Search Algorithm for Cluster Search Engine
    Tsai, Chun-Wei
    Huang, Ko-Wei
    Chiang, Ming-Chao
    Yang, Chu-Sing
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 1603 - +
  • [3] A Tree-Based Indexing Approach for Diverse Textual Similarity Search
    Yu, Minghe
    Chai, Chengliang
    Yu, Ge
    IEEE ACCESS, 2021, 9 : 8866 - 8876
  • [4] Initialization of dynamic time warping using tree-based fast Nearest Neighbor
    Poularakis, Stergios
    Katsavounidis, Ioannis
    PATTERN RECOGNITION LETTERS, 2016, 79 : 31 - 37
  • [5] On Tree-Based Methods for Similarity Learning
    Clemencon, Stephan
    Vogel, Robin
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, 2019, 11943 : 676 - 688
  • [6] A tabular pruning rule in tree-based fast nearest neighbor search algorithms
    Oncina, Jose
    Thollard, Franck
    Gomez-Ballester, Eva
    Mico, Luisa
    Moreno-Seco, Francisco
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2007, 4478 : 306 - +
  • [7] A tree-based similarity for evaluating concept proximities in an ontology
    Blanchard, Emmanuel
    Kuntz, Pascale
    Harzallah, Mounira
    Briand, Henri
    DATA SCIENCE AND CLASSIFICATION, 2006, : 3 - +
  • [8] Tree-based Microaggregation for the Anonymization of Search Logs
    Navarro-Arribas, Guillermo
    Torra, Vicenc
    2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2009, : 155 - 158
  • [9] A tree-based method for fast melodic retrieval
    Parker, C
    JCDL 2004: PROCEEDINGS OF THE FOURTH ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES: GLOBAL REACH AND DIVERSE IMPACT, 2004, : 254 - 255
  • [10] Tree-based techniques to predict soil units
    Pinheiro, H. S. K.
    Owens, P. R.
    Anjos, L. H. C.
    Carvalho Junior, W.
    Chagas, C. S.
    SOIL RESEARCH, 2017, 55 (08) : 788 - 798