Impact of the Initialization in Tree-Based Fast Similarity Search Techniques

被引:0
|
作者
Serrano, Aureo [1 ]
Mico, Luisa [1 ]
Oncina, Jose [1 ]
机构
[1] Univ Alicante, Dept Lenguajes & Sistemas Informat, E-03080 Alicante, Spain
来源
关键词
METRIC-SPACES; QUERIES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many fast similarity search techniques relies on the use of pivots (specially selected points in the data set). Using these points, specific structures (indexes) are built speeding up the search when queering. Usually, pivot selection techniques are incremental, being the first one randomly chosen. This article explores several techniques to choose the first pivot in a tree-based fast similarity search technique. We provide experimental results showing that an adequate choice of this pivot leads to significant reductions in distance computations and time complexity. Moreover, most pivot tree-based indexes emphasizes in building balanced trees. We provide experimentally and theoretical support that very unbalanced trees can be a better choice than balanced ones.
引用
收藏
页码:163 / 176
页数:14
相关论文
共 50 条
  • [41] Tree-Based Ensemble Learning Techniques in the Analysis of Parkinsonian Syndromes
    Gorriz, J. M.
    Ramirez, J.
    Moreno-Caballero, M.
    Martinez-Murcia, F. J.
    Ortiz, A.
    Illan, I. A.
    Segovia, F.
    Salas-Gonzalez, D.
    Gomez-Rio, M.
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS (MIUA 2017), 2017, 723 : 459 - 469
  • [42] Combining fast search and learning for fast similarity search
    Vassef, H
    Li, CS
    Castelli, V
    STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2000, 2000, 3972 : 32 - 42
  • [43] Tree-based classification and regression Part 3: Tree-based procedures
    Gunter, B
    QUALITY PROGRESS, 1998, 31 (02) : 121 - 123
  • [44] Maintaining and Analyzing Production Process Definitions Using a Tree-Based Similarity Measure
    Stumptner, Reinhard
    Lettner, Christian
    Freudenthaler, Bernhard
    Pichler, Josef
    Kirchmayr, Wilhelm
    Draxler, Ewald
    CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2015, 2015, 9343 : 366 - 380
  • [45] An N-ary Tree-based Model for Similarity Evaluation on Mathematical Formulae
    Dai, Yifan
    Chen, Liangyu
    Zhang, Zihan
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 2578 - 2584
  • [46] Some approaches to improve tree-based nearest neighbour search algorithms
    Gómez-Ballester, E
    Micó, L
    Oncina, J
    PATTERN RECOGNITION, 2006, 39 (02) : 171 - 179
  • [47] Combining Elimination Rules in Tree-Based Nearest Neighbor Search Algorithms
    Gomez-Ballester, Eva
    Mico, Luisa
    Thollard, Franck
    Oncina, Jose
    Moreno-Seco, Francisco
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2010, 6218 : 80 - +
  • [48] Fast-RCM: Fast Tree-Based Unsupervised Rare-Class Mining
    Weng, Haiqin
    Ji, Shouling
    Liu, Changchang
    Wang, Ting
    He, Qinming
    Chen, Jianhai
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (10) : 5198 - 5211
  • [49] Network of shortcuts: An adaptive data structure for tree-based search methods
    Bergamini, A
    Kencl, L
    NETWORKING 2005: NETWORKING TECHNOLOGIES, SERVICES, AND PROTOCOLS; PERFORMANCE OF COMPUTER AND COMMUNICATION NETWORKS; MOBILE AND WIRELESS COMMUNICATIONS SYSTEMS, 2005, 3462 : 523 - 535
  • [50] Tree-Based Codification in Neural Architecture Search for Medical Image Segmentation
    Fuentes-Tomas, Jose-Antonio
    Mezura-Montes, Efren
    Acosta-Mesa, Hector-Gabriel
    Marquez-Grajales, Aldo
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2024, 28 (03) : 597 - 607