Dimensional Testing for Multi-Step Similarity Search

被引:25
|
作者
Houle, Michael E. [1 ]
Ma, Xiguo [2 ]
Nett, Michael [1 ,3 ]
Oria, Vincent [2 ]
机构
[1] Natl Inst Informat, Tokyo 1018430, Japan
[2] New Jersey Inst Technol, Newark, NJ 07102 USA
[3] Univ Tokyo, Tokyo 1138656, Japan
关键词
Similarity search; k-NN; nearest neighbor; intrinsic dimensionality; multi-step; adaptive similarity;
D O I
10.1109/ICDM.2012.91
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In data mining applications such as subspace clustering or feature selection, changes to the underlying feature set can require the reconstruction of search indices to support fundamental data mining tasks. For such situations, multi-step search approaches have been proposed that can accommodate changes in the underlying similarity measure without the need to rebuild the index. In this paper, we present a heuristic multi-step search algorithm that utilizes a measure of intrinsic dimension, the generalized expansion dimension (GED), as the basis of its search termination condition. Compared to the current state-of-the-art method, experimental results show that our heuristic approach is able to obtain significant improvements in both the number of candidates and the running time, while losing very little in the accuracy of the query results.
引用
收藏
页码:299 / 308
页数:10
相关论文
共 50 条
  • [31] Application of Genetic Multi-Step Search to Unsupervised Design of Morphological Filters for Noise Removal
    Hanada, Yoshiko
    Okuno, Hiroyuki
    Muneyasu, Mitsuji
    Asano, Akira
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS 2009), 2009, : 566 - +
  • [32] Similarity Search Problem Research on Multi-dimensional Data Sets
    Shi, Yong
    Graham, Brian
    PROCEEDINGS OF THE 2013 10TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, 2013, : 573 - 577
  • [33] Indexing expensive functions for efficient multi-dimensional similarity search
    Hanxiong Chen
    Jianquan Liu
    Kazutaka Furuse
    Jeffrey Xu Yu
    Nobuo Ohbo
    Knowledge and Information Systems, 2011, 27 : 165 - 192
  • [34] Indexing expensive functions for efficient multi-dimensional similarity search
    Chen, Hanxiong
    Liu, Jianquan
    Furuse, Kazutaka
    Yu, Jeffrey Xu
    Ohbo, Nobuo
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 27 (02) : 165 - 192
  • [35] Boosting multi-step autoregressive forecasts
    Ben Taieb, Souhaib
    Hyndman, Rob J.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 1), 2014, 32
  • [36] MULTI-STEP STRIPPING ON DEFORMED NUCLEI
    LUKYANOV, VK
    PETKOV, IZ
    PHYSICS LETTERS B, 1969, B 28 (06) : 368 - &
  • [37] A multi-step method for speaker identification
    Savastano, M.
    Luciano, A.
    Pagano, A.
    Peticone, B.
    Riccardi, L.
    2006 IEEE INFORMATION ASSURANCE WORKSHOP, 2006, : 393 - +
  • [38] A note on multi-step difference schemes
    Guo, Bing
    Wang, Ren-Hong
    Zhu, Chun-Gang
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2011, 236 (05) : 647 - 652
  • [39] Multi-Step Planning for Robotic Manipulation
    Pflueger, Max
    Sukhatme, Gaurav S.
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 2496 - 2501
  • [40] A multi-step conference for cooperative broadcast
    Dabora, Ron
    Servetto, Sergio D.
    2006 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, VOLS 1-6, PROCEEDINGS, 2006, : 2190 - +