The out-of-core KNN awakens: the light side of computation force on large datasets

Authors
Javier Olivares
Anne-Marie Kermarrec
Nitin Chiluka
Affiliations
[1] Inria Centre de Recherche Rennes Bretagne Atlantique,
Source
Computing | 2019 / Vol. 101
Keywords
K-nearest neighbors; Out-of-core; Graph processing; 05C85
Abstract
K-nearest neighbors (KNN) is a crucial tool for many applications, e.g., recommender systems, image classification, and web-related applications. However, KNN is a resource-greedy operation, particularly for large datasets. We focus on the challenge of KNN computation over large datasets on a single commodity PC with limited memory. We propose a novel approach to compute KNN on large datasets by leveraging both disk and main memory efficiently. The main rationale of our approach is to minimize random accesses to disk, maximize sequential accesses to data, and make efficient use of only the available memory. We evaluate our approach on large datasets in terms of performance and memory consumption. The evaluation shows that our approach requires only 7% of the time needed by an in-memory baseline to compute a KNN graph.
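The rationale stated in the abstract (sequential rather than random disk access, with bounded memory) can be illustrated with a minimal sketch. This is not the algorithm of the paper, which the abstract does not spell out; it is a plain brute-force KNN that assumes fixed-dimension vectors stored back to back in a binary file. The names `write_vectors`, `read_partitions`, `knn_out_of_core`, `DIM`, and `part_size` are hypothetical and chosen only for illustration.

```python
import heapq
import os
import tempfile
from array import array

DIM = 2  # assumed fixed vector dimension (hypothetical, for illustration)


def write_vectors(path, vectors):
    """Store vectors back to back as float64 so they can be scanned sequentially."""
    with open(path, "wb") as f:
        for v in vectors:
            array("d", v).tofile(f)


def read_partitions(path, part_size, dim=DIM):
    """Yield (first_id, vectors) by reading the file front to back, one partition at a time."""
    first_id = 0
    with open(path, "rb") as f:
        while True:
            buf = array("d")
            try:
                buf.fromfile(f, part_size * dim)
            except EOFError:
                pass  # a short final partition: buf keeps the items that were read
            if not buf:
                break
            vecs = [buf[i:i + dim] for i in range(0, len(buf), dim)]
            yield first_id, vecs
            first_id += len(vecs)


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def knn_out_of_core(path, k, part_size):
    """Brute-force KNN that keeps at most two partitions of vectors in memory."""
    result = {}  # kept in memory here; a real system would likely write it to disk
    for a_start, a_vecs in read_partitions(path, part_size):
        # One bounded heap per vector of partition A; heap[0] is the worst candidate.
        heaps = [[] for _ in a_vecs]
        # Each pass over partition B is one sequential scan of the whole file.
        for b_start, b_vecs in read_partitions(path, part_size):
            for h, (i, u) in zip(heaps, enumerate(a_vecs, a_start)):
                for j, v in enumerate(b_vecs, b_start):
                    if i == j:
                        continue
                    item = (-sq_dist(u, v), -j)  # negated so the heap root is the farthest
                    if len(h) < k:
                        heapq.heappush(h, item)
                    elif item > h[0]:
                        heapq.heapreplace(h, item)
        for offset, h in enumerate(heaps):
            # Nearest first; ties are broken by the smaller id.
            result[a_start + offset] = [-neg_j for _, neg_j in sorted(h, reverse=True)]
    return result
```

Under these assumptions the file is read once per partition, always front to back, so the disk never sees a random seek, and `part_size` caps how many vectors sit in memory at a time. The quadratic number of distance computations is the price of keeping the sketch simple.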
Pages: 19 - 38
Page count: 19
Related papers
50 in total
  • [1] The Out-of-core KNN Awakens: The Light Side of Computation Force on Large Datasets
    Chiluka, Nitin
    Kermarrec, Anne-Marie
    Olivares, Javier
    [J]. NETWORKED SYSTEMS, NETYS 2016, 2016, 9944 : 295 - 310
  • [2] The out-of-core KNN awakens: the light side of computation force on large datasets
    Olivares, Javier
    Kermarrec, Anne-Marie
    Chiluka, Nitin
    [J]. COMPUTING, 2019, 101 (01) : 19 - 38
  • [3] GAMER with out-of-core computation
    Schive, Hsi-Yu
    Tsai, Yu-Chih
    Chiueh, Tzihong
    [J]. COMPUTATIONAL STAR FORMATION, 2011, (270) : 401 - 405
  • [4] Out-of-core clustering of volumetric datasets
    Granberg C.J.
    Li L.
    [J]. Journal of Zhejiang University-SCIENCE A, 2006, 7 (7) : 1134 - 1140
  • [5] Out-of-core clustering of volumetric datasets
    GRANBERG Carl J.
    [J]. Journal of Zhejiang University-Science A(Applied Physics & Engineering), 2006, (07) : 1134 - 1140
  • [6] Large out-of-core tetrahedral meshing
    Alleaume, Aurelien
    Francez, Laurent
    Loriot, Alark
    Maman, Nathan
    [J]. PROCEEDINGS OF THE 16TH INTERNATIONAL MESHING ROUNDTABLE, 2008, : 461 - +
  • [7] The practice of I/O optimizations for out-of-core computation
    Tang, JQ
    Fang, BX
    Hu, MZ
    Zhang, HL
    [J]. PARALLEL AND DISTRIBUTED COMPUTING: APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2004, 3320 : 81 - 85
  • [8] Parallel Out-of-Core computation and updating of the QR factorization
    Gunter, BC
    Van De Geijn, RA
    [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2005, 31 (01) : 60 - 78
  • [9] Out-of-Core Computation of the QR Factorization on Multi-core Processors
    Marques, Mercedes
    Quintana-Orti, Gregorio
    Quintana-Orti, Enrique S.
    van de Geijn, Robert
    [J]. EURO-PAR 2009: PARALLEL PROCESSING, PROCEEDINGS, 2009, 5704 : 809 - +
  • [10] Out-of-core rendering of large, unstructured grids
    Farias, R
    Silva, CT
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2001, 21 (04) : 42 - 50