EFFICIENT ASTRONOMICAL DATA CONDENSATION USING APPROXIMATE NEAREST NEIGHBORS

Cited by: 2
Authors
Lukasik, Szymon [1 ,2 ]
Lalik, Konrad [1 ]
Sarna, Piotr [1 ]
Kowalski, Piotr A. [1 ,2 ]
Charytanowicz, Malgorzata [2 ,3 ]
Kulczycki, Piotr [1 ,2 ]
Affiliations
[1] AGH Univ Sci & Technol, Fac Phys & Appl Comp Sci, Al Mickiewicza 30, PL-30059 Krakow, Poland
[2] Polish Acad Sci, Syst Res Inst, Ul Newelska 6, PL-01447 Warsaw, Poland
[3] Lublin Univ Technol, Fac Elect Engn & Comp Sci, Ul Nadbystrzycka 38D, PL-20618 Lublin, Poland
Keywords
big data; astronomy; data reduction; nearest neighbor search; kd-trees
DOI
10.2478/amcs-2019-0034
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Extracting useful information from astronomical observations is one of the most challenging tasks in data exploration, largely because of the volume of data acquired with advanced observational tools. While other challenges typical of big data problems (such as data variety) are also present, dataset size is the most significant obstacle to visualization and subsequent analysis. This paper studies an efficient data condensation algorithm that aims to provide a compact representation of the data. It is based on fast nearest neighbor calculation using tree structures and parallel processing. The possibility of using approximate identification of neighbors to further improve the algorithm's time performance is also evaluated. The properties of the proposed approach, in terms of both performance and condensation quality, are experimentally assessed on astronomical datasets related to the GAIA mission. It is concluded that the introduced technique might serve as a scalable method of alleviating the problem of dataset size.
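The abstract names the ingredients (condensation, tree-based nearest neighbor search, approximate neighbors) but not the exact condensation rule. The sketch below is therefore only an illustration under stated assumptions, not the authors' implementation: it assumes a generic density-style rule in which each point's radius is the distance to its k-th nearest neighbor, points are visited from densest to sparsest, and each kept point discards everything within twice its radius. The names `build_kdtree`, `knn_distance`, `condense` and the parameters `k` and `eps` are hypothetical. Parallel processing is omitted.

```python
import heapq
import math
import random


def build_kdtree(points, indices, depth=0):
    """Build a kd-tree as nested tuples: (point index, axis, left, right)."""
    if not indices:
        return None
    axis = depth % len(points[0])
    indices = sorted(indices, key=lambda i: points[i][axis])
    mid = len(indices) // 2
    return (indices[mid], axis,
            build_kdtree(points, indices[:mid], depth + 1),
            build_kdtree(points, indices[mid + 1:], depth + 1))


def knn_distance(node, points, query, k, eps, heap):
    """Fill `heap` (a max-heap of negated distances) with the k smallest
    distances from `query`. With eps > 0 the far branch is pruned more
    aggressively, so the result is approximate: the k-th distance found
    is at most (1 + eps) times the exact one."""
    if node is None:
        return
    idx, axis, left, right = node
    d = math.dist(points[idx], query)
    if len(heap) < k:
        heapq.heappush(heap, -d)
    elif d < -heap[0]:
        heapq.heapreplace(heap, -d)
    diff = query[axis] - points[idx][axis]
    near, far = (left, right) if diff < 0 else (right, left)
    knn_distance(near, points, query, k, eps, heap)
    # Cross the splitting plane only if the far side may still improve the result.
    if len(heap) < k or abs(diff) * (1 + eps) < -heap[0]:
        knn_distance(far, points, query, k, eps, heap)


def condense(points, k=3, eps=0.0):
    """Return indices of the points kept by the assumed condensation rule."""
    tree = build_kdtree(points, list(range(len(points))))
    radii = []
    for p in points:
        heap = []
        # k + 1 because the query point itself is stored in the tree.
        knn_distance(tree, points, p, k + 1, eps, heap)
        radii.append(-heap[0])
    removed = [False] * len(points)
    kept = []
    # Visit points from the densest region (smallest radius) outward.
    for i in sorted(range(len(points)), key=lambda j: radii[j]):
        if removed[i]:
            continue
        kept.append(i)
        # Brute-force removal for brevity; a tree radius query would scale better.
        for j in range(len(points)):
            if not removed[j] and math.dist(points[i], points[j]) <= 2 * radii[i]:
                removed[j] = True
    return kept


if __name__ == "__main__":
    random.seed(0)
    data = [(random.random(), random.random()) for _ in range(500)]
    print(len(condense(data, k=5, eps=0.0)), "points kept (exact search)")
    print(len(condense(data, k=5, eps=0.5)), "points kept (approximate search)")
```

Raising `eps` trades neighbor accuracy for fewer visited tree nodes, which is the kind of speed versus quality trade-off the abstract says was evaluated. Larger `k` gives larger radii and therefore a smaller condensed set.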
Pages: 467 - 476
Number of pages: 10
Related papers
50 entries in total
  • [41] Random Grids: Fast Approximate Nearest Neighbors and Range Searching for Image Search
    Aiger, Dror
    Kokiopoulou, Efi
    Rivlin, Ehud
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3471 - 3478
  • [42] An approximate nearest neighbors search algorithm for low-dimensional grid locations
    Adriano Petry
    André Grahl Pereira
    Jonas Rodrigues de Souza
    Earth Science Informatics, 2017, 10 : 183 - 196
  • [43] A Progressive k-d tree for Approximate k-Nearest Neighbors
    Jo, Jaemin
    Seo, Jinwook
    Fekete, Jean-Daniel
    2017 IEEE WORKSHOP ON DATA SYSTEMS FOR INTERACTIVE ANALYSIS (DSIA), 2017,
  • [44] PANENE: A Progressive Algorithm for Indexing and Querying Approximate k-Nearest Neighbors
    Jo, Jaemin
    Seo, Jinwook
    Fekete, Jean-Daniel
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (02) : 1347 - 1360
  • [45] Vehicle Re-Identification by Deep Feature Embedding and Approximate Nearest Neighbors
    Franco, Artur O. R.
    Soares, Felipe F.
    Lira Neto, Aloisio, V
    de Macedo, Jose A. F.
    Rego, Paulo A. L.
    Gomes, Fernando A. C.
    Maia, Jose G. R.
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [46] An approximate nearest neighbors search algorithm for low-dimensional grid locations
    Petry, Adriano
    Pereira, Andre Grahl
    de Souza, Jonas Rodrigues
    EARTH SCIENCE INFORMATICS, 2017, 10 (02) : 183 - 196
  • [47] Nearest neighbors estimation for long memory functional data
    Wang, Lihong
    STATISTICAL METHODS AND APPLICATIONS, 2020, 29 (04) : 709 - 725
  • [48] SANNS: Scaling Up Secure Approximate k-Nearest Neighbors Search
    Chen, Hao
    Chillotti, Ilaria
    Dong, Yihe
    Poburinnaya, Oxana
    Razenshteyn, Ilya
    Riazi, M. Sadegh
    PROCEEDINGS OF THE 29TH USENIX SECURITY SYMPOSIUM, 2020, : 2111 - 2128
  • [49] Low-Power k Nearest Neighbors Classifiers with Approximate Nanoscale Memories
    Liu, Shanshan
    Tang, Xiaochen
    Reviriego, Pedro
    Liu, Weiqiang
    Tang, Wei
    Lombardi, Fabrizio
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON NANOTECHNOLOGY (NANO), 2022, : 8 - 11
  • [50] Nearest Neighbors Using Compact Sparse Codes
    Cherian, Anoop
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 1053 - 1061