EFFICIENT ASTRONOMICAL DATA CONDENSATION USING APPROXIMATE NEAREST NEIGHBORS

被引:2
|
作者
Lukasik, Szymon [1 ,2 ]
Lalik, Konrad [1 ]
Sarna, Piotr [1 ]
Kowalski, Piotr A. [1 ,2 ]
Charytanowicz, Malgorzata [2 ,3 ]
Kulczycki, Piotr [1 ,2 ]
机构
[1] AGH Univ Sci & Technol, Fac Phys & Appl Comp Sci, Al Mickiewicza 30, PL-30059 Krakow, Poland
[2] Polish Acad Sci, Syst Res Inst, Ul Newelska 6, PL-01447 Warsaw, Poland
[3] Lublin Univ Technol, Fac Elect Engn & Comp Sci, Ul Nadbystrzycka 38D, PL-20618 Lublin, Poland
关键词
big data; astronomy; data reduction; nearest neighbor search; kd-trees; DATA REDUCTION;
D O I
10.2478/amcs-2019-0034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extracting useful information from astronomical observations represents one of the most challenging tasks of data exploration. This is largely due to the volume of the data acquired using advanced observational tools. While other challenges typical for the class of big data problems (like data variety) are also present, the size of datasets represents the most significant obstacle in visualization and subsequent analysis. This paper studies an efficient data condensation algorithm aimed at providing its compact representation. It is based on fast nearest neighbor calculation using tree structures and parallel processing. In addition to that, the possibility of using approximate identification of neighbors, to even further improve the algorithm time performance, is also evaluated. The properties of the proposed approach, both in terms of performance and condensation quality, are experimentally assessed on astronomical datasets related to the GAIA mission. It is concluded that the introduced technique might serve as a scalable method of alleviating the problem of the dataset size.
引用
收藏
页码:467 / 476
页数:10
相关论文
共 50 条
  • [1] Efficient Astronomical Data Condensation Using Fast Nearest Neighbors Search
    Lukasik, Szymon
    Lalik, Konrad
    Sarna, Piotr
    Kowalski, Piotr A.
    Charytanowicz, Malgorzata
    Kulczycki, Piotr
    INFORMATION TECHNOLOGY, SYSTEMS RESEARCH, AND COMPUTATIONAL PHYSICS, 2020, 945 : 107 - 115
  • [2] APPROXIMATE NEAREST NEIGHBORS USING SPARSE REPRESENTATIONS
    Zepeda, Joaquin
    Kijak, Ewa
    Guillemot, Christine
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 2370 - 2373
  • [3] AnnArbor: Approximate Nearest Neighbors Using Arborescence Coding
    Babenko, Artem
    Lempitsky, Victor
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4895 - 4903
  • [4] Nonlinear Dimensionality Reduction using Approximate Nearest Neighbors
    Plaku, Erion
    Kavraki, Lydia E.
    PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 180 - 191
  • [5] Randomized approximate nearest neighbors algorithm
    Jones, Peter Wilcox
    Osipov, Andrei
    Rokhlin, Vladimir
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (38) : 15679 - 15686
  • [6] Holder Homeomorphisms and Approximate Nearest Neighbors
    Andoni, Alexandr
    Naor, Assaf
    Nikolov, Aleksandar
    Razenshteyn, Ilya
    Waingarten, Erik
    2018 IEEE 59TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2018, : 159 - 169
  • [7] A randomized approximate nearest neighbors algorithm
    Jones, Peter W.
    Osipov, Andrei
    Rokhlin, Vladimir
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2013, 34 (03) : 415 - 444
  • [8] Data-Efficient Finetuning Using Cross-Task Nearest Neighbors
    Ivison, Hamish
    Smith, Noah A.
    Hajishirzi, Hannaneh
    Dasigi, Pradeep
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9036 - 9061
  • [9] On approximate nearest neighbors under I∞ norm
    Indyk, P
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2001, 63 (04) : 627 - 638
  • [10] Hierarchical Clustering in General Metric Spaces using Approximate Nearest Neighbors
    Moseley, Benjamin
    Vassilvitskii, Sergei
    Wang, Yuyan
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130