Computational AstroStatistics: Fast and efficient tools for analysing huge astronomical data sources

被引:4
|
作者
Nichol, RC [1 ]
Chong, S [1 ]
Connolly, AJ [1 ]
Davies, S [1 ]
Genovese, C [1 ]
Hopkins, AM [1 ]
Miller, CJ [1 ]
Moore, AW [1 ]
Pelleg, D [1 ]
Richards, GT [1 ]
Schneider, J [1 ]
Szapudi, I [1 ]
Wasserman, L [1 ]
机构
[1] Carnegie Mellon Univ, Dept Phys, Pittsburgh, PA 15213 USA
关键词
D O I
10.1007/0-387-21529-8_18
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
I present here a review of past and present multi-disciplinary research of the Pittsburgh Computational AstroStatistics(2) (PiCA) group. This group is dedicated to developing fast and efficient statistical algorithms for analysing huge astronomical data sources. I begin with a short review of multi-resolutional kd-trees which are the building blocks for many of our algorithms. For example, quick range queries and fast N-point correlation functions. I will present new results from the use of Mixture Models (Connolly et al. 2000) in density estimation of multi-color data from the Sloan Digital Sky Survey (SDSS). Specifically, the selection of quasars and the automated identification of X-ray sources. I will also present a brief overview of the False Discovery Rate (FDR) procedure (Miller et al. 2001a) and show how it has been used in the detection of "Baryon Wiggles" in the local galaxy power spectrum and source identification in radio data. Finally, I will look forward to new research on an automated Bayes Network anomaly detector and the possible use of the Locally Linear Embedding algorithm (LLE, Roweis & Saul 2000) for spectral classification of SDSS spectra. This paper is followed by a commentary by statisticians Fionn D. Murtagh and Dianne Cook.
引用
收藏
页码:265 / 278
页数:14
相关论文
共 8 条
  • [1] Computational AstroStatistics: Fast algorithms and efficient statistics for density estimation in large astronomical datasets
    Nichol, RC
    Connolly, AJ
    Moore, AW
    Schneider, J
    Genovese, C
    Wasserman, L
    [J]. VIRTUAL OBSERVATORIES OF THE FUTURE, PROCEEDINGS, 2001, 225 : 265 - 271
  • [2] Efficient Astronomical Data Condensation Using Fast Nearest Neighbors Search
    Lukasik, Szymon
    Lalik, Konrad
    Sarna, Piotr
    Kowalski, Piotr A.
    Charytanowicz, Malgorzata
    Kulczycki, Piotr
    [J]. INFORMATION TECHNOLOGY, SYSTEMS RESEARCH, AND COMPUTATIONAL PHYSICS, 2020, 945 : 107 - 115
  • [3] FastIoT: an efficient and very fast compression model for displaying a huge volume of IoT data in web environments
    Melchiades, Mateus Begnini
    Paredes Crovato, Cesar David
    Nedel, Everton
    Schreiber, Lincoln Vinicius
    Righi, Rodigo da Rosa
    [J]. INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2021, 12 (5-6) : 605 - 617
  • [4] FastIoT: An efficient and very fast compression model for displaying a huge volume of IoT data in web environments
    Melchiades, Mateus Begnini
    Crovato, César David Paredes
    Nedel, Everton
    Schreiber, Lincoln Vinicius
    da Rosa Righi, Rodigo
    [J]. International Journal of Grid and Utility Computing, 2021, 12 (5-6): : 605 - 617
  • [5] Efficient algorithms for fast integration on large data sets from multiple sources
    Mi, Tian
    Rajasekaran, Sanguthevar
    Aseltine, Robert
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2012, 12
  • [6] Efficient algorithms for fast integration on large data sets from multiple sources
    Tian Mi
    Sanguthevar Rajasekaran
    Robert Aseltine
    [J]. BMC Medical Informatics and Decision Making, 12
  • [7] Design an efficient data driven decision support system to predict flooding by analysing heterogeneous and multiple data sources using Data Lake
    Sreepathy, H., V
    Rao, B. Dinesh
    Kumar, J. Mohan
    Rao, B. Deepak
    [J]. METHODSX, 2023, 11
  • [8] A hybrid computational approach to process real-time streaming multi-sources data and improve classification for emergency patients triage services: moving forward to an efficient IoMT-based real-time telemedicine systems
    Salman, Omar Sadeq
    Latiff, Nurul Mu'azzah Abdul
    Salman, Omar H.
    Ariffin, Sharifah Hafizah Syed
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (17): : 10109 - 10122