Mining knowledge in astrophysical massive data sets

Cited by: 1
Authors
Brescia, Massimo [1]
Longo, Giuseppe [2]
Pasian, Fabio [3]
Affiliations
[1] Osserv Astron Capodimonte, INAF, I-80131 Naples, Italy
[2] Univ Naples Federico 2, Dipartimento Fis, I-80125 Naples, Italy
[3] Osserv Astron Trieste, INAF, I-34143 Trieste, Italy
Keywords
Astrophysics; Astroinformatics; Data mining; Virtual observatory; Distributed computing; Knowledge discovery; Machine learning
DOI
10.1016/j.nima.2010.02.002
Chinese Library Classification (CLC) number
TH7 [Instruments and meters]
Discipline classification codes
0804; 080401; 081102
Abstract
Modern scientific data consist mainly of huge data sets gathered by a very large number of techniques and stored in highly diversified and often incompatible data repositories. More generally, in the e-science environment, it is considered a critical and urgent requirement to integrate services across distributed, heterogeneous, dynamic "virtual organizations" formed by different resources within a single enterprise. In the last decade, astronomy has become an immensely data-rich field due to the evolution of detectors (plates to digital to mosaics), telescopes and space instruments. The Virtual Observatory approach consists of the federation, under common standards, of all astronomical archives available worldwide, as well as data analysis, data mining and data exploration applications. The main drive behind such an effort is that, once the infrastructure is complete, it will allow a new type of multi-wavelength, multi-epoch science that can barely be imagined. Data mining, or knowledge discovery in databases, is the main methodology for extracting the scientific information contained in such Massive Data Sets (MDS), yet it poses crucial problems, since it has to orchestrate the complex issues raised by transparent access to different computing environments, scalability of algorithms, reusability of resources, etc. In this paper we summarize the current status of MDS in the Virtual Observatory, and what is currently being done and planned to bring in advanced data mining methodologies in the case of the DAME (DAta Mining and Exploration) project. © 2010 Elsevier B.V. All rights reserved.
Pages: 845-849
Number of pages: 5