Random forest algorithm for classification of multiwavelength data

被引:41
|
作者
Gao, Dan [1 ,2 ]
Zhang, Yan-Xia [1 ]
Zhao, Yong-Heng [1 ]
机构
[1] Chinese Acad Sci, Natl Astron Observ, Beijing 100012, Peoples R China
[2] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
classification; astronomical databases: miscellaneous; catalogs; methods: data analysis; methods: statistical; SKY SURVEY;
D O I
10.1088/1674-4527/9/2/011
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
We introduced a decision tree method called Random Forests for multiwavelength data classification. The data were adopted from different databases, including the Sloan Digital Sky Survey (SDSS) Data Release five, USNO, FIRST and ROSAT. We then studied the discrimination of quasars from stars and the classification of quasars, stars and galaxies with the sample from optical and radio bands and with that from optical and X-ray bands. Moreover, feature selection and feature weighting based on Random Forests were investigated. The performances based on different input patterns were compared. The experimental results show that the random forest method is an effective method for astronomical object classification and can be applied to other classification problems faced in astronomy. In addition, Random Forests will show its superiorities due to its own merits, e.g. classification, feature selection, feature weighting as well as outlier detection.
引用
收藏
页码:220 / 226
页数:7
相关论文
共 50 条
  • [1] Random forest algorithm for classification of multiwavelength data
    Dan Gao1
    2 Graduate University of Chinese Academy of Sciences
    [J]. Research in Astronomy and Astrophysics, 2009, 9 (02) : 220 - 226
  • [2] Random forest Algorithm for the Classification of Spectral Data of Astronomical Objects
    Solorio-Ramirez, Jose-Luis
    Jimenez-Cruz, Raul
    Villuendas-Rey, Yenny
    Yanez-Marquez, Cornelio
    [J]. ALGORITHMS, 2023, 16 (06)
  • [3] Smart meter data classification using optimized random forest algorithm
    Zakariazadeh, Alireza
    [J]. ISA TRANSACTIONS, 2022, 126 : 361 - 369
  • [4] Credit Data Classification Based on Ant Colony Algorithm and Random Forest
    Feng, Ruiqi
    Han, Lu
    Chen, Muzi
    [J]. 2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 144 - 149
  • [5] UNBALANCED BIG DATA CLASSIFICATION BASED ON IMPROVED RANDOM FOREST ALGORITHM
    Zheng, Xin
    Huang, Li
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2024, 20 (02): : 575 - 590
  • [6] Object-based classification of hyperspectral data using Random Forest algorithm
    Amini, Saeid
    Homayouni, Saeid
    Safari, Abdolreza
    Darvishsefat, Ali A.
    [J]. GEO-SPATIAL INFORMATION SCIENCE, 2018, 21 (02) : 127 - 138
  • [7] Random Forest Algorithm for HR Data Classification and Performance Analysis in Cloud Environments
    Dong, Fangfang
    [J]. International Journal of Advanced Computer Science and Applications, 2024, 15 (11) : 501 - 508
  • [8] Application of Data Denoising and Classification Algorithm Based on RPCA and Multigroup Random Walk Random Forest in Engineering
    Wang, Renchao
    Wang, Yanlei
    Ma, Yuming
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [9] Enhanced SMOTE Algorithm for Classification of Imbalanced Big-Data using Random Forest
    Bhagat, Reshma C.
    Patil, Sachin S.
    [J]. 2015 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2015, : 403 - 408
  • [10] Research on the Classification of High Dimensional Imbalanced Data based on the Optimization of Random Forest Algorithm
    Ma Xiaojuan
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON BIG DATA ENGINEERING AND TECHNOLOGY (BDET 2018), 2018, : 60 - 67