Galaxy morphology classification using automated machine learning

被引:15
|
作者
Reza, Moonzarin [1 ]
机构
[1] Texas A&M Univ, Dept Phys & Astron, College Stn, TX 77843 USA
关键词
Galaxy; Morphology; Machine learning; Computational complexity; DIGITAL SKY SURVEY; DARK ENERGY SURVEY; ENVIRONMENTAL DEPENDENCE; ZOO; COSMOS; COLOR; SYSTEMS; FIELD; 1ST;
D O I
10.1016/j.ascom.2021.100492
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
In this paper, we apply five different machine learning algorithms to classify samples into four categories - spirals, ellipticals, mergers and stars (don't know) using data from the Sloan Digital Sky Survey to assess the feasibility of using machine learning methods for future surveys. Classifying mergers as a separate class poses a challenge as this category is easily confused with both ellipticals and spirals, and as a result, most previous studies have not included mergers as a distinct morphological class. The dataset is highly imbalanced with the number of ellipticals/spirals being much larger than the number of stars/mergers, and this is another challenge we aim to address. Starting with 62 features, we perform principal component analysis and use the 25 most significant principal components as inputs to the machine learning models. We compare our results with the Galaxy Zoo labels and obtain an overall test accuracy of 98.2% and 97.5% using Artificial Neural Network and ExtraTrees respectively. However, ExtraTrees outperforms Neural Network in classifying mergers and stars. We also perform a parameter sensitivity test to compare the relative importance of different categories of features on the model's performance. Finally, we address the class imbalance problem and examine the effects of different sampling strategies. Our results show that the use of a balanced dataset with a large number of training samples leads to high recall values for the minority classes, and that oversampling methods lead to better performance than undersampling techniques. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] An automatic taxonomy of galaxy morphology using unsupervised machine learning
    Hocking, Alex
    Geach, James E.
    Sun, Yi
    Davey, Neil
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2018, 473 (01) : 1108 - 1129
  • [2] Automated brain histology classification using machine learning
    Ker, Justin
    Bai, Yeqi
    Lee, Hwei Yee
    Rao, Jai
    Wang, Lipo
    [J]. JOURNAL OF CLINICAL NEUROSCIENCE, 2019, 66 : 239 - 245
  • [3] Image feature extraction and galaxy classification: a novel and efficient approach with automated machine learning
    Tarsitano, F.
    Bruderer, C.
    Schawinski, K.
    Hartley, W. G.
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2022, 511 (03) : 3330 - 3338
  • [4] STAR-GALAXY CLASSIFICATION USING MACHINE LEARNING ALGORITHMS AND DEEP LEARNING
    Savyanavar, Amit Sadanand
    Mhala, Nikhil
    Sutar, Shiv H.
    [J]. INTERNATIONAL JOURNAL ON INFORMATION TECHNOLOGIES AND SECURITY, 2023, 15 (02): : 87 - 96
  • [5] Galaxy morphology - An unsupervised machine learning approach
    Schutter, A.
    Shamir, L.
    [J]. ASTRONOMY AND COMPUTING, 2015, 12 : 60 - 66
  • [6] Machine learning and galaxy morphology: for what purpose?
    Fraix-Burnet, D.
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2023, 523 (03) : 3974 - 3990
  • [7] The miniJPAS survey: star-galaxy classification using machine learning
    Baqui, P. O.
    Marra, V.
    Casarini, L.
    Angulo, R.
    Diaz-Garcia, L. A.
    Hernandez-Monteagudo, C.
    Lopes, P. A. A.
    Lopez-Sanjuan, C.
    Muniesa, D.
    Placco, V. M.
    Quartin, M.
    Queiroz, C.
    Sobral, D.
    Solano, E.
    Tempel, E.
    Varela, J.
    Vilchez, J. M.
    Abramo, R.
    Alcaniz, J.
    Benitez, N.
    Bonoli, S.
    Carneiro, S.
    Cenarro, A. J.
    Cristobal-Hornillos, D.
    de Amorim, A. L.
    de Oliveira, C. M.
    Dupke, R.
    Ederoclite, A.
    Gonzalez Delgado, R. M.
    Marin-Franch, A.
    Moles, M.
    Ramio, H. Vazquez
    Sodre, L.
    Taylor, K.
    [J]. ASTRONOMY & ASTROPHYSICS, 2021, 645
  • [8] Automated Honduran Banknote Image Classification using Machine Learning
    Castelar, Sarah
    Banegas, Leonardo A.
    Mendoza, David A.
    Soto, Jean Carlo
    Davila, Kenny
    [J]. PROCEEDINGS OF THE 2022 IEEE 40TH CENTRAL AMERICA AND PANAMA CONVENTION (CONCAPAN), 2022,
  • [9] Automated traffic classification and application identification using machine learning
    Zander, S
    Nguyen, T
    Armitage, G
    [J]. LCN 2005: 30TH CONFERENCE ON LOCAL COMPUTER NETWORKS, PROCEEDINGS, 2005, : 250 - 257
  • [10] Machine Learning for Automated Tender Classification
    Goswami, Sumit
    Kapoor, Sunaina
    Bhardwaj, Prakriti
    [J]. 2011 ANNUAL IEEE INDIA CONFERENCE (INDICON-2011): ENGINEERING SUSTAINABLE SOLUTIONS, 2011,