Pulsar candidate selection using pseudo-nearest centroid neighbour classifier

Cited by: 10
Authors
Xiao, Jiangping [1 ]
Li, Xiangru [2 ]
Lin, Haitao [1 ]
Qiu, Kaibin [1 ]
Affiliations
[1] South China Normal Univ, Sch Math Sci, Guangzhou 510631, Peoples R China
[2] South China Normal Univ, Sch Comp Sci, Guangzhou 510631, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
methods: data analysis; methods: statistical; pulsars: general; SYSTEM;
DOI
10.1093/mnras/stz3539
Chinese Library Classification number
P1 [Astronomy];
Discipline classification code
0704 ;
Abstract
A typical characteristic of the pulsar candidate classification task is the class imbalance between true pulsars and false candidates. This imbalance has negative effects on traditional classification methods. In this study, we introduce a strategy that uses a scatter matrix-based class separability measure to estimate how harmful class imbalance is to pulsar candidate classification. The measure quantifies the damage that imbalance does to the pulsar candidate classification problem and provides a priori information to guide the selection of an appropriate data processing method and the construction of an effective classifier. We then present a non-parametric data exploration technique, the pseudo-nearest centroid neighbour classifier (PNCN), to identify credible pulsar candidates from pulsar survey data sets. The PNCN algorithm can effectively resolve the class imbalance problem and is applicable to data streams. The proposed algorithm is tested on High Time Resolution Universe Pulsar Survey (HTRU) 2 (obtained by an analysis of HTRU Medium Latitude data) and LOTAAS 1 (obtained from the LOFAR Tied-Array All-Sky Survey). The experimental results show that the proposed classifier identifies pulsars with high performance: the precision and recall are 92.3 per cent and 83.1 per cent on HTRU 2, and 97.4 per cent and 95.6 per cent on LOTAAS 1; the false positive rate (FPR) is 0.7 per cent on HTRU 2 and 0.03 per cent on LOTAAS 1, an order of magnitude lower than the corresponding FPRs obtained in Lyon et al. (2016) and Tan et al. (2018).
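The three metrics quoted in the abstract (precision, recall and FPR) follow from the standard confusion-matrix definitions. The minimal Python sketch below shows how they relate on a heavily imbalanced set. The class name `ConfusionCounts`, the function names and the counts are hypothetical and purely illustrative; they are not taken from the paper or from HTRU 2 / LOTAAS 1.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Binary confusion-matrix counts, with 'pulsar' as the positive class."""
    tp: int  # pulsars correctly labelled as pulsars
    fp: int  # non-pulsars wrongly labelled as pulsars
    fn: int  # pulsars wrongly labelled as non-pulsars
    tn: int  # non-pulsars correctly rejected


def precision(c: ConfusionCounts) -> float:
    # Fraction of predicted pulsars that are real pulsars.
    return c.tp / (c.tp + c.fp)


def recall(c: ConfusionCounts) -> float:
    # Fraction of real pulsars that the classifier recovers.
    return c.tp / (c.tp + c.fn)


def false_positive_rate(c: ConfusionCounts) -> float:
    # Fraction of non-pulsars that are wrongly labelled as pulsars.
    return c.fp / (c.fp + c.tn)


# Made-up counts for illustration only: 100 pulsars among 10,000 candidates.
counts = ConfusionCounts(tp=80, fp=20, fn=20, tn=9880)

print(f"precision = {precision(counts):.3f}")
print(f"recall    = {recall(counts):.3f}")
print(f"FPR       = {false_positive_rate(counts):.4f}")
```

Because negatives vastly outnumber positives in such a set, the FPR denominator is large, so the FPR can be small even when precision is only moderate. This is one reason all three figures are worth reading together on imbalanced data.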
Pages: 2119 / 2127
Number of pages: 9
Related papers
50 in total
  • [1] A pseudo nearest centroid neighbour classifier
    Ma, Hongxing
    Gou, Jianping
    Wang, Xili
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2018, 17 (01) : 55 - 68
  • [2] Finger Vein Recognition Using Principle Component Analysis and Adaptive k-Nearest Centroid Neighbour Classifier
    Han, Ng Tze
    Mukahar, Nordiana
    Rosdi, Bakhtiar Affendi
    [J]. INTERNATIONAL JOURNAL OF INTEGRATED ENGINEERING, 2021, 13 (01) : 177 - 187
  • [3] Design of digital classifier circuits with nearest neighbour prior sample selection
    Lacerda, WS
    Braga, AP
    [J]. HIS 2005: 5TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, : 531 - 533
  • [4] Using Hellinger distance in a nearest neighbour classifier for relational databases
    Lee, CH
    Shin, DG
    [J]. KNOWLEDGE-BASED SYSTEMS, 1999, 12 (07) : 363 - 370
  • [5] GENIFER: A nearest neighbour based classifier system using GA
    Fàbrega, FXLI
    Guiu, JMGI
    [J]. GECCO-99: PROCEEDINGS OF THE GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 1999, : 797 - 797
  • [6] Handwritten Digit Recognition Using K-Nearest Neighbour Classifier
    Babu, U. Ravi
    Venkateswarlu, Y.
    Chintha, Aneel Kumar
    [J]. 2014 WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT 2014), 2014, : 60 - +
  • [7] An FPGA based coprocessor for cancer classification using nearest neighbour classifier
    Tahir, Muhammad Atif
    Bouridane, Ahmed
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 3463 - 3466
  • [8] Indonesian graphemic syllabification using a nearest neighbour classifier and recovery procedure
    Edwina Anky Parande
    Suyanto Suyanto
    [J]. International Journal of Speech Technology, 2019, 22 : 13 - 20
  • [9] Indonesian graphemic syllabification using a nearest neighbour classifier and recovery procedure
    Parande, Edwina Anky
    Suyanto, Suyanto
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (01) : 13 - 20
  • [10] Comparison of Music Genre Classification Using Nearest Centroid Classifier and k-Nearest Neighbours
    Tamatjita, Elizabeth Nurmiyati
    Mahastama, Aditya Wikan
    [J]. 2016 INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT AND TECHNOLOGY (ICIMTECH), 2016, : 118 - 123