A Novel Classification Method: Neighborhood-Based Positive Unlabeled Learning Using Decision Tree (NPULUD)

被引:1
|
作者
Ghasemkhani, Bita [1 ]
Balbal, Kadriye Filiz [2 ]
Birant, Kokten Ulas [3 ,4 ]
Birant, Derya [4 ]
机构
[1] Dokuz Eylul Univ, Grad Sch Nat & Appl Sci, TR-35390 Izmir, Turkiye
[2] Dokuz Eylul Univ, Dept Comp Sci, TR-35390 Izmir, Turkiye
[3] Dokuz Eylul Univ, Informat Technol Res & Applicat Ctr DEBTAM, TR-35390 Izmir, Turkiye
[4] Dokuz Eylul Univ, Dept Comp Engn, TR-35390 Izmir, Turkiye
关键词
artificial intelligence; machine learning; classification; positive unlabeled learning; decision tree; entropy measure; k-nearest neighbors; supervised learning; ALGORITHM;
D O I
10.3390/e26050403
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In a standard binary supervised classification task, the existence of both negative and positive samples in the training dataset are required to construct a classification model. However, this condition is not met in certain applications where only one class of samples is obtainable. To overcome this problem, a different classification method, which learns from positive and unlabeled (PU) data, must be incorporated. In this study, a novel method is presented: neighborhood-based positive unlabeled learning using decision tree (NPULUD). First, NPULUD uses the nearest neighborhood approach for the PU strategy and then employs a decision tree algorithm for the classification task by utilizing the entropy measure. Entropy played a pivotal role in assessing the level of uncertainty in the training dataset, as a decision tree was developed with the purpose of classification. Through experiments, we validated our method over 24 real-world datasets. The proposed method attained an average accuracy of 87.24%, while the traditional supervised learning approach obtained an average accuracy of 83.99% on the datasets. Additionally, it is also demonstrated that our method obtained a statistically notable enhancement (7.74%), with respect to state-of-the-art peers, on average.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Photosynthetic protein classification using genome neighborhood-based machine learning feature
    Apiwat Sangphukieo
    Teeraphan Laomettachit
    Marasri Ruengjitchatchawalya
    [J]. Scientific Reports, 10
  • [2] Photosynthetic protein classification using genome neighborhood-based machine learning feature
    Sangphukieo, Apiwat
    Laomettachit, Teeraphan
    Ruengjitchatchawalya, Marasri
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [3] RWN: A Novel Neighborhood-Based Method for Statistical Disclosure Control
    Perry, Noah
    Matloff, Norman
    Tendick, Patrick
    [J]. TRANSACTIONS ON DATA PRIVACY, 2024, 17 (02) : 55 - 88
  • [4] Pixel Classification using General Adaptive Neighborhood-based Features
    Gonzalez-Castro, Victor
    Debayle, Johan
    Curic, Vladimir
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3750 - 3755
  • [5] SVM based adaptive learning method for text classification from positive and unlabeled documents
    Peng, Tao
    Zuo, Wanli
    He, Fengling
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 16 (03) : 281 - 301
  • [6] SVM based adaptive learning method for text classification from positive and unlabeled documents
    Tao Peng
    Wanli Zuo
    Fengling He
    [J]. Knowledge and Information Systems, 2008, 16 : 281 - 301
  • [7] A Two-Step Classification Method Based on Collaborative Representation for Positive and Unlabeled Learning
    Wang, Yijin
    Peng, Yali
    He, Kai
    Liu, Shigang
    Li, Jun
    [J]. NEURAL PROCESSING LETTERS, 2021, 53 (06) : 4239 - 4255
  • [8] A Two-Step Classification Method Based on Collaborative Representation for Positive and Unlabeled Learning
    Yijin Wang
    Yali Peng
    Kai He
    Shigang Liu
    Jun Li
    [J]. Neural Processing Letters, 2021, 53 : 4239 - 4255
  • [9] A novel reliable negative method based on clustering for learning from positive and unlabeled examples
    Zhang, Bangzuo
    Zuo, Wanli
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 385 - 392
  • [10] SAR image classification using adaptive neighborhood-based convolutional neural network
    Zhang, Anjun
    Yang, Xuezhi
    Jia, Lu
    Ai, Jiaqiu
    Dong, Zhangyu
    [J]. EUROPEAN JOURNAL OF REMOTE SENSING, 2019, 52 (01) : 178 - 193