Selecting training instances for supervised classification

被引:0
|
作者
Roiger, R
Cornell, L
机构
来源
PROCEEDINGS ISAI/IFIS 1996 - MEXICO - USA COLLABORATION IN INTELLIGENT SYSTEMS TECHNOLOGIES | 1996年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several experimental studies have tested the relative merits of various supervised machine learning models, Comparisons have been made along dimensions that include model complexity, prediction accuracy, training set size, and training time. Only limited work has been dane to study the effect of training set exemplar typicality on model performance, We present experimental results obtained in testing C4.5, SX-WEB, a backpropagation neural network ard linear discriminant analysis using a real-valued and a mixed form of a medical data set. We generated training sets of highly typical, widely-varied and atypical exemplars for both data sets. We tested the classification accuracy of each model using the generated training sera. Test set accuracy levels ranged between 76% and 86% when each model was trained with typical or varied training sets. The accuracy levels for C4.5, backpropagation neural net and discriminant analysis dropped significantly when atypical training sets were used, In contrast, with the exception of one test, SX-WEB was unaffected by training set choice. When comparing the correctness of each model, SX-WEB showed the best overall performance. We conclude this paper with directions for future research.
引用
收藏
页码:150 / 155
页数:2
相关论文
共 50 条
  • [41] PLIClass: Weakly Supervised Text Classification with Iterative Training and Denoisy Inference
    Xu, Xiantao
    Hu, Minghao
    Wang, Yongjie
    Luo, Wei
    Liu, Shilong
    Luo, ZhunChen
    Tan, Yushan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT VII, 2024, 15022 : 292 - 305
  • [42] ARTIFICIAL NEURAL NETWORK CLASSIFICATION USING A MINIMAL TRAINING SET - COMPARISON TO CONVENTIONAL SUPERVISED CLASSIFICATION
    HEPNER, GF
    LOGAN, T
    RITTER, N
    BRYANT, N
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 1990, 56 (04): : 469 - 473
  • [43] Toward intelligent training of supervised image classifications: directing training data acquisition for SVM classification
    Foody, GM
    Mathur, A
    REMOTE SENSING OF ENVIRONMENT, 2004, 93 (1-2) : 107 - 117
  • [44] Selecting reliable instances based on evidence theory for transfer learning
    Lv, Ying
    Zhang, Bofeng
    Yue, Xiaodong
    Denoeux, Thierry
    Yue, Shan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 250
  • [45] Identifying and eliminating mislabeled training instances
    Brodley, CE
    Friedl, MA
    PROCEEDINGS OF THE THIRTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE, VOLS 1 AND 2, 1996, : 799 - 805
  • [46] Identifying Mislabeled Instances in Classification Datasets
    Mueller, Nicolas M.
    Markert, Karla
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [47] Instances and Labels: Hierarchy-aware Joint Supervised Contrastive Learning for Hierarchical Multi-Label Text Classification
    Lok, Simon Chi U.
    He, Jie
    Gutierrez-Basulto, Victor
    Pan, Jeff Z.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8858 - 8875
  • [48] Structure features for SAT instances classification
    Ansotegui, Carlos
    Luisa Bonet, Maria
    Giraldez-Cru, Jesus
    Levy, Jordi
    JOURNAL OF APPLIED LOGIC, 2017, 23 : 27 - 39
  • [49] Issues in the Classification of Disease Instances with Ontologies
    Burgun, Anita
    Bodenreider, Olivier
    Jacquelinet, Christian
    CONNECTING MEDICAL INFORMATICS AND BIO-INFORMATICS, 2005, 116 : 695 - 700
  • [50] Identifying and correcting mislabeled training instances
    Sun, Jiang-Wen
    Zhao, Feng-Ying
    Wang, Chong-Jun
    Chen, Shi-Fu
    PROCEEDINGS OF FUTURE GENERATION COMMUNICATION AND NETWORKING, MAIN CONFERENCE PAPERS, VOL 1, 2007, : 243 - 249