Uncertainty Sampling Methods for Selecting Datasets in Active Meta-Learning

被引:0
|
作者
Prudencio, Ricardo B. C. [1 ]
Soares, Carlos [2 ]
Ludermir, Teresa B. [1 ]
机构
[1] Univ Fed Pernambuco, Ctr Informat, BR-50732970 Recife, PE, Brazil
[2] Univ Porto, Fac Econ, LIAAD INESC Porto L A, Oporto 4050190, Portugal
关键词
ALGORITHM SELECTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several meta-learning approaches have been developed for the problem of algorithm selection. In this context, it is of central importance to collect a sufficient number of datasets to be used as meta-examples in order to provide reliable results. Recently, some proposals to generate datasets have addressed this issue with successful results. These proposals include datasetoids, which is a simple manipulation method to obtain new datasets from existing ones. However, the increase in the number of datasets raises another issue: in order to generate meta-examples for training, it is necessary to estimate the performance of the algorithms on the datasets. This typically requires running all candidate algorithms on all datasets, which is computationally very expensive. In a recent paper, active meta-learning has been used to address this problem. An uncertainty sampling method for the k-NN algorithm using a least confidence score based on a distance measure was employed. Here we extend that work, namely by investigating three hypotheses: 1) is there advantage in using a frequency-based least confidence score over the distance-based score? 2) given that the meta-learning problem used has three classes, is it better to use a margin-based score? and 3) given that datasetoids are expected to contain some noise, are better results achieved by starting the search with all datasets already labeled? Some of the results obtained are unexpected and should be further analyzed. However, they confirm that active meta-learning can significantly reduce the computational cost of meta-learning with potential gains in accuracy.
引用
收藏
页码:1082 / 1089
页数:8
相关论文
共 50 条
  • [21] Analysis of Meta-Learning Approaches for TCGA Pan-cancer Datasets
    Chou, Jingyuan
    Bekiranov, Stefan
    Zang, Chongzhi
    Huai, Mengdi
    Zhang, Aidong
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 257 - 262
  • [22] Optimal Hyperparameter Tuning using Meta-Learning for Big Traffic Datasets
    Bui, Khac-Hoai Nam
    Yi, Hongsuk
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 48 - 54
  • [23] A new data characterization for selecting clustering algorithms using meta-learning
    Pimentel, Bruno Almeida
    de Carvalho, Andre C. P. L. F.
    INFORMATION SCIENCES, 2019, 477 : 203 - 219
  • [24] Meta-learning for dynamic tuning of active learning on stream classification
    Martins, Vinicius Eiji
    Cano, Alberto
    Barbon, Sylvio, Jr.
    PATTERN RECOGNITION, 2023, 138
  • [25] Evidential uncertainty sampling strategies for active learning
    Hoarau, Arthur
    Lemaire, Vincent
    Le Gall, Yolande
    Dubois, Jean-Christophe
    Martin, Arnaud
    MACHINE LEARNING, 2024, 113 (09) : 6453 - 6474
  • [26] Adaptive Gradient-Based Meta-Learning Methods
    Khodak, Mikhail
    Balcan, Maria-Florina
    Talwalkar, Ameet
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [27] Exploring Active Learning in Meta-learning: Enhancing Context Set Labeling
    Bae, Wonho
    Wang, Jing
    Sutherland, Danica J.
    COMPUTER VISION - ECCV 2024, PT LXXXIX, 2025, 15147 : 279 - 296
  • [28] Meta-learning for Automated Selection of Anomaly Detectors for Semi-supervised Datasets
    Schubert, David
    Gupta, Pritha
    Wever, Marcel
    ADVANCES IN INTELLIGENT DATA ANALYSIS XXI, IDA 2023, 2023, 13876 : 392 - 405
  • [29] Meta-learning evolutionary artificial neural network for selecting flexible manufacturing systems
    Bhattacharya, Arijit
    Abraham, Ajith
    Grosan, Crina
    Vasant, Pandian
    Han, Sangyong
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 3, PROCEEDINGS, 2006, 3973 : 891 - 897
  • [30] Learning Meta-Learning (LML) dataset: Survey data of meta-learning parameters
    Corraya, Sonia
    Al Mamun, Shamim
    Kaiser, M. Shamim
    DATA IN BRIEF, 2023, 51