Pool-Based Sequential Active Learning for Regression

被引:94
|
作者
Wu, Dongrui [1 ]
机构
[1] DataNova LLC, Clifton Pk, NY 12065 USA
关键词
Active learning (AL); inductive learning; passive sampling; ridge regression; transductive learning; MULTIPLE COMPARISONS; MODEL; QUERY;
D O I
10.1109/TNNLS.2018.2868649
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active learning (AL) is a machine-learning approach for reducing the data labeling effort. Given a pool of unlabeled samples, it tries to select the most useful ones to label so that a model built from them can achieve the best possible performance. This paper focuses on pool-based sequential AL for regression (ALR). We first propose three essential criteria that an ALR approach should consider in selecting the most useful unlabeled samples: informativeness, representativeness, and diversity, and compare four existing ALR approaches against them. We then propose a new ALR approach using passive sampling, which considers both the representativeness and the diversity in both the initialization and subsequent iterations. Remarkably, this approach can also be integrated with other existing ALR approaches in the literature to further improve the performance. Extensive experiments on 11 University of California, Irvine, Carnegie Mellon University StatLib, and University of Florida Media Core data sets from various domains verified the effectiveness of our proposed ALR approaches.
引用
收藏
页码:1348 / 1359
页数:12
相关论文
共 50 条
  • [21] A Robust Zero-Sum Game Framework for Pool-based Active Learning
    Zhu, Dixian
    Li, Zhe
    Wang, Xiaoyu
    Gong, Boqing
    Yang, Tianbao
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 517 - 526
  • [22] Improved Algorithms for Agnostic Pool-based Active Classification
    Katz-Samuels, Julian
    Zhang, Jifan
    Jain, Lalit
    Jamieson, Kevin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [23] Application of pool-based active learning in reducing the number of required response history analyses
    Kiani, Jalal
    Camp, Charles
    Pezeshk, Shahram
    Khoshnevis, Naeem
    COMPUTERS & STRUCTURES, 2020, 241
  • [24] Pool-based active learning with optimal sampling distribution and its information geometrical interpretation
    Kanamori, Takafumi
    NEUROCOMPUTING, 2007, 71 (1-3) : 353 - 362
  • [25] Accelerating high-throughput virtual screening through molecular pool-based active learning
    Graff, David E.
    Shakhnovich, Eugene I.
    Coley, Connor W.
    CHEMICAL SCIENCE, 2021, 12 (22) : 7866 - 7881
  • [26] Application of Pool-Based Active Learning in Physics-Based Earthquake Ground-Motion Simulation
    Khoshnevis, Naeem
    Taborda, Ricardo
    SEISMOLOGICAL RESEARCH LETTERS, 2019, 90 (02) : 614 - 622
  • [27] A Pool-based Active Learning Method for Improving Farsi-English Machine Translation system
    Bakhshaei, Somayeh
    Khadivi, Shahram
    2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 822 - 826
  • [28] Early Stopping Heuristics in Pool-Based Incremental Active Learning for Least-Squares Probabilistic Classifier
    Kobayashi, Tsubasa
    Sugiyama, Masashi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (08) : 2065 - 2073
  • [29] Picking groups instead of samples: A close look at Static Pool-based Meta-Active Learning
    Mas, Ignasi
    Ramon Morros, Josep
    Vilaplana, Veronica
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1354 - 1362
  • [30] MExMI: Pool-based Active Model Extraction Crossover Membership Inference
    Xiao, Yaxin
    Ye, Qingqing
    Hu, Haibo
    Zheng, Huadi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,