Pool-based active learning with optimal sampling distribution and its information geometrical interpretation

被引:13
|
作者
Kanamori, Takafumi [1 ]
机构
[1] Tokyo Inst Technol, Dept Math & Comp Sci, Meguro Ku, Tokyo 1528552, Japan
关键词
pool-based active learning; maximum weighted log-likelihood estimator; statistical risk; information geometry; mean curvature vector;
D O I
10.1016/j.neucom.2006.11.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a pool-based active learning algorithm with approximately optimal sampling distributions. An intuitive understanding of the effectiveness of active learning is also illustrated from the viewpoint of the information geometry. In active learning, one can choose informative input points or input distributions. Appropriate choice of data points is expected in order to make prediction performance more accurate than random data selection. Conventional active learning methods, however, yield serious estimation bias, when parametric statistical models do not include the true probability distribution. To correct the bias, we apply the maximum weighted log-likelihood estimator with approximately optimal input distribution. Optimal input distribution for active learning can be obtained by simple regression estimation. Numerical studies show the effectiveness of the proposed learning algorithm. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:353 / 362
页数:10
相关论文
共 50 条
  • [21] Application of pool-based active learning in reducing the number of required response history analyses
    Kiani, Jalal
    Camp, Charles
    Pezeshk, Shahram
    Khoshnevis, Naeem
    COMPUTERS & STRUCTURES, 2020, 241
  • [22] Improving importance estimation in pool-based batch active learning for approximate linear regression
    Kurihara, Nozomi
    Sugiyama, Masashi
    NEURAL NETWORKS, 2012, 36 : 73 - 82
  • [23] Accelerating high-throughput virtual screening through molecular pool-based active learning
    Graff, David E.
    Shakhnovich, Eugene I.
    Coley, Connor W.
    CHEMICAL SCIENCE, 2021, 12 (22) : 7866 - 7881
  • [24] Application of Pool-Based Active Learning in Physics-Based Earthquake Ground-Motion Simulation
    Khoshnevis, Naeem
    Taborda, Ricardo
    SEISMOLOGICAL RESEARCH LETTERS, 2019, 90 (02) : 614 - 622
  • [25] A Pool-based Active Learning Method for Improving Farsi-English Machine Translation system
    Bakhshaei, Somayeh
    Khadivi, Shahram
    2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 822 - 826
  • [26] A Pool-Based Model of the Spatial Distribution of Undiscovered Petroleum Resoufrces
    Haiyu Gao
    Zhuoheng Chen
    Kirk G. Osadetz
    Peter Hannigan
    Cameron Watson
    Mathematical Geology, 2000, 32 : 725 - 749
  • [27] A pool-based model of the spatial distribution of undiscovered petroleum resources
    Gao, HY
    Chen, ZH
    Osadetz, KG
    Hannigan, P
    Watson, C
    MATHEMATICAL GEOLOGY, 2000, 32 (06): : 725 - 749
  • [28] Early Stopping Heuristics in Pool-Based Incremental Active Learning for Least-Squares Probabilistic Classifier
    Kobayashi, Tsubasa
    Sugiyama, Masashi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (08) : 2065 - 2073
  • [29] Picking groups instead of samples: A close look at Static Pool-based Meta-Active Learning
    Mas, Ignasi
    Ramon Morros, Josep
    Vilaplana, Veronica
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1354 - 1362
  • [30] Pool-based unsupervised active learning for regression using iterative representativeness-diversity maximization (iRDM)
    Liu, Ziang
    Jiang, Xue
    Luo, Hanbin
    Fang, Weili
    Liu, Jiajing
    Wu, Dongrui
    PATTERN RECOGNITION LETTERS, 2021, 142 : 11 - 19