Pool-based active learning with optimal sampling distribution and its information geometrical interpretation

被引:13
|
作者
Kanamori, Takafumi [1 ]
机构
[1] Tokyo Inst Technol, Dept Math & Comp Sci, Meguro Ku, Tokyo 1528552, Japan
关键词
pool-based active learning; maximum weighted log-likelihood estimator; statistical risk; information geometry; mean curvature vector;
D O I
10.1016/j.neucom.2006.11.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a pool-based active learning algorithm with approximately optimal sampling distributions. An intuitive understanding of the effectiveness of active learning is also illustrated from the viewpoint of the information geometry. In active learning, one can choose informative input points or input distributions. Appropriate choice of data points is expected in order to make prediction performance more accurate than random data selection. Conventional active learning methods, however, yield serious estimation bias, when parametric statistical models do not include the true probability distribution. To correct the bias, we apply the maximum weighted log-likelihood estimator with approximately optimal input distribution. Optimal input distribution for active learning can be obtained by simple regression estimation. Numerical studies show the effectiveness of the proposed learning algorithm. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:353 / 362
页数:10
相关论文
共 50 条
  • [41] Learning an Optimal Sampling Distribution for Efficient Motion Planning
    Cheng, Richard
    Shankar, Krishna
    Burdick, Joel W.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 7485 - 7492
  • [42] Optimal sampling placement based on value of information
    20153701272439
    (1) Tokyo City University, Tokyo, Japan; (2) Gifu University, Gifu, Japan, 1600, et al; Faculty of Science and Engineering, Waseda University, Japan; The Kajima Foundation, Japan; The Maeda Engineering Foundation, Japan; The Obayashi Foundation, Japan; Tokyo Metropolitan Government, Japan (CRC Press/Balkema):
  • [43] Optimal Sampling Placement Based on Value of Information
    Yoshida, I.
    Otake, Y.
    Honjo, Y.
    LIFE-CYCLE OF STRUCTURAL SYSTEMS: DESIGN, ASSESSMENT, MAINTENANCE AND MANAGEMENT, 2015, : 1362 - 1369
  • [44] Learning Regular Expressions for Interpretable Medical Text Classification Using a Pool-based Simulated Annealing Approach
    Tu, Chaofan
    Cui, Menglin
    2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
  • [45] The highly secure polynomial pool-based key pre-distribution scheme for wireless sensor network
    Choudhary, Vishal
    Taruna, Sunil
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2020, 23 (01): : 95 - 114
  • [46] ESTIMATION OF STRESS INTENSITY FACTOR FOR SURFACE CRACKS IN THE FIRTREE GROOVE STRUCTURE OF A TURBINE DISK USING POOL-BASED ACTIVE LEARNING WITH GAUSSIAN PROCESS REGRESSION
    Guo, Kaimin
    Liu, Hongzhuo
    Yan, Han
    Song, Ziyuan
    Zhang, Shengming
    Huang, Dawei
    Yan, Xiaojun
    JOURNAL OF THEORETICAL AND APPLIED MECHANICS, 2024, 62 (01) : 89 - 101
  • [47] Information Gain Sampling for Active Learning in Medical Image Classification
    Mehta, Raghav
    Shui, Changjian
    Nichyporuk, Brennan
    Arbel, Tal
    UNCERTAINTY FOR SAFE UTILIZATION OF MACHINE LEARNING IN MEDICAL IMAGING, 2022, 13563 : 135 - 145
  • [48] Optimal Product Lifecycle and Partial Information with Active Learning
    Arik Sadeh
    Computational Economics, 2003, 21 (1-2) : 125 - 136
  • [49] Active learning with misclassification sampling based on committee
    Long, Jun
    Yin, Jianping
    Zhu, En
    Zhao, Wentao
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2008, 16 (01) : 55 - 70
  • [50] Active learning method based on instability sampling
    He H.
    Xie M.
    Huang S.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2022, 44 (03): : 50 - 56