Active learning in regression, with application to stochastic dynamic programming

被引:0
|
作者
Teytaud, Olivier [1 ]
Gelly, Sylvain [1 ]
Mary, Jeremie [2 ]
机构
[1] Univ Paris Sud, INRIA, CNRS, UMR 8623,TAO, Paris, France
[2] Univ Lille, Inria, Grappa, Villeneuve Dascq, France
关键词
intelligent control systems and optimization; machine learning in control applications; active learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study active learning as a derandomized form of sampling. We show that full derandomization is not suitable in a robust framework, propose partially derandomized samplings, and develop new active learning methods (i) in which expert knowledge is easy to integrate (ii) with a parameter for the exploration/exploitation dilemma (iii) less randomized than the full-random sampling (yet also not deterministic). Experiments are performed in the case of regression for value-function learning on a continuous domain. Our main results are (i) efficient partially derandomized point sets (ii) moderate-derandomization theorems (iii) experimental evidence of the importance of the frontier (iv) a new regression-specific user-friendly sampling tool less-robust than blind samplers but that sometimes works very efficiently in large dimensions. All experiments can be reproduced by downloading the source code and running the provided command line.
引用
收藏
页码:198 / +
页数:2
相关论文
共 50 条
  • [41] The application of Stochastic Order to Stochastic Multiobjective Programming Problems
    Zheng, Mingfa
    He, Qihang
    Wang, Zutong
    Su, Dongqing
    ADVANCES IN TRANSPORTATION, PTS 1 AND 2, 2014, 505-506 : 524 - +
  • [42] Stochastic Optimal Coordination of Small UAVs for Target Tracking using Regression-based Dynamic Programming
    Steven A. P. Quintero
    Michael Ludkovski
    João P. Hespanha
    Journal of Intelligent & Robotic Systems, 2016, 82 : 135 - 162
  • [43] Stochastic Optimal Coordination of Small UAVs for Target Tracking using Regression-based Dynamic Programming
    Quintero, Steven A. P.
    Ludkovski, Michael
    Hespanha, Joao P.
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2016, 82 (01) : 135 - 162
  • [44] Unsupervised Adaptation of ASR Systems An Application of Dynamic Programming in Machine Learning
    Babu, Akella Amarendra
    Rao, Akepogu Ananda
    Yellasiri, Ramadevi
    2015 SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2015, : 245 - 253
  • [45] LEARNING AUTOMATA IN STOCHASTIC PROGRAMMING PROBLEMS
    POZNYAK, AS
    AUTOMATION AND REMOTE CONTROL, 1973, 34 (10) : 1608 - 1619
  • [46] STOCHASTIC TRENDS IN DYNAMIC REGRESSION-MODELS - AN APPLICATION TO THE EMPLOYMENT-OUTPUT EQUATION
    HARVEY, AC
    HENRY, SGB
    PETERS, S
    WRENLEWIS, S
    ECONOMIC JOURNAL, 1986, 96 (384): : 975 - 985
  • [47] STOCHASTIC DYNAMIC LINEAR PROGRAMMING: A SEQUENTIAL SAMPLING ALGORITHM FOR MULTISTAGE STOCHASTIC LINEAR PROGRAMMING\ast
    Gangammanavar, Harsha
    Sen, Suvrajeet
    SIAM JOURNAL ON OPTIMIZATION, 2021, 31 (03) : 2111 - 2140
  • [48] Dynamic learning in genetic programming
    Chiu, CC
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL I AND II, 1999, : 416 - 422
  • [49] Stochastic Featurization for Active Learning
    Le, Linh
    Nguyen, Minh-Tien
    Tran, Khai Phan
    Zhao, Genghong
    Xia, Zhang
    Zuccon, Guido
    Demartini, Gianluca
    TRUSTWORTHY ARTIFICIAL INTELLIGENCE FOR HEALTHCARE, TAI4H 2024, 2024, 14812 : 52 - 65
  • [50] Dynamic Programming for Multidimensional Stochastic Control Problems
    Jin Ma Department of Mathematics
    ActaMathematicaSinica(EnglishSeries), 1999, 15 (04) : 485 - 506