Active learning: an empirical study of common baselines

被引:61
|
作者
Ramirez-Loaiza, Maria E. [1 ]
Sharma, Manali [1 ]
Kumar, Geet [1 ]
Bilgic, Mustafa [1 ]
机构
[1] IIT, 10 W 31st St, Chicago, IL 60616 USA
基金
美国国家科学基金会;
关键词
Active learning; Query by committee; Uncertainty sampling; Empirical evaluation; LOGISTIC-REGRESSION; AREA;
D O I
10.1007/s10618-016-0469-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the empirical evaluations of active learning approaches in the literature have focused on a single classifier and a single performance measure. We present an extensive empirical evaluation of common active learning baselines using two probabilistic classifiers and several performance measures on a number of large datasets. In addition to providing important practical advice, our findings highlight the importance of overlooked choices in active learning experiments in the literature. For example, one of our findings shows that model selection is as important as devising an active learning approach, and choosing one classifier and one performance measure can often lead to unexpected and unwarranted conclusions. Active learning should generally improve the model's capability to distinguish between instances of different classes, but our findings show that the improvements provided by active learning for one performance measure often came at the expense of another measure. We present several such results, raise questions, guide users and researchers to better alternatives, caution against unforeseen side effects of active learning, and suggest future research directions.
引用
收藏
页码:287 / 313
页数:27
相关论文
共 50 条
  • [21] EMPIRICAL EVALUATION OF THE TRANSFER OF INFORMATION RESOURCES IN ACTIVE LEARNING
    Romansky, Radi
    INTERNATIONAL JOURNAL ON INFORMATION TECHNOLOGIES AND SECURITY, 2023, 15 (01): : 39 - 48
  • [22] An Active Learning Method for Empirical Modeling in Performance Tuning
    Zhang, Jiepeng
    Sun, Jingwei
    Zhou, Wenju
    Sun, Guangzhong
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020, 2020, : 244 - 253
  • [23] DEVELOPING COMPETENCIES THROUGH ACTIVE LEARNING: AN EMPIRICAL ANALYSIS
    Ruiz Palomino, Pablo
    Elche Hortelano, Dioni
    Martinez Canas, Ricardo
    Valencia de Lara, Pilar
    Rodrigo Alarcon, Job
    Martinez Perez, Angela
    7TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED2013), 2013, : 2625 - 2629
  • [24] Learning curves in learning with noise - An empirical study
    Gu, HZ
    Takahashi, H
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1997, E80D (01) : 78 - 85
  • [25] Typology of common psychiatric syndromes - An empirical study
    Sullivan, PF
    Kendler, KS
    BRITISH JOURNAL OF PSYCHIATRY, 1998, 173 : 312 - 319
  • [26] An Empirical Study on the Effectiveness of Common Security Measures
    Harrison, Keith
    White, Gregory
    43RD HAWAII INTERNATIONAL CONFERENCE ON SYSTEMS SCIENCES VOLS 1-5 (HICSS 2010), 2010, : 1939 - 1945
  • [27] Individualized empirical baselines for evaluating the energy performance of existing buildings
    Lou, Yingli
    Ye, Yunyang
    Yang, Yizhi
    Zuo, Wangda
    Wang, Gang
    Strong, Matthew
    Upadhyaya, Satish
    Payne, Chris
    SCIENCE AND TECHNOLOGY FOR THE BUILT ENVIRONMENT, 2023, 29 (01) : 19 - 33
  • [28] An Empirical Study of the Sample Size Variability of Optimal Active Learning Using Gaussian Process Regression
    Yeh, Flora Yu-Hui
    Gallagher, Marcus
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 3787 - 3794
  • [29] Learning to communicate; baselines in Iberian wetland environments
    Sanchez Emeterio, Gema
    Garcia Fernandez, Beatriz
    HISTORIA Y COMUNICACION SOCIAL, 2013, 18
  • [30] KNOWLEDGE CONSTRUCTION IN e-LEARNING: AN EMPIRICAL VALIDATION OF AN ACTIVE LEARNING MODEL
    Koohang, Alex
    Paliszkiewicz, Joanna
    JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2013, 53 (03) : 109 - 114