Active Testing: An Efficient and Robust Framework for Estimating Accuracy

被引:0
|
作者
Phuc Nguyen [1 ]
Ramanan, Deva [2 ]
Fowlkes, Charless [1 ]
机构
[1] Univ Calif Irvine, Irvine, CA 92697 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Much recent work on visual recognition aims to scale up learning to massive, noisily-annotated datasets. We address the problem of scaling-up the evaluation of such models to large-scale datasets with noisy labels. Current protocols for doing so require a human user to either vet (re-annotate) a small fraction of the test set and ignore the rest, or else correct errors in annotation as they are found through manual inspection of results. In this work, we re-formulate the problem as one of active testing, and examine strategies for efficiently querying a user so as to obtain an accurate performance estimate with minimal vetting. We demonstrate the effectiveness of our proposed active testing framework on estimating two performance metrics, Precision@K and mean Average Precision, for two popular computer vision tasks, multi-label classification and instance segmentation. We further show that our approach is able to save significant human annotation effort and is more robust than alternative evaluation protocols.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] A robust framework for estimating theoretical minimum energy requirements for industrial processes
    Bolson, Natanael
    Cullen, Luke
    Cullen, Jonathan
    ENERGY, 2025, 322
  • [32] Estimating spatial quantile regression with functional coefficients: A robust semiparametric framework
    Lu, Zudi
    Tang, Qingguo
    Cheng, Longsheng
    BERNOULLI, 2014, 20 (01) : 164 - 189
  • [33] Efficient and robust rub control with an active auxiliary bearing
    Ginzinger, Lucas
    Ulbrich, Heinz
    Proceedings of the ASME Turbo Expo 2007, Vol 5, 2007, : 1063 - 1071
  • [34] Robust Active Contour Segmentation with an Efficient Global Optimizer
    De Vylder, Jonas
    Aelterman, Jan
    Philips, Wilfried
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, 2011, 6915 : 195 - 206
  • [35] Estimating EDFA Output Power with an Efficient Numerical Modeling Framework
    Fei, Yue
    Fumagalli, Andrea
    Garrich, Miquel
    Sarti, Benjamin
    Moura, Uiara
    Gonzalez, Neil Guerrero
    Oliveira, Juliano
    2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2015, : 5222 - 5227
  • [36] Group testing. An efficient alternative for estimating animal prevalence
    Montesinos Lopez, Osval Antonio
    Montesinos Lopez, Abelardo
    Luna Espinoza, Ignacio
    Gaytan Lugo, Laura Sanely
    Espinosa Solares, Teodoro
    REVISTA MEXICANA DE CIENCIAS PECUARIAS, 2012, 3 (04) : 515 - 531
  • [37] A Framework and Algorithm for Model-Based Active Testing
    Feldman, Alexander
    Provan, Gregory
    van Gemund, Arjan
    2008 INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (PHM), 2008, : 378 - +
  • [38] CALFUZZER: An Extensible Active Testing Framework for Concurrent Programs
    Joshi, Pallavi
    Naik, Mayur
    Park, Chang-Seo
    Sen, Koushik
    COMPUTER AIDED VERIFICATION, PROCEEDINGS, 2009, 5643 : 675 - +
  • [39] ROBUST TESTING AND EVALUATION OF SYSTEMS - FRAMEWORK, APPROACHES, AND ILLUSTRATIVE TOOLS
    BITTNER, AC
    HUMAN FACTORS, 1992, 34 (04) : 477 - 484
  • [40] Unified Framework for Development, Deployment and Robust Testing of Neuroimaging Algorithms
    Alark Joshi
    Dustin Scheinost
    Hirohito Okuda
    Dominique Belhachemi
    Isabella Murphy
    Lawrence H. Staib
    Xenophon Papademetris
    Neuroinformatics, 2011, 9 : 69 - 84