Active Testing: An Efficient and Robust Framework for Estimating Accuracy

被引:0
|
作者
Phuc Nguyen [1 ]
Ramanan, Deva [2 ]
Fowlkes, Charless [1 ]
机构
[1] Univ Calif Irvine, Irvine, CA 92697 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Much recent work on visual recognition aims to scale up learning to massive, noisily-annotated datasets. We address the problem of scaling-up the evaluation of such models to large-scale datasets with noisy labels. Current protocols for doing so require a human user to either vet (re-annotate) a small fraction of the test set and ignore the rest, or else correct errors in annotation as they are found through manual inspection of results. In this work, we re-formulate the problem as one of active testing, and examine strategies for efficiently querying a user so as to obtain an accurate performance estimate with minimal vetting. We demonstrate the effectiveness of our proposed active testing framework on estimating two performance metrics, Precision@K and mean Average Precision, for two popular computer vision tasks, multi-label classification and instance segmentation. We further show that our approach is able to save significant human annotation effort and is more robust than alternative evaluation protocols.
引用
下载
收藏
页数:10
相关论文
共 50 条
  • [1] Robust Procedures for Estimating and Testing in the Framework of Divergence Measures
    Pardo, Leandro
    Martin, Nirian
    ENTROPY, 2021, 23 (04)
  • [2] An Active Learning Framework for Efficient Robust Policy Search
    Narayanaswami, Sai Kiran
    Sudarsanam, Nandan
    Ravindran, Balaraman
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 1 - 9
  • [3] SAFFRON: A Fast, Efficient, and Robust Framework for Group Testing Based on Sparse-Graph Codes
    Lee, Kangwook
    Chandrasekher, Kabir
    Pedarsani, Ramtin
    Ramchandran, Kannan
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2019, 67 (17) : 4649 - 4664
  • [4] SAFFRON: A Fast, Efficient, and Robust Framework for Group Testing based on Sparse-Graph Codes
    Lee, Kangwook
    Pedarsani, Ramtin
    Ramchandran, Kannan
    2016 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, 2016, : 2873 - 2877
  • [5] Estimating and testing process accuracy with extension to asymmetric tolerances
    Chien-Wei Wu
    Ming-Hung Shu
    W. L. Pearn
    Yi-Chang Tai
    Quality & Quantity, 2010, 44 : 985 - 995
  • [6] Estimating and testing process accuracy with extension to asymmetric tolerances
    Wu, Chien-Wei
    Shu, Ming-Hung
    Pearn, W. L.
    Tai, Yi-Chang
    QUALITY & QUANTITY, 2010, 44 (05) : 985 - 995
  • [7] RELATIVE ACCURACY OF ESTIMATING PRODUCTION AS AFFECTED BY LENGTH OF TESTING INTERVAL AND METHOD OF ESTIMATING
    LAMB, RC
    YOUNG, RM
    JOURNAL OF DAIRY SCIENCE, 1968, 51 (06) : 977 - &
  • [8] A robust framework for estimating linguistic alignment in Twitter conversations
    Doyle, Gabriel
    Yurovsky, Dan
    Frank, Michael C.
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 637 - 648
  • [9] An Examination of Classification Accuracy in the Continuous Testing Framework
    Coggeshall, Whitney Smiley
    EDUCATIONAL MEASUREMENT-ISSUES AND PRACTICE, 2021, 40 (01) : 28 - 35
  • [10] A framework for robust active super tier systems
    Dolev S.
    Gersten O.
    International Journal on Software Tools for Technology Transfer, 2010, 12 (1) : 53 - 67