The PASCAL Visual Object Classes Challenge: A Retrospective

被引:4128
|
作者
Everingham, Mark [1 ]
Eslami, S. M. Ali [2 ]
Van Gool, Luc [3 ,4 ]
Williams, Christopher K. I. [5 ]
Winn, John [2 ]
Zisserman, Andrew [6 ]
机构
[1] Univ Leeds, Leeds, W Yorkshire, England
[2] Microsoft Res, Cambridge, England
[3] Katholieke Univ Leuven, Leuven, Belgium
[4] ETH, Zurich, Switzerland
[5] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[6] Univ Oxford, Oxford, England
关键词
Database; Benchmark; Object recognition; Object detection; Segmentation; FEATURES;
D O I
10.1007/s11263-014-0733-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Pascal Visual Object Classes (VOC) challenge consists of two components: (i) a publicly available dataset of images together with ground truth annotation and standardised evaluation software; and (ii) an annual competition and workshop. There are five challenges: classification, detection, segmentation, action classification, and person layout. In this paper we provide a review of the challenge from 2008-2012. The paper is intended for two audiences: algorithm designers, researchers who want to see what the state of the art is, as measured by performance on the VOC datasets, along with the limitations and weak points of the current generation of algorithms; and, challenge designers, who want to see what we as organisers have learnt from the process and our recommendations for the organisation of future challenges. To analyse the performance of submitted algorithms on the VOC datasets we introduce a number of novel evaluation methods: a bootstrapping method for determining whether differences in the performance of two algorithms are significant or not; a normalised average precision so that performance can be compared across classes with different proportions of positive instances; a clustering method for visualising the performance across multiple algorithms so that the hard and easy images can be identified; and the use of a joint classifier over the submitted algorithms in order to measure their complementarity and combined performance. We also analyse the community's progress through time using the methods of Hoiem et al. (Proceedings of European Conference on Computer Vision, 2012) to identify the types of occurring errors. We conclude the paper with an appraisal of the aspects of the challenge that worked well, and those that could be improved in future challenges.
引用
收藏
页码:98 / 136
页数:39
相关论文
共 50 条
  • [1] The Pascal Visual Object Classes Challenge: A Retrospective
    Mark Everingham
    S. M. Ali Eslami
    Luc Van Gool
    Christopher K. I. Williams
    John Winn
    Andrew Zisserman
    [J]. International Journal of Computer Vision, 2015, 111 : 98 - 136
  • [2] The 2005 PASCAL visual object classes challenge
    Everingham, Mark
    Zisserman, Andrew
    Williams, Christopher K. I.
    Van Gool, Luc
    Allan, Moray
    Bishop, Christopher M.
    Chapelle, Olivier
    Dalal, Navneet
    Deselaers, Thomas
    Dorko, Gyuri
    Duffner, Stefan
    Eichhorn, Jan
    Farquhar, Jason D. R.
    Fritz, Mario
    Garcia, Christophe
    Griffiths, Tom
    Jurie, Frederic
    Keysers, Daniel
    Koskela, Markus
    Laaksonen, Jorma
    Larlus, Diane
    Leibe, Bastian
    Meng, Hongying
    Ney, Hermann
    Schiele, Bernt
    Schmid, Cordelia
    Seemann, Edgar
    Shawe-Taylor, John
    Storkey, Amos
    Szedmak, Sandor
    Triggs, Bill
    Ulusoy, Ilkay
    Viitaniemi, Ville
    Zhang, Jianguo
    [J]. MACHINE LEARNING CHALLENGES: EVALUATING PREDICTIVE UNCERTAINTY VISUAL OBJECT CLASSIFICATION AND RECOGNIZING TEXTUAL ENTAILMENT, 2006, 3944 : 117 - 176
  • [3] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
  • [4] The Pascal Visual Object Classes (VOC) Challenge
    Mark Everingham
    Luc Van Gool
    Christopher K. I. Williams
    John Winn
    Andrew Zisserman
    [J]. International Journal of Computer Vision, 2010, 88 : 303 - 338
  • [5] The concept of visual classes for object classification
    Schiele, B
    Crowley, JL
    [J]. SCIA '97 - PROCEEDINGS OF THE 10TH SCANDINAVIAN CONFERENCE ON IMAGE ANALYSIS, VOLS 1 AND 2, 1997, : 43 - 50
  • [6] Joint learning of visual attributes, object classes and visual saliency
    Wang, Gang
    Forsyth, David
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 537 - 544
  • [7] Visual object tracking: Progress, challenge, and future
    Zhang, Libo
    Fan, Heng
    [J]. INNOVATION, 2023, 4 (02):
  • [8] PET: AN EYE-TRACKING DATASET FOR ANIMAL-CENTRIC PASCAL OBJECT CLASSES
    Gilani, Syed Omer
    Subramanian, Ramanathan
    Yan, Yan
    Melcher, David
    Sebe, Nicu
    Winkler, Stefan
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,
  • [9] From Pascal to Delphi to Object Pascal-2000
    Gofen, A
    [J]. ACM SIGPLAN NOTICES, 2001, 36 (06) : 38 - 49
  • [10] Properties of patch based approaches for the recognition of visual object classes
    Teynor, Alexandra
    Rahtu, Esa
    Setia, Lokesh
    Burkhardt, Hans
    [J]. PATTERN RECOGNITION, PROCEEDINGS, 2006, 4174 : 284 - 293