Statistics of high-level scene context

Cited by: 54
Author
Greene, Michelle R. [1]
Affiliation
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
Source
FRONTIERS IN PSYCHOLOGY | 2013 / Vol. 4
Keywords
context; scene; ensemble; bag of words; data mining; scene understanding; NATURAL SCENES; VISUAL FEATURES; EYE-MOVEMENTS; OBJECTS; PERCEPTION; REPRESENTATION; MEMORY; IDENTIFICATION; CONSISTENCY; REGULARITIES;
DOI
10.3389/fpsyg.2013.00777
Chinese Library Classification number
B84 [Psychology];
Discipline classification code
04 ; 0402 ;
Abstract
Context is critical for recognizing environments and for searching for objects within them: contextual associations have been shown to modulate reaction time and object recognition accuracy, as well as influence the distribution of eye movements and patterns of brain activations. However, we have not yet systematically quantified the relationships between objects and their scene environments. Here I seek to fill this gap by providing descriptive statistics of object-scene relationships. A total of 48,167 objects were hand-labeled in 3499 scenes using the LabelMe tool (Russell et al., 2008). From these data, I computed a variety of descriptive statistics at three different levels of analysis: the ensemble statistics that describe the density and spatial distribution of unnamed "things" in the scene; the bag of words level where scenes are described by the list of objects contained within them; and the structural level where the spatial distribution and relationships between the objects are measured. The utility of each level of description for scene categorization was assessed through the use of linear classifiers, and the plausibility of each level for modeling human scene categorization is discussed. Of the three levels, ensemble statistics were found to be the most informative (per feature), and also best explained human patterns of categorization errors. Although a bag of words classifier had similar performance to human observers, it had a markedly different pattern of errors. However, certain objects are more useful than others, and ceiling classification performance could be achieved using only the 64 most informative objects. As object location tends not to vary as a function of category, structural information provided little additional information. Additionally, these data provide valuable information on natural scene redundancy that can be exploited for machine vision, and can help the visual cognition community to design experiments guided by statistics rather than intuition.
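To make the bag of words level concrete, the following is a minimal sketch, not the paper's method. It assumes a toy set of hand-labeled scenes (the object lists and category names are invented for illustration) and uses a simple perceptron as one possible linear classifier; the abstract states only that linear classifiers were used, not which kind. The names `scenes`, `bag_of_words`, `train_perceptron`, and `classify` are all hypothetical.

```python
from collections import Counter

# Hypothetical toy data: (list of labeled objects, scene category).
# These scenes are invented for illustration and are not from the paper.
scenes = [
    (["sink", "toilet", "mirror", "towel"], "bathroom"),
    (["sink", "mirror", "bathtub"], "bathroom"),
    (["stove", "sink", "refrigerator", "cabinet"], "kitchen"),
    (["stove", "cabinet", "table", "chair"], "kitchen"),
]

# Vocabulary: every object label seen in the training scenes, in sorted order.
vocab = sorted({obj for objects, _ in scenes for obj in objects})


def bag_of_words(objects):
    """Describe a scene by per-object counts, ignoring object locations."""
    counts = Counter(objects)
    return [counts[word] for word in vocab]


def train_perceptron(X, y, epochs=20):
    """Fit a linear decision rule w.x + b with the perceptron update.

    The perceptron is an assumption made for this sketch; any linear
    classifier could stand in here.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for x, target in zip(X, y):
            score = sum(wi * xi for wi, xi in zip(w, x)) + b
            predicted = 1 if score > 0 else -1
            if predicted != target:
                # Move the decision boundary toward the misclassified example.
                w = [wi + target * xi for wi, xi in zip(w, x)]
                b += target
    return w, b


# Encode the two categories as +1 (bathroom) and -1 (kitchen).
X = [bag_of_words(objects) for objects, _ in scenes]
y = [1 if category == "bathroom" else -1 for _, category in scenes]
weights, bias = train_perceptron(X, y)


def classify(objects):
    """Categorize a new scene from its object list alone."""
    x = bag_of_words(objects)
    score = sum(wi * xi for wi, xi in zip(weights, x)) + bias
    return "bathroom" if score > 0 else "kitchen"


print(classify(["toilet", "sink", "towel"]))
print(classify(["stove", "table"]))
```

In this representation each learned weight is attached to one object label, so the objects with the largest absolute weights are the ones the classifier relies on most. That is one simple way to think about the abstract's observation that a small subset of informative objects can carry most of the categorization performance.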
Pages: 31