The structure of visual spaces

被引:32
|
作者
Koenderink, Jan [1 ,2 ,3 ]
van Doorn, Andrea [1 ]
机构
[1] Buys Ballot Lab, NL-3584 CC Utrecht, Netherlands
[2] Univ Utrecht, Dept Phys & Astron, NL-3508 TC Utrecht, Netherlands
[3] Delft Univ Technol, Man Machine Interact Grp, NL-2600 AA Delft, Netherlands
关键词
visual space; human perception; isotropic metric; visual field; parallactic optical structure; panoramic vision; projective structure; Riemann metrics; ground plane; ambiguity groups; pictorial space; stereopsis;
D O I
10.1007/s10851-008-0076-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The "visual space" of an optical observer situated at a single, fixed viewpoint is necessarily very ambiguous. Although the structure of the "visual field" (the lateral dimensions, i.e., the "image") is well defined, the "depth" dimension has to be inferred from the image on the basis of "monocular depth cues" such as occlusion, shading, etc. Such cues are in no way "given", but are guesses on the basis of prior knowledge about the generic structure of the world and the laws of optics. Thus such a guess is like a hallucination that is used to tentatively interpret image structures as depth cues. The guesses are successful if they lead to a coherent interpretation. Such "controlled hallucination" (in psychological terminology) is similar to the "analysis by synthesis" of computer vision. Although highly ambiguous, visual spaces do have geometrical structure. The group of ambiguities left open by the cues (e.g., the well known bas-relief ambiguity in the case of shape from shading) may be interpreted as the group of congruences (proper motions) of the space. The general structure of visual spaces for different visual fields is explored in the paper. Applications include improved viewing systems for optical man-machine interfaces.
引用
收藏
页码:171 / 187
页数:17
相关论文
共 50 条