Neural Mechanisms Underlying Visual Object Recognition

Cited by: 7
Authors
Afraz, Arash
Yamins, Daniel L. K.
DiCarlo, James J. [1]
Affiliation
[1] MIT, Dept Brain & Cognit Sci, E25-618, Cambridge, MA 02139 USA
Source
COGNITION, VOL 79, 2014 | 2014 / Vol. 79
Keywords
INFEROTEMPORAL CORTEX; TEMPORAL CORTEX; INFORMATION; NEURONS; MICROSTIMULATION; EYE;
DOI
10.1101/sqb.2014.79.024729
Chinese Library Classification (CLC) number
B84 [Psychology]; C [Social Sciences, General]; Q98 [Anthropology];
Discipline classification codes
03 ; 0303 ; 030303 ; 04 ; 0402 ;
Abstract
Invariant visual object recognition and the underlying neural representations are fundamental to higher-level human cognition. To understand these neural underpinnings, we combine human and monkey psychophysics, large-scale neurophysiology, neural perturbation methods, and computational modeling to construct falsifiable, predictive models that aim to fully account for the neural encoding and decoding processes that underlie visual object recognition. A predictive encoding model must minimally describe the transformation of the retinal image to population patterns of neural activity along the entire cortical ventral stream of visual processing and must accurately predict the responses to any retinal image. A predictive decoding model must minimally describe the transformation from those population patterns of neural activity to observed object recognition behavior (i.e., subject reports), and, given that population pattern of activity, it must accurately predict behavior for any object recognition task. To date, we have focused on core object recognition, a remarkable behavior that is accomplished with image viewing durations of <200 msec. Our work thus far reveals that the neural encoding process is reasonably well explained by a largely feed-forward, highly complex, multistaged nonlinear neural network: the current best neuronal simulation models predict approximately one-half of the relevant neuronal response variance across the highest levels of the ventral stream (areas V4 and IT). Remarkably, however, the decoding process from IT to behavior for all object recognition tasks tested thus far is very accurately predicted by simple direct linear conversion of the inferior temporal neural population state to behavior choice. We have recently examined the behavioral consequences of direct suppression of IT neural activity using pharmacological and optogenetic methods and find them to be well explained by the same linear decoding model.
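The idea of a linear decoding model, in which a weighted sum of population activity is converted directly into a behavioral choice, can be illustrated with a minimal sketch. This is not the authors' model or data: the population responses are synthetic, and the names `simulate_population`, `fit_linear_decoder`, and `predict_choice`, along with the neuron count, noise level, and ridge penalty, are all assumptions chosen for illustration.

```python
import numpy as np

def simulate_population(tuning, n_trials, rng, noise_sd=2.0):
    """Synthetic stand-in for population responses (NOT real IT recordings).

    Each trial shows object 0 or object 1; every hypothetical neuron shifts
    its response up or down by its 'tuning' value depending on the object.
    """
    labels = rng.integers(0, 2, size=n_trials)      # object shown on each trial
    signs = 2.0 * labels - 1.0                      # map {0, 1} -> {-1, +1}
    noise = rng.normal(0.0, noise_sd, size=(n_trials, tuning.size))
    rates = signs[:, None] * tuning[None, :] + noise
    return rates, labels

def fit_linear_decoder(rates, labels, ridge=1.0):
    """Ridge-regularized least squares from population state to choice (+/-1).

    For simplicity the penalty is applied to every weight, including the bias.
    """
    X = np.hstack([rates, np.ones((rates.shape[0], 1))])   # append bias column
    y = 2.0 * labels - 1.0
    A = X.T @ X + ridge * np.eye(X.shape[1])
    return np.linalg.solve(A, X.T @ y)

def predict_choice(weights, rates):
    """Predicted choice is the sign of the weighted sum of population activity."""
    X = np.hstack([rates, np.ones((rates.shape[0], 1))])
    return (X @ weights > 0).astype(int)

rng = np.random.default_rng(0)
tuning = rng.normal(0.0, 1.0, size=50)              # 50 hypothetical neurons
train_rates, train_labels = simulate_population(tuning, 400, rng)
test_rates, test_labels = simulate_population(tuning, 200, rng)

weights = fit_linear_decoder(train_rates, train_labels)
choices = predict_choice(weights, test_rates)
accuracy = float(np.mean(choices == test_labels))
print(f"held-out decoding accuracy: {accuracy:.3f}")
```

The decoder is fit on one set of trials and evaluated on held-out trials, which mirrors the requirement that a predictive decoding model predict behavior rather than merely describe it.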
Pages: 99-107
Page count: 9