Simulating Human Visual System Based on Vision Transformer

被引:1
|
作者
Qiu, Mengyu [1 ]
Guo, Yi [2 ]
Zhang, Mingguang [1 ]
Zhang, Jingwei [1 ]
Lan, Tian [1 ]
Liu, Zhilin [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
[2] Chinese Acad Sci Xian, Xian Inst Opt & Precis Mech, Xian, Peoples R China
关键词
Visual scanpath prediction; fixation duration prediction; saccade Sequences; visual attention; scene analysis; EYE-MOVEMENTS; MODEL;
D O I
10.1145/3607822.3616408
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The human visual system (HVS) is capable of responding in real-time to complex visual environments. During the process of freely observing visual scenes, predicting eye movements and visual fixations is a task known as scanpath prediction, which aims to simulate the HVS. In this paper, we propose a visual transformer-based model to study the attentional processes of the human visual system in analyzing visual scenes, thereby achieving scanpath prediction. This technology has important applications in human-computer interaction, virtual reality, augmented reality, and other fields. We have significantly simplified the workflow of scanpath prediction and the overall model architecture, achieving performance superior to existing methods.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] A human-like visual-attention-based artificial vision system for wildland firefighting assistance
    Kurosh Madani
    Viachaslau Kachurka
    Christophe Sabourin
    Veronique Amarger
    Vladimir Golovko
    Lucile Rossi
    Applied Intelligence, 2018, 48 : 2157 - 2179
  • [42] A human-like visual-attention-based artificial vision system for wildland firefighting assistance
    Madani, Kurosh
    Kachurka, Viachaslau
    Sabourin, Christophe
    Amarger, Veronique
    Golovko, Vladimir
    Rossi, Lucile
    APPLIED INTELLIGENCE, 2018, 48 (08) : 2157 - 2179
  • [43] View Planning Method Based on the Visual Region of the Vision System
    Zhou, Xiaolong
    He, Bingwei
    Li, Y. F.
    2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL III, PROCEEDINGS, 2009, : 196 - +
  • [44] Mapping labels in the human developing visual system and the evolution of binocular vision
    Lambot, MA
    Depasse, F
    Noel, JC
    Vanderhaeghen, P
    JOURNAL OF NEUROSCIENCE, 2005, 25 (31): : 7232 - 7237
  • [45] Image fusion for the novelty rotating synthetic aperture system based on vision transformer
    Sun, Yu
    Zhi, Xiyang
    Jiang, Shikai
    Fan, Guanghua
    Yan, Xu
    Zhang, Wei
    INFORMATION FUSION, 2024, 104
  • [46] SimStu-Transformer: A Transformer-Based Approach to Simulating Student Behaviour
    Li, Zhaoxing
    Shi, Lei
    Cristea, Alexandra
    Zhou, Yunzhan
    Xiao, Chenghao
    Pan, Ziqi
    ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS AND DOCTORAL CONSORTIUM, PT II, 2022, 13356 : 348 - 351
  • [47] Simulating prosthetic vision: Optimizing the information content of a limited visual display
    van Rheede, Joram J.
    Kennard, Christopher
    Hicks, Stephen L.
    JOURNAL OF VISION, 2010, 10 (14):
  • [48] Simulating Human Visual Perception in Nighttime Illumination
    Zhou, Ning
    Dong, Weiming
    Wang, Jiaxin
    Jean-Claude, Paul
    Tsinghua Science and Technology, 2009, 14 (01) : 133 - 138
  • [49] Simulating Human Visual Perception in Nighttime Illumination
    周宁
    董未名
    王家廞
    Paul Jean-Claude
    TsinghuaScienceandTechnology, 2009, 14 (01) : 133 - 138
  • [50] Simulating Human Visual Perception in Tunnel Portals
    Liu, Changjiang
    Wang, Qiuping
    SUSTAINABILITY, 2021, 13 (07)