Simulating Human Visual System Based on Vision Transformer

Cited by: 1

Authors:

Qiu, Mengyu [1]

Guo, Yi [2]

Zhang, Mingguang [1]

Zhang, Jingwei [1]

Lan, Tian [1]

Liu, Zhilin [1]

Affiliations:

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China

[2] Chinese Acad Sci Xian, Xian Inst Opt & Precis Mech, Xian, Peoples R China

Source:

ACM SYMPOSIUM ON SPATIAL USER INTERACTION, SUI 2023 | 2023

Keywords:

Visual scanpath prediction; fixation duration prediction; saccade sequences; visual attention; scene analysis; EYE-MOVEMENTS; MODEL

DOI:

10.1145/3607822.3616408

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

The human visual system (HVS) can respond in real time to complex visual environments. Scanpath prediction is the task of predicting eye movements and visual fixations during free viewing of a visual scene, with the aim of simulating the HVS. In this paper, we propose a vision transformer-based model that studies the attentional processes of the HVS as it analyzes visual scenes, and thereby achieves scanpath prediction. This technology has important applications in human-computer interaction, virtual reality, augmented reality, and other fields. We significantly simplify the scanpath prediction workflow and the overall model architecture while achieving performance superior to existing methods.

Pages: 5
