Simulating Human Visual System Based on Vision Transformer

被引：1

作者：

Qiu, Mengyu ^{[1
]}

Guo, Yi ^{[2
]}

Zhang, Mingguang ^{[1
]}

Zhang, Jingwei ^{[1
]}

Lan, Tian ^{[1
]}

Liu, Zhilin ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China

[2] Chinese Acad Sci Xian, Xian Inst Opt & Precis Mech, Xian, Peoples R China

来源：

ACM SYMPOSIUM ON SPATIAL USER INTERACTION, SUI 2023 | 2023年

关键词：

Visual scanpath prediction; fixation duration prediction; saccade Sequences; visual attention; scene analysis; EYE-MOVEMENTS; MODEL;

D O I：

10.1145/3607822.3616408

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The human visual system (HVS) is capable of responding in real-time to complex visual environments. During the process of freely observing visual scenes, predicting eye movements and visual fixations is a task known as scanpath prediction, which aims to simulate the HVS. In this paper, we propose a visual transformer-based model to study the attentional processes of the human visual system in analyzing visual scenes, thereby achieving scanpath prediction. This technology has important applications in human-computer interaction, virtual reality, augmented reality, and other fields. We have significantly simplified the workflow of scanpath prediction and the overall model architecture, achieving performance superior to existing methods.

引用

页数：5

共 50 条

[1] ViTVO: Vision Transformer based Visual Odometry with Attention Supervision
Chiu, Chu-Chi
Yang, Hsuan-Kung
Chen, Hao-Wei
Chen, Yu-Wen
Lee, Chun-Yi
2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
[2] Thresholds of Vision of the Human Visual System: Visual Adaptation for Monocular and Binocular Vision
Montrucchio, Bartolomeo
Celozzi, Cesare
Cerutti, Paolo
IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2015, 45 (06) : 739 - 749
[3] Vision Transformer in Industrial Visual Inspection
Hutten, Nils
Meyes, Richard
Meisen, Tobias
APPLIED SCIENCES-BASEL, 2022, 12 (23):
[4] A human activity recognition method based on Vision Transformer
Han, Huiyan
Zeng, Hongwei
Kuang, Liqun
Han, Xie
Xue, Hongxin
SCIENTIFIC REPORTS, 2024, 14 (01):
[5] Fine-grained visual clasificatio based on compct Vision transformer
Xu H.
Guo L.
Li R.-Z.
Kongzhi yu Juece/Control and Decision, 2024, 39 (03): : 893 - 900
[6] Vision transformer-based visual language understanding of the construction process
Yang, Bin
Zhang, Binghan
Han, Yilong
Liu, Boda
Hu, Jiniming
Jin, Yiming
ALEXANDRIA ENGINEERING JOURNAL, 2024, 99 : 242 - 256
[7] Visual perception enhancement fall detection algorithm based on vision transformer
Cai, Xi
Wang, Xiangcheng
Bao, Kexin
Chen, Yinuo
Jiao, Yin
Han, Guang
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
[8] Performance assessment of a visual attention system entirely based on a human vision modeling
Le Meur, O
Le Callet, P
Barba, D
Thoreau, D
ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 2327 - 2330
[9] Visual simulating dichromatic vision in CIE space
Hu, Yinghua
GRAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS THEORY AND APPLICATIONS, 2006, : 92 - 97
[10] Computer-Based System for Simulating Visual Impairments
Velazquez, Ramiro
Varona, Jorge
Rodrigo, Pedro
IETE JOURNAL OF RESEARCH, 2016, 62 (06) : 833 - 841

← 1 2 3 4 5 →