Simulating Human Visual System Based on Vision Transformer

被引:1
|
作者
Qiu, Mengyu [1 ]
Guo, Yi [2 ]
Zhang, Mingguang [1 ]
Zhang, Jingwei [1 ]
Lan, Tian [1 ]
Liu, Zhilin [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
[2] Chinese Acad Sci Xian, Xian Inst Opt & Precis Mech, Xian, Peoples R China
关键词
Visual scanpath prediction; fixation duration prediction; saccade Sequences; visual attention; scene analysis; EYE-MOVEMENTS; MODEL;
D O I
10.1145/3607822.3616408
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The human visual system (HVS) is capable of responding in real-time to complex visual environments. During the process of freely observing visual scenes, predicting eye movements and visual fixations is a task known as scanpath prediction, which aims to simulate the HVS. In this paper, we propose a visual transformer-based model to study the attentional processes of the human visual system in analyzing visual scenes, thereby achieving scanpath prediction. This technology has important applications in human-computer interaction, virtual reality, augmented reality, and other fields. We have significantly simplified the workflow of scanpath prediction and the overall model architecture, achieving performance superior to existing methods.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Fault Diagnosis of Suspension System Based on Spectrogram Image and Vision Transformer
    Balaji, P. Arun
    Venkatesh, S. Naveen
    Sugumaran, V
    EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2024, 26 (01):
  • [22] A Recognition System for Diagnosing Salivary Gland Neoplasms Based on Vision Transformer
    Li, Mao
    Shen, Ze-liang
    Xian, Hong-chun
    Zheng, Zhi-jian
    Yu, Zhen-wei
    Liang, Xin-hua
    Gao, Rui
    Tang, Ya-ling
    Zhang, Zhong
    AMERICAN JOURNAL OF PATHOLOGY, 2025, 195 (02): : 221 - 231
  • [23] Simulating prosthetic vision: I. Visual models of phosphenes
    Chen, Spencer C.
    Suaning, Gregg J.
    Morley, John W.
    Lovell, Nigel H.
    VISION RESEARCH, 2009, 49 (12) : 1493 - 1506
  • [24] Visual perception and performance under conditions simulating prosthetic vision
    Dagnelie, G.
    Thompson, R. W.
    Barnett, G. D.
    Zhang, W. Q.
    PERCEPTION, 2000, 29 : 84 - 84
  • [25] Binocular stereo vision technology based on human visual characteristics
    Zhao, Jing
    Sui, Xiubao
    Zhu, Haoyang
    Chen, Qian
    Gu, Guohua
    Yao, Zheyi
    AOPC 2021: INFRARED DEVICE AND INFRARED TECHNOLOGY, 2021, 12061
  • [26] Quantification of display visual artifacts based on a human vision model
    Jhang, Zih-Jian
    Wang, Sheng-Bo
    Wen, Chao-Hua
    AD'07: Proceedings of Asia Display 2007, Vols 1 and 2, 2007, : 1947 - 1952
  • [27] The visual vision and human cognition
    Green, TRG
    IEEE SYMPOSIUM ON VISUAL LANGUAGES, PROCEEDINGS, 1996, : 2 - 2
  • [28] The simulating of the woven fabric visual system
    Deng Zhongmin
    Lu Hongmei
    Zhu Lili
    1st International Symposium on Digital Manufacture, Vols 1-3, 2006, : 999 - 1002
  • [29] The simulating of the woven fabric visual system
    Deng, Zhongmin
    Lu, Hongmei
    Zhu, Lili
    Wuhan Ligong Daxue Xuebao/Journal of Wuhan University of Technology, 2006, 28 (SUPPL. 1): : 999 - 1002
  • [30] The Simulating of the Woven Fabric Visual System
    DENG Zhongmin LU Hongmei ZHU Lili Huazhong University of Science TechnologWuhan China Wuhan University of Science EngineeringWnhan China
    武汉理工大学学报, 2006, (S3) : 999 - 1002