Simulating Human Visual System Based on Vision Transformer

被引:1
|
作者
Qiu, Mengyu [1 ]
Guo, Yi [2 ]
Zhang, Mingguang [1 ]
Zhang, Jingwei [1 ]
Lan, Tian [1 ]
Liu, Zhilin [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
[2] Chinese Acad Sci Xian, Xian Inst Opt & Precis Mech, Xian, Peoples R China
关键词
Visual scanpath prediction; fixation duration prediction; saccade Sequences; visual attention; scene analysis; EYE-MOVEMENTS; MODEL;
D O I
10.1145/3607822.3616408
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The human visual system (HVS) is capable of responding in real-time to complex visual environments. During the process of freely observing visual scenes, predicting eye movements and visual fixations is a task known as scanpath prediction, which aims to simulate the HVS. In this paper, we propose a visual transformer-based model to study the attentional processes of the human visual system in analyzing visual scenes, thereby achieving scanpath prediction. This technology has important applications in human-computer interaction, virtual reality, augmented reality, and other fields. We have significantly simplified the workflow of scanpath prediction and the overall model architecture, achieving performance superior to existing methods.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] ViTVO: Vision Transformer based Visual Odometry with Attention Supervision
    Chiu, Chu-Chi
    Yang, Hsuan-Kung
    Chen, Hao-Wei
    Chen, Yu-Wen
    Lee, Chun-Yi
    2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
  • [2] Thresholds of Vision of the Human Visual System: Visual Adaptation for Monocular and Binocular Vision
    Montrucchio, Bartolomeo
    Celozzi, Cesare
    Cerutti, Paolo
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2015, 45 (06) : 739 - 749
  • [3] Vision Transformer in Industrial Visual Inspection
    Hutten, Nils
    Meyes, Richard
    Meisen, Tobias
    APPLIED SCIENCES-BASEL, 2022, 12 (23):
  • [4] A human activity recognition method based on Vision Transformer
    Han, Huiyan
    Zeng, Hongwei
    Kuang, Liqun
    Han, Xie
    Xue, Hongxin
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [5] Fine-grained visual clasificatio based on compct Vision transformer
    Xu H.
    Guo L.
    Li R.-Z.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (03): : 893 - 900
  • [6] Vision transformer-based visual language understanding of the construction process
    Yang, Bin
    Zhang, Binghan
    Han, Yilong
    Liu, Boda
    Hu, Jiniming
    Jin, Yiming
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 99 : 242 - 256
  • [7] Visual perception enhancement fall detection algorithm based on vision transformer
    Cai, Xi
    Wang, Xiangcheng
    Bao, Kexin
    Chen, Yinuo
    Jiao, Yin
    Han, Guang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [8] Performance assessment of a visual attention system entirely based on a human vision modeling
    Le Meur, O
    Le Callet, P
    Barba, D
    Thoreau, D
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 2327 - 2330
  • [9] Visual simulating dichromatic vision in CIE space
    Hu, Yinghua
    GRAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS THEORY AND APPLICATIONS, 2006, : 92 - 97
  • [10] Computer-Based System for Simulating Visual Impairments
    Velazquez, Ramiro
    Varona, Jorge
    Rodrigo, Pedro
    IETE JOURNAL OF RESEARCH, 2016, 62 (06) : 833 - 841