A Transformer-Based Model for the Prediction of Human Gaze Behavior on Videos

Cited by: 0
Authors
Ozdel, Suleyman [1]
Rong, Yao [1]
Albaba, Berat Mert [2]
Kuo, Yen-Ling [3]
Wang, Xi [2]
Kasneci, Enkelejda [1]
Affiliations
[1] Tech Univ Munich, Munich, Germany
[2] ETH, Zurich, Switzerland
[3] Univ Virginia, Charlottesville, VA USA
Keywords
Eye-tracking; Human gaze prediction; Human attention; Action recognition
DOI
10.1145/3649902.3653439
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Eye-tracking applications that utilize the human gaze in video understanding tasks have become increasingly important. To effectively automate the process of video analysis based on eye-tracking data, it is important to accurately replicate human gaze behavior. However, this task presents significant challenges due to the inherent complexity and ambiguity of human gaze patterns. In this work, we introduce a novel method for simulating human gaze behavior. Our approach uses a transformer-based reinforcement learning algorithm to train an agent that acts as a human observer, with the primary role of watching videos and simulating human gaze behavior. We employ an eye-tracking dataset gathered from videos generated by the VirtualHome simulator, with a primary focus on activity recognition. Our experimental results demonstrate the effectiveness of our gaze prediction method by highlighting its capability to replicate human gaze behavior and its applicability to downstream tasks where real human gaze is used as input.
Pages: 6