A Transformer-Based Model for the Prediction of Human Gaze Behavior on Videos

Authors
Ozdel, Suleyman [1]
Rong, Yao [1]
Albaba, Berat Mert [2]
Kuo, Yen-Ling [3]
Wang, Xi [2]
Kasneci, Enkelejda [1]
Affiliations
[1] Tech Univ Munich, Munich, Germany
[2] ETH, Zurich, Switzerland
[3] Univ Virginia, Charlottesville, VA, USA
Keywords
Eye-tracking; Human gaze prediction; Human attention; Action recognition
DOI
10.1145/3649902.3653439
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Eye-tracking applications that utilize the human gaze in video understanding tasks have become increasingly important. To effectively automate the process of video analysis based on eye-tracking data, it is important to accurately replicate human gaze behavior. However, this task presents significant challenges due to the inherent complexity and ambiguity of human gaze patterns. In this work, we introduce a novel method for simulating human gaze behavior. Our approach uses a transformer-based reinforcement learning algorithm to train an agent that acts as a human observer, with the primary role of watching videos and simulating human gaze behavior. We employed an eye-tracking dataset gathered from videos generated by the VirtualHome simulator, with a primary focus on activity recognition. Our experimental results demonstrate the effectiveness of our gaze prediction method by highlighting its capability to replicate human gaze behavior and its applicability for downstream tasks where real human gaze is used as input.
Pages: 6