Latency Matters: Real-Time Action Forecasting Transformer

被引:5
|
作者
Girase, Harshayu [1 ,2 ]
Agarwal, Nakul [1 ]
Choi, Chiho [1 ]
Mangalam, Karttikeya [2 ]
机构
[1] Honda Res Inst USA, San Jose, CA USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
关键词
D O I
10.1109/CVPR52729.2023.01799
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present RAFTformer, a real-time action forecasting transformer for latency-aware real-world action forecasting. RAFTformer is a two-stage fully transformer based architecture comprising of a video transformer backbone that operates on high resolution, short-range clips, and a head transformer encoder that temporally aggregates information from multiple short-range clips to span a long-term horizon. Additionally, we propose a novel self-supervised shuffled causal masking scheme as a model level augmentation to improve forecasting fidelity. Finally, we also propose a novel real-time evaluation setting for action forecasting that directly couples model inference latency to overall forecasting performance and brings forth a hitherto overlooked trade-off between latency and action forecasting performance. Our parsimonious network design facilitates RAFTformer inference latency to be 9x smaller than prior works at the same forecasting accuracy. Owing to its two-staged design, RAFTformer uses 94% less training compute and 90% lesser training parameters to outperform prior state-of-the-art baselines by 4.9 points on EGTEA Gaze+ and by 1.4 points on EPIC-Kitchens-100 validation set, as measured by Top-5 recall (T5R) in the offline setting. In the real-time setting, RAFTformer outperforms prior works by an even greater margin of upto 4.4 T5R points on the EPIC-Kitchens-100 dataset. Project Webpage: https://karttikeya.github.io/publication/RAFTformer/.
引用
收藏
页码:18759 / 18769
页数:11
相关论文
共 50 条
  • [1] Real-time disease risk monitoring and forecasting for early action
    Pittiglio, Claudia
    Kivaria, Fredrick
    Morteo, Karl
    Bebay, Charles
    Seck, Ismaila
    Falcucci, Alessandra
    Cinardi, Giuseppina
    Franceschini, Gianluca
    Soumare, Baba
    Dhingra, Madhur
    2024 12TH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS, AGRO-GEOINFORMATICS 2024, 2024, : 142 - 146
  • [2] Real-time squared: A real-time data set for real-time GDP forecasting
    Golinelli, Roberto
    Parigi, Giuseppe
    INTERNATIONAL JOURNAL OF FORECASTING, 2008, 24 (03) : 368 - 385
  • [3] DESIGN FOR REAL-TIME IMAGE TRANSFORMER
    BACCHI, H
    TCHEN, H
    ANNALES DES TELECOMMUNICATIONS-ANNALS OF TELECOMMUNICATIONS, 1975, 30 (9-10): : 363 - 373
  • [4] OPERATIONAL FORECASTING WITH REAL-TIME DATABASES
    BAE, DH
    GEORGAKAKOS, KP
    NANDA, SK
    JOURNAL OF HYDRAULIC ENGINEERING-ASCE, 1995, 121 (01): : 49 - 60
  • [5] Real-Time river flow forecasting
    Shamseldin, AY
    RIVER BASIN MODELLING FOR FLOOD RISK MITIGATION, 2006, : 181 - 195
  • [6] Advances in real-time flood forecasting
    Young, PC
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2002, 360 (1796): : 1433 - 1450
  • [7] A Real-Time Weather Forecasting and Analysis
    Kothapalli, Sushmitha
    Totad, S. G.
    2017 IEEE INTERNATIONAL CONFERENCE ON POWER, CONTROL, SIGNALS AND INSTRUMENTATION ENGINEERING (ICPCSI), 2017, : 1567 - 1570
  • [8] REAL-TIME FORECASTING OF RIVER FLOWS
    KITANIDIS, PK
    BRAS, RL
    TRANSACTIONS-AMERICAN GEOPHYSICAL UNION, 1978, 59 (04): : 272 - 272
  • [9] Digital Time: Latency, Real-time, and the Onlife Experience of Everyday Time
    Luciano Floridi
    Philosophy & Technology, 2021, 34 (3) : 407 - 412
  • [10] Fuzzy time series for real-time flood forecasting
    Chang-Shian Chen
    You-Da Jhong
    Wan-Zhen Wu
    Shien-Tsung Chen
    Stochastic Environmental Research and Risk Assessment, 2019, 33 : 645 - 656