Latency Matters: Real-Time Action Forecasting Transformer

Cited by: 5
Authors
Girase, Harshayu [1, 2]
Agarwal, Nakul [1]
Choi, Chiho [1]
Mangalam, Karttikeya [2]
Affiliations
[1] Honda Res Inst USA, San Jose, CA USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
Source
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023
DOI
10.1109/CVPR52729.2023.01799
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present RAFTformer, a real-time action forecasting transformer for latency-aware real-world action forecasting. RAFTformer is a two-stage, fully transformer-based architecture comprising a video transformer backbone that operates on high-resolution, short-range clips and a head transformer encoder that temporally aggregates information from multiple short-range clips to span a long-term horizon. Additionally, we propose a novel self-supervised shuffled causal masking scheme as a model-level augmentation to improve forecasting fidelity. Finally, we also propose a novel real-time evaluation setting for action forecasting that directly couples model inference latency to overall forecasting performance and brings forth a hitherto overlooked trade-off between latency and action forecasting performance. Our parsimonious network design gives RAFTformer an inference latency 9x smaller than that of prior works at the same forecasting accuracy. Owing to its two-stage design, RAFTformer uses 94% less training compute and 90% fewer training parameters to outperform prior state-of-the-art baselines by 4.9 points on EGTEA Gaze+ and by 1.4 points on the EPIC-Kitchens-100 validation set, as measured by Top-5 recall (T5R) in the offline setting. In the real-time setting, RAFTformer outperforms prior works by an even greater margin of up to 4.4 T5R points on the EPIC-Kitchens-100 dataset. Project webpage: https://karttikeya.github.io/publication/RAFTformer/.
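The coupling between inference latency and forecasting performance mentioned in the abstract can be illustrated with a minimal sketch. It rests on an assumption that the abstract does not spell out: a prediction must be ready a fixed horizon before the action starts, so a slower model has to stop observing the video earlier. The function name `observation_cutoff` and all numbers are hypothetical and are not taken from the paper.

```python
def observation_cutoff(action_start_s, horizon_s, latency_s=0.0):
    """Latest video timestamp (in seconds) a model may observe.

    Assumption (not from the abstract): the forecast must be available
    `horizon_s` seconds before the action starts, and inference itself
    takes `latency_s` seconds, so observation ends that much earlier.
    """
    if horizon_s < 0 or latency_s < 0:
        raise ValueError("horizon_s and latency_s must be non-negative")
    return action_start_s - horizon_s - latency_s


# Hypothetical action starting at t = 10 s with a 1 s forecasting horizon.
offline = observation_cutoff(10.0, 1.0)              # latency ignored
realtime_fast = observation_cutoff(10.0, 1.0, 0.04)  # hypothetical 40 ms model
realtime_slow = observation_cutoff(10.0, 1.0, 0.4)   # hypothetical 400 ms model

# A slower model sees less recent evidence than a faster one.
print(offline, realtime_fast, realtime_slow)
```

Under this reading, the offline setting corresponds to `latency_s = 0`, and any nonzero latency moves the cutoff further into the past, which is one way the trade-off described above could arise.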
Pages: 18759 - 18769
Page count: 11