We present RAFTformer, a real-time action forecasting transformer for latency-aware action forecasting in real-world settings. RAFTformer is a two-stage, fully transformer-based architecture comprising a video transformer backbone that operates on high-resolution, short-range clips, and a head transformer encoder that temporally aggregates information from multiple short-range clips to span a long-term horizon. Additionally, we propose a novel self-supervised shuffled causal masking scheme as a model-level augmentation that improves forecasting fidelity. Finally, we propose a novel real-time evaluation setting for action forecasting that directly couples model inference latency to overall forecasting performance, exposing a hitherto overlooked trade-off between latency and forecasting accuracy. Our parsimonious network design enables RAFTformer to achieve 9x lower inference latency than prior works at the same forecasting accuracy. Owing to its two-stage design, RAFTformer uses 94% less training compute and 90% fewer training parameters while outperforming prior state-of-the-art baselines by 4.9 points on EGTEA Gaze+ and by 1.4 points on the EPIC-Kitchens-100 validation set, as measured by Top-5 recall (T5R) in the offline setting. In the real-time setting, RAFTformer outperforms prior works by an even greater margin of up to 4.4 T5R points on the EPIC-Kitchens-100 dataset. Project webpage: https://karttikeya.github.io/publication/RAFTformer/.
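The abstract describes the shuffled causal masking scheme only at a high level. The following is a minimal, hypothetical PyTorch sketch of one plausible reading: a causal attention mask applied over a random permutation of token positions instead of the natural temporal order. The function name `shuffled_causal_mask` and its interface are illustrative assumptions, not the paper's implementation.

```python
import torch

def shuffled_causal_mask(num_tokens: int, generator=None) -> torch.Tensor:
    """Boolean attention mask (True = attention allowed) that is causal
    with respect to a random permutation of token positions.

    NOTE: This is an illustrative interpretation of "shuffled causal
    masking", not the authors' published code. Under a standard causal
    mask, token i attends to tokens 0..i; here, each token attends only
    to tokens that precede it (or itself) in a randomly shuffled order.
    """
    perm = torch.randperm(num_tokens, generator=generator)
    # rank[p] = position of token p in the shuffled order
    rank = torch.empty_like(perm)
    rank[perm] = torch.arange(num_tokens)
    # Token q may attend to token k iff k appears no later than q
    # in the shuffled order.
    return rank.unsqueeze(1) >= rank.unsqueeze(0)

# Example: a 6-token mask, usable as the `attn_mask` argument of
# torch.nn.functional.scaled_dot_product_attention.
mask = shuffled_causal_mask(6)
print(mask.int())
```

Each call samples a fresh permutation, so using such a mask during training would expose the model to varied causal orderings of the same clip features, which is one way a model-level augmentation of this kind could be realized.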