What Can Simple Arithmetic Operations Do for Temporal Modeling?

Cited by: 1
Authors
Wu, Wenhao [1, 2]
Song, Yuxin [2]
Sun, Zhun [2]
Wang, Jingdong [2]
Xu, Chang [1]
Ouyang, Wanli [1, 3]
Affiliations
[1] Univ Sydney, Sydney, NSW, Australia
[2] Baidu Inc, Beijing, Peoples R China
[3] Shanghai AI Lab, Shanghai, Peoples R China
Source
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023
DOI
10.1109/ICCV51070.2023.01261
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Temporal modeling plays a crucial role in understanding video content. To tackle this problem, previous studies have built complicated temporal relations across the time sequence, enabled by increasingly powerful computing devices. In this work, we explore the potential of four simple arithmetic operations for temporal modeling. Specifically, we first capture auxiliary temporal cues by computing addition, subtraction, multiplication, and division between pairs of extracted frame features. Then, we extract corresponding features from these cues to benefit the original temporal-irrespective domain. We term this simple pipeline the Arithmetic Temporal Module (ATM), which operates on the stem of a visual backbone in a plug-and-play style. We conduct comprehensive ablation studies on the instantiation of ATMs and demonstrate that this module provides powerful temporal modeling capability at a low computational cost. Moreover, the ATM is compatible with both CNN- and ViT-based architectures. Our results show that ATM achieves superior performance on several popular video benchmarks. Specifically, on Something-Something V1, V2, and Kinetics-400, we reach top-1 accuracies of 65.6%, 74.6%, and 89.4%, respectively. The code is available at https://github.com/whwu95/ATM.
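The first step described in the abstract, computing the four arithmetic operations between pairs of frame features, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation (see the repository above for that). It assumes that pairs are adjacent frames, that the operations are element-wise, and that division needs a small stabilizer; none of these details are stated in the abstract. The names `safe_div`, `arithmetic_cues`, `fuse`, and `alpha` are hypothetical, and `fuse` is only a stand-in for the learned feature extraction step, which the abstract does not specify.

```python
import math
from typing import Dict, List

Vector = List[float]


def safe_div(a: float, b: float, eps: float = 1e-6) -> float:
    """Divide a by b, keeping the denominator at least eps in magnitude.

    The eps stabilizer is an assumption, not taken from the abstract.
    """
    denom = math.copysign(max(abs(b), eps), b)
    return a / denom


def arithmetic_cues(frames: List[Vector]) -> Dict[str, List[Vector]]:
    """Compute element-wise +, -, *, / for each pair of adjacent frame features.

    Assumption: a "pair" is (frame t, frame t+1); the abstract only says "pairs".
    For T frames this yields T-1 cue vectors per operation.
    """
    ops = {
        "add": lambda a, b: a + b,
        "sub": lambda a, b: b - a,          # later frame minus earlier frame
        "mul": lambda a, b: a * b,
        "div": lambda a, b: safe_div(b, a),  # later frame over earlier frame
    }
    cues: Dict[str, List[Vector]] = {name: [] for name in ops}
    for prev, nxt in zip(frames, frames[1:]):
        for name, op in ops.items():
            cues[name].append([op(a, b) for a, b in zip(prev, nxt)])
    return cues


def fuse(frames: List[Vector], cue: List[Vector], alpha: float = 0.5) -> List[Vector]:
    """Hypothetical residual fusion: add the scaled cue to the earlier frame of each pair.

    Stand-in for the learned extraction step; the last frame has no
    following frame under the adjacent-pair assumption and is left unchanged.
    """
    fused = [list(f) for f in frames]
    for t, c in enumerate(cue):
        fused[t] = [x + alpha * y for x, y in zip(frames[t], c)]
    return fused


# Three toy "frame features" of dimension 2.
frames = [[1.0, 2.0], [2.0, 4.0], [4.0, 2.0]]
cues = arithmetic_cues(frames)
fused = fuse(frames, cues["sub"])
```

In this sketch the subtraction cue behaves like a frame difference, while the other three operations give alternative pairwise signals. The output of `fuse` keeps the shape of the input, which mirrors the plug-and-play idea of feeding temporal cues back into per-frame features.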
Pages: 13666-13676
Page count: 11