Slowfast Diversity-aware Prototype Learning for Egocentric Action Recognition

被引:0
|
作者
Dai, Guangzhao [1 ]
Shu, Xiangbo [1 ]
Yan, Rui [2 ]
Huang, Peng [1 ]
Tang, Jinhui [1 ]
机构
[1] Nanjing Univ Sci & Technol, Nanjing, Peoples R China
[2] Nanjing Univ, Nanjing, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金; 中国博士后科学基金;
关键词
Egocentric Action Recognition; Prototype Learning; Video Understanding;
D O I
10.1145/3581783.3612144
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Egocentric Action Recognition (EAR) is required to recognize both the interacting objects (noun) and the motion (verb) against cluttered backgrounds with distracting objects. For capturing interacting objects, traditional approaches heavily rely on luxury object annotations or detectors, though a few works heuristically enumerate the fixed sets of verb-constrained prototypes to roughly exclude the background. For capturing motion, the inherent variations of motion duration among egocentric videos with different lengths are almost ignored. To this end, we propose a novel Slowfast Diversity-aware Prototype learning (SDP) to effectively capture interacting objects by learning compact yet diverse prototypes, and adaptively capture motion in either long-time video or short-time video. Specifically, we present a new Part-to-Prototype (P2P) scheme to learn prototypes from raw videos covering the interacting objects by refining the semantic information from part level to prototype level. Moreover, for adaptively capturing motion, we design a new Slow-Fast Context (SFC) mechanism that explores the Up/Down augmentations for the prototype representation at the semantic level to strengthen the transient dynamic information in short-time videos and eliminate the redundant dynamic information in longtime videos, which are further fine-complemented via the slow-and fast-aware attentions. Extensive experiments demonstrate SDP outperforms state-of-the-art methods on two large-scale egocentric video benchmarks, i.e., EPIC-KITCHENS-100 and EGTEA.
引用
下载
收藏
页码:7549 / 7558
页数:10
相关论文
共 50 条
  • [21] Categorical Diversity-Aware Inner Product Search
    Hirata, Kohei
    Amagata, Daichi
    Fujita, Sumio
    Hara, Takahiro
    IEEE ACCESS, 2023, 11 : 2586 - 2596
  • [22] Diversity-aware strategies for static index pruning
    Yigit-Sert, Sevgi
    Altingovde, Ismail Sengor
    Ulusoy, Ozgur
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (05)
  • [23] Productive fitness in diversity-aware evolutionary algorithms
    Gabor, Thomas
    Phan, Thomy
    Linnhoff-Popien, Claudia
    NATURAL COMPUTING, 2021, 20 (03) : 363 - 376
  • [24] A diversity-aware incentive mechanism for cross-silo federated learning with budget constraint
    Wu, Xiaohong
    Lin, Yujun
    Zhong, Haotian
    Tao, Jie
    Gu, Yonggen
    Shen, Shigen
    Yu, Shui
    Knowledge-Based Systems, 2025, 315
  • [25] Post-hoc Diversity-aware Curation of Rankings
    Markos, Vassilis
    Michael, Loizos
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 323 - 334
  • [26] SentiRec: Sentiment Diversity-aware Neural News Recommendation
    Wu, Chuhan
    Wu, Fangzhao
    Qi, Tao
    Huang, Yongfeng
    1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 44 - 53
  • [27] A diversity-aware memetic algorithm for the linear ordering Problem
    Lázaro Lugo
    Carlos Segura
    Gara Miranda
    Memetic Computing, 2022, 14 : 395 - 409
  • [28] An Architecture for Diversity-aware Search for Medical Web Content
    Denecke, K.
    METHODS OF INFORMATION IN MEDICINE, 2012, 51 (06) : 549 - 556
  • [29] A diversity-aware memetic algorithm for the linear ordering Problem
    Lugo, Lazaro
    Segura, Carlos
    Miranda, Gara
    MEMETIC COMPUTING, 2022, 14 (04) : 395 - 409
  • [30] A Diversity-aware Model for Majority Vote Ensemble Accuracy
    Lim, Nick Jin Sean
    Durrant, Robert John
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 4078 - 4086