A survey on deep learning-based spatio-temporal action detection

被引:0
|
作者
Wang, Peng [1 ]
Zeng, Fanwei [2 ]
Qian, Yuntao [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou 310007, Zhejiang, Peoples R China
[2] Ant Grp, Hangzhou 310007, Zhejiang, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Computer vision; deep learning; spatio-temporal action detection; SEARCH;
D O I
10.1142/S0219691323500662
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Spatio-temporal action detection (STAD) aims to classify the actions present in a video and localize them in space and time. It has become a particularly active area of research in computer vision because of its explosively emerging real-world applications, such as autonomous driving, visual surveillance and entertainment. Many efforts have been devoted in recent years to build a robust and effective framework for STAD. This paper provides a comprehensive review of the state-of-the-art deep learning-based methods for STAD. First, a taxonomy is developed to organize these methods. Next, the linking algorithms, which aim to associate the frame- or clip-level detection results together to form action tubes, are reviewed. Then, the commonly used benchmark datasets and evaluation metrics are introduced, and the performance of state-of-the-art models is compared. At last, this paper is concluded, and a set of potential research directions of STAD are discussed.
引用
收藏
页数:35
相关论文
共 50 条
  • [1] Spatio-temporal detection of auroral substorm based on deep learning
    Yang QiuJu
    Ren Jie
    Xiang Han
    CHINESE JOURNAL OF GEOPHYSICS-CHINESE EDITION, 2022, 65 (03): : 898 - 907
  • [2] Spatio-Temporal Deep Learning-Based Methods for Defect Detection: An Industrial Application Study Case
    da Silva, Lucas A.
    dos Santos, Eulanda M.
    Araujo, Leo
    Freire, Natalia S.
    Vasconcelos, Max
    Giusti, Rafael
    Ferreira, David
    Jesus, Anderson S.
    Pimentel, Agemilson
    Cruz, Caio F. S.
    Belem, Ruan J. S.
    Costa, Andre S.
    da Silva, Osmar A.
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [3] Deep Learning for Spatio-Temporal Data Mining: A Survey
    Wang, Senzhang
    Cao, Jiannong
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) : 3681 - 3700
  • [4] Spatio-temporal deep learning fire smoke detection
    Wu Fan
    Wang Hui-qin
    Wang Ke
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2021, 36 (08) : 1186 - 1195
  • [5] Action Recognition Based on Efficient Deep Feature Learning in the Spatio-Temporal Domain
    Husain, Farzad
    Dellen, Babette
    Torras, Carme
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2016, 1 (02): : 984 - 991
  • [6] Reinforcement learning-based estimation for spatio-temporal systems
    Saviz Mowlavi
    Mouhacine Benosman
    Scientific Reports, 14 (1)
  • [7] Deep Learning-Based Action Detection in Untrimmed Videos: A Survey
    Vahdani, Elahe
    Tian, Yingli
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4302 - 4320
  • [8] Automatic Spatio-Temporal Deep Learning-Based Approach for Cardiac Cine MRI Segmentation
    Ammar, Abderazzak
    Bouattane, Omar
    Youssfi, Mohamed
    NETWORKING, INTELLIGENT SYSTEMS AND SECURITY, 2022, 237 : 59 - 73
  • [9] Video action detection by learning graph-based spatio-temporal interactions
    Tomei, Matteo
    Baraldi, Lorenzo
    Calderara, Simone
    Bronzin, Simone
    Cucchiara, Rita
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 206
  • [10] STEP: Spatio-Temporal Progressive Learning for Video Action Detection
    Yang, Xitong
    Yang, Xiaodong
    Liu, Ming-Yu
    Xiao, Fanyi
    Davis, Larry
    Kautz, Jan
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 264 - 272