Improved Spatio-temporal Action Localization for Surveillance Videos

Authors
Liang, Morgan [1 ]
Li, Xun [1 ]
Onie, Sandersan [2 ]
Larsen, Mark [2 ]
Sowmya, Arcot [1 ]
Affiliations
[1] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
[2] Univ New South Wales, Black Dog Inst, Sydney, NSW, Australia
Funding
National Health and Medical Research Council (Australia);
DOI
10.1109/DICTA52665.2021.9647106
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present an improved spatiotemporal action localization framework that operates in an online manner. Current state-of-the-art approaches have achieved remarkable results, mainly due to advances in action recognition models. These approaches commonly follow a two-stage pipeline consisting of a region proposal stage and an action classification stage. Recent improvements in spatiotemporal action localization models have focused on the action classification stage, leaving the outputs of the region proposal stage suboptimal. We believe that the proposal stage remains a crucial component in determining overall model performance. We therefore adopt a tracking model in place of the existing proposal models to generate more accurate and robust regions of interest (RoIs). We evaluate our approach on a private CCTV surveillance dataset and on the challenging JHMDB-21 benchmark, achieving promising results on the private dataset and good results on JHMDB-21.
Pages: 147-154
Page count: 8