Argus plus plus : Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals

被引:6
|
作者
Yu, Lijun [1 ]
Qian, Yijun [1 ]
Liu, Wenhe [1 ]
Hauptmann, Alexander G. [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
基金
美国安德鲁·梅隆基金会;
关键词
D O I
10.1109/WACVW54805.2022.00017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Activity detection is one of the attractive computer vision tasks to exploit the video streams captured by widely installed cameras. Although achieving impressive performance, conventional activity detection algorithms are usually designed under certain constraints, such as using trimmed and/or object-centered video clips as inputs. Therefore, they failed to deal with the multi-scale multi-instance cases in real-world unconstrained video streams, which are untrimmed and have large field-of-views. Real-time requirements for streaming analysis also mark brute force expansion of them unfeasible. To overcome these issues, we propose Argus++, a robust real-time activity detection system for analyzing unconstrained video streams. The design of Argus++ introduces overlapping spatio-temporal cubes as an intermediate concept of activity proposals to ensure coverage and completeness of activity detection through over-sampling. The overall system is optimized for real-time processing on standalone consumer-level hardware. Extensive experiments on different surveillance and driving scenarios demonstrated its superior performance in a series of activity detection benchmarks, including CVPR ActivityNet ActEV 2021, NIST ActEV SDL UF/KF, TRECVID ActEV 2020/2021, and ICCV ROAD 2021.
引用
收藏
页码:112 / 121
页数:10
相关论文
共 50 条
  • [1] Real-time Eye Detection in Video Streams
    Lin, Kunhui
    Huang, Jiyong
    Chen, Jiawei
    Zhou, Changle
    [J]. ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 6, PROCEEDINGS, 2008, : 193 - +
  • [2] Robust real-time detection, tracking, and pose estimation of faces in video streams
    Huang, KS
    Trivedi, MM
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 965 - 968
  • [3] Real-time detection of faces in video streams
    Castrillon-Santana, M
    Déniz-Suárez, O
    Guerra-Artal, C
    Hernández-Tejera, M
    [J]. 2ND CANADIAN CONFERENCE ON COMPUTER AND ROBOT VISION, PROCEEDINGS, 2005, : 298 - 305
  • [4] DeepMark plus plus : Real-time Clothing Detection at the Edge
    Sidnev, Alexey
    Krapivin, Alexander
    Trushkov, Alexey
    Krasikova, Ekaterina
    Kazakov, Maxim
    Viryasov, Mikhail
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2979 - 2987
  • [5] Robust real-time video face recognition system for unconstrained environments
    Rajak, Amir
    Dailey, Matthew N.
    Ekpanyapong, Mongkol
    [J]. INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2022, 2022, 12177
  • [6] Real-Time Deep Video SpaTial Resolution UpConversion SysTem (STRUCT plus plus Demo)
    Yang, Wenhan
    Deng, Shihong
    Hu, Yueyu
    Xing, Junliang
    Liu, Jiaying
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1255 - 1256
  • [7] ROBUSfT: Robust real-time shape-from-template, a C plus plus library
    Shetab-Bushehri, Mohammadreza
    Aranda, Miguel
    Ozgur, Erol
    Mezouar, Youcef
    Bartoli, Adrien
    [J]. IMAGE AND VISION COMPUTING, 2024, 141
  • [8] GeoBurst plus : Effective and Real-Time Local Event Detection in Geo-Tagged Tweet Streams
    Zhang, Chao
    Lei, Dongming
    Yuan, Quan
    Zhuang, Honglei
    Kaplan, Lance
    Wang, Shaowen
    Han, Jiawei
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2018, 9 (03)
  • [9] Real-time shot transition detection in compressed MPEG video streams
    Fouad, Mona A.
    Bayoumi, Fatma M.
    Onsi, Hoda M.
    Darwish, Mohamed G.
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2008, 17 (02)
  • [10] Real-time video breakup detection for multiple HD video streams on a single GPU
    Rosner, Jakub
    Fassold, Hannes
    Winter, Martin
    Schallauer, Peter
    [J]. REAL-TIME IMAGE AND VIDEO PROCESSING 2012, 2012, 8437