Multi-granularity Generator for Temporal Action Proposal

被引:10
|
作者
Liu, Yuan [2 ]
Ma, Lin [1 ]
Zhang, Yifeng [2 ]
Liu, Wei [1 ]
Chang, Shih-Fu [3 ]
机构
[1] Tencent AI Lab, Bellevue, WA 98004 USA
[2] Southeast Univ, Nanjing, Jiangsu, Peoples R China
[3] Columbia Univ, New York, NY 10027 USA
关键词
D O I
10.1109/CVPR.2019.00372
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal action proposal generation is an important task, aiming to localize the video segments containing human actions in an untrimmed video. In this paper, we propose a multi-granularity generator (MGG) to perform the temporal action proposal from different granularity perspectives, relying on the video visual features equipped with the position embedding information. First, we propose to use a bilinear matching model to exploit the rich local information within the video sequence. Afterwards, two components, namely segment proposal producer (SPP) and frame actionness producer (FAP), are combined to perform the task of temporal action proposal at two distinct granularities. SPP considers the whole video in the form of feature pyramid and generates segment proposals from one coarse perspective, while FAP carries out a finer actionness evaluation for each video frame. Our proposed MGG can be trained in an end-to-end fashion. Through temporally adjusting the segment proposals with fine-grained information based on frame actionness, MGG achieves the superior performance over state-of-the-art methods on the public THUMOS-14 and ActivityNet-1.3 datasets. Moreover, we employ existing action classifiers to perform the classification of the proposals generated by MGG, leading to significant improvements compared against the competing methods for the video detection task.
引用
收藏
页码:3599 / 3608
页数:10
相关论文
共 50 条
  • [31] Deconfounded hierarchical multi-granularity classification
    Zhao, Ziyu
    Gan, Leilei
    Shen, Tao
    Kuang, Kun
    Wu, Fei
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 248
  • [32] Differentiable Multi-Granularity Human Parsing
    Zhou, Tianfei
    Yang, Yi
    Wang, Wenguan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8296 - 8310
  • [33] Multi-granularity and metric spatial reasoning
    Wang, Shengsheng
    Liu, Yiting
    Liu, Dayou
    Dickson, Bolou Bolou
    Wang, Xinying
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (06) : 3116 - 3133
  • [34] Multi-granularity visual explanations for CNN
    Bao, Huanan
    Wang, Guoyin
    Li, Shuai
    Liu, Qun
    KNOWLEDGE-BASED SYSTEMS, 2022, 253
  • [35] Multi-Granularity Causal Structure Learning
    Liang, Jiaxuan
    Wang, Jun
    Yu, Guoxian
    Xia, Shuyin
    Wang, Guoyin
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024, : 13727 - 13735
  • [36] Accelerator for multi-granularity attribute reduction
    Jiang, Zehua
    Yang, Xibei
    Yu, Hualong
    Liu, Dun
    Wang, Pingxin
    Qian, Yuhua
    KNOWLEDGE-BASED SYSTEMS, 2019, 177 : 145 - 158
  • [37] Multi-granularity locking for nested transactions
    Lee, J
    Fekete, A
    ACTA INFORMATICA, 1996, 33 (02) : 131 - 152
  • [38] Multi-Granularity Spatio-Temporal Correlation Networks for Stock Trend Prediction
    Chen, Jiahao
    Xie, Liang
    Lin, Wenjing
    Wu, Yuchen
    Xu, Haijiao
    IEEE ACCESS, 2024, 12 : 67219 - 67232
  • [39] The Method of Analysis Granularity Determination for Multi-granularity Time Series
    Chen, Hailan
    Gao, Xuedong
    Du, Qiangbo
    2018 8TH INTERNATIONAL CONFERENCE ON LOGISTICS, INFORMATICS AND SERVICE SCIENCES (LISS), 2018,
  • [40] Multi-granularity locks for XML repetitive
    Lee, E
    FOURTH ANNUAL ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, PROCEEDINGS, 2005, : 222 - 227