EFFICIENT SEARCH OF TOP-K VIDEO SUBVOLUMES FOR MULTI-INSTANCE ACTION DETECTION

被引:4
|
作者
Goussies, Norberto A. [1 ]
Liu, Zicheng [2 ]
Yuan, Junsong [3 ]
机构
[1] Univ Buenos Aires, DC FCEyN, RA-1053 Buenos Aires, DF, Argentina
[2] Microsoft Res, Redmond, WA 98052 USA
[3] Nanyang Technol Univ, Sch EEE, Singapore 39798, Singapore
关键词
branch-and-bound; action recognition;
D O I
10.1109/ICME.2010.5583547
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Action detection was formulated as a subvolume mutual information maximization problem in [8], where each subvolume identifies where and when the action occurs in the video. Despite the fact that the proposed branch-and-bound algorithm can find the best subvolume efficiently for low resolution videos, it is still not efficient enough to perform multi-instance detection in videos of high spatial resolution. In this paper we develop an algorithm that further speeds up the subvolume search and targets on real-time multi-instance action detection for high resolution videos (e.g. 320 x 240 or higher). Unlike the previous branch-and-bound search technique which restarts a new search for each action instance, we find the Top-K subvolumes simultaneously with a single round of search. To handle the larger spatial resolution, we downsample the volume of videos for a more efficient upper-bound estimation. To validate our algorithm, we perform experiments on a challenging dataset of 54 video sequences where each video consists of several actions performed by different people in a crowded environment. The experiments show that our method is not only efficient, but also capable of handling action variations caused by performing speed and style changes, spatial scale changes, as well as cluttered and moving background.
引用
收藏
页码:328 / 333
页数:6
相关论文
共 50 条
  • [1] Efficient Top-k Search for PageRank
    Fujiwara, Yasuhiro
    Nakatsuji, Makoto
    Shiokawa, Hiroaki
    Mishima, Takeshi
    Onizuka, Makoto
    [J]. Transactions of the Japanese Society for Artificial Intelligence, 2015, 30 (02) : 473 - 478
  • [2] Efficient Top-k Closeness Centrality Search
    Olsen, Paul W., Jr.
    Labouseur, Alan G.
    Hwang, Jeong-Hyon
    [J]. 2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 196 - 207
  • [3] Multi-Layer Multi-Instance Learning for Video Concept Detection
    Gu, Zhiwei
    Mei, Tao
    Hua, Xian-Sheng
    Tang, Jinhui
    Wu, Xiuqing
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (08) : 1605 - 1616
  • [4] EFFICIENT INSTANCE ANNOTATION IN MULTI-INSTANCE LEARNING
    Pham, Anh T.
    Raich, Raviv
    Fern, Xiaoli Z.
    [J]. 2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 137 - 140
  • [5] Efficient Top-k Keyword Search on XML Streams
    Li, Lingli
    Wang, Hongzhi
    Li, Jianzhong
    Luo, Jizhou
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE FOR YOUNG COMPUTER SCIENTISTS, VOLS 1-5, 2008, : 1041 - 1046
  • [6] Efficient Top-k Ego-Betweenness Search
    Zhang, Qi
    Li, Rong-Hua
    Pan, Minjia
    Dai, Yongheng
    Wang, Guoren
    Yuan, Ye
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 380 - 392
  • [7] Efficient Top-k Edge Structural Diversity Search
    Zhang, Qi
    Li, Rong-Hua
    Yang, Qixuan
    Wang, Guoren
    Qin, Lu
    [J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 205 - 216
  • [8] Efficient Top-K keyword search on XML streams
    Li, Ling-Li
    Wang, Hong-Zhi
    Gao, Hong
    Li, Jian-Zhong
    [J]. Ruan Jian Xue Bao/Journal of Software, 2012, 23 (06): : 1561 - 1577
  • [9] Marginalized multi-layer multi-instance kernel for video concept detection
    Zha, Zheng-Jun
    Mei, Tao
    Hong, Richang
    Gu, Zhiwei
    [J]. SIGNAL PROCESSING, 2013, 93 (08) : 2119 - 2125
  • [10] Efficient Top-k Graph Similarity Search With GED Constraints
    Kim, Jongik
    [J]. IEEE ACCESS, 2022, 10 : 79180 - 79191