EFFICIENT SEARCH OF TOP-K VIDEO SUBVOLUMES FOR MULTI-INSTANCE ACTION DETECTION

被引：4

作者：

Goussies, Norberto A. ^{[1
]}

Liu, Zicheng ^{[2
]}

Yuan, Junsong ^{[3
]}

机构：

[1] Univ Buenos Aires, DC FCEyN, RA-1053 Buenos Aires, DF, Argentina

[2] Microsoft Res, Redmond, WA 98052 USA

[3] Nanyang Technol Univ, Sch EEE, Singapore 39798, Singapore

来源：

2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010) | 2010年

关键词：

branch-and-bound; action recognition;

D O I：

10.1109/ICME.2010.5583547

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Action detection was formulated as a subvolume mutual information maximization problem in [8], where each subvolume identifies where and when the action occurs in the video. Despite the fact that the proposed branch-and-bound algorithm can find the best subvolume efficiently for low resolution videos, it is still not efficient enough to perform multi-instance detection in videos of high spatial resolution. In this paper we develop an algorithm that further speeds up the subvolume search and targets on real-time multi-instance action detection for high resolution videos (e.g. 320 x 240 or higher). Unlike the previous branch-and-bound search technique which restarts a new search for each action instance, we find the Top-K subvolumes simultaneously with a single round of search. To handle the larger spatial resolution, we downsample the volume of videos for a more efficient upper-bound estimation. To validate our algorithm, we perform experiments on a challenging dataset of 54 video sequences where each video consists of several actions performed by different people in a crowded environment. The experiments show that our method is not only efficient, but also capable of handling action variations caused by performing speed and style changes, spatial scale changes, as well as cluttered and moving background.

引用

页码：328 / 333

页数：6

共 50 条

[1] Efficient Top-k Search for PageRank
Fujiwara, Yasuhiro
Nakatsuji, Makoto
Shiokawa, Hiroaki
Mishima, Takeshi
Onizuka, Makoto
[J]. Transactions of the Japanese Society for Artificial Intelligence, 2015, 30 (02) : 473 - 478
[2] Efficient Top-k Closeness Centrality Search
Olsen, Paul W., Jr.
Labouseur, Alan G.
Hwang, Jeong-Hyon
[J]. 2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 196 - 207
[3] Multi-Layer Multi-Instance Learning for Video Concept Detection
Gu, Zhiwei
Mei, Tao
Hua, Xian-Sheng
Tang, Jinhui
Wu, Xiuqing
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (08) : 1605 - 1616
[4] EFFICIENT INSTANCE ANNOTATION IN MULTI-INSTANCE LEARNING
Pham, Anh T.
Raich, Raviv
Fern, Xiaoli Z.
[J]. 2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 137 - 140
[5] Efficient Top-k Keyword Search on XML Streams
Li, Lingli
Wang, Hongzhi
Li, Jianzhong
Luo, Jizhou
[J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE FOR YOUNG COMPUTER SCIENTISTS, VOLS 1-5, 2008, : 1041 - 1046
[6] Efficient Top-k Ego-Betweenness Search
Zhang, Qi
Li, Rong-Hua
Pan, Minjia
Dai, Yongheng
Wang, Guoren
Yuan, Ye
[J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 380 - 392
[7] Efficient Top-k Edge Structural Diversity Search
Zhang, Qi
Li, Rong-Hua
Yang, Qixuan
Wang, Guoren
Qin, Lu
[J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 205 - 216
[8] Efficient Top-K keyword search on XML streams
Li, Ling-Li
Wang, Hong-Zhi
Gao, Hong
Li, Jian-Zhong
[J]. Ruan Jian Xue Bao/Journal of Software, 2012, 23 (06): : 1561 - 1577
[9] Marginalized multi-layer multi-instance kernel for video concept detection
Zha, Zheng-Jun
Mei, Tao
Hong, Richang
Gu, Zhiwei
[J]. SIGNAL PROCESSING, 2013, 93 (08) : 2119 - 2125
[10] Efficient Top-k Graph Similarity Search With GED Constraints
Kim, Jongik
[J]. IEEE ACCESS, 2022, 10 : 79180 - 79191

← 1 2 3 4 5 →