Classifying web videos using a global video descriptor

被引：62

作者：

Solmaz, Berkan ^{[1
]}

Assari, Shayan Modiri ^{[1
]}

Shah, Mubarak ^{[1
]}

机构：

[1] Univ Cent Florida, Orlando, FL 32816 USA

来源：

MACHINE VISION AND APPLICATIONS | 2013年 / 24卷 / 07期

关键词：

Video descriptors; Action recognition; Frequency spectrum; Spatio-temporal analysis; RECOGNITION;

D O I：

10.1007/s00138-012-0449-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Computing descriptors for videos is a crucial task in computer vision. In this paper, we propose a global video descriptor for classification of videos. Our method, bypasses the detection of interest points, the extraction of local video descriptors and the quantization of descriptors into a code book; it represents each video sequence as a single feature vector. Our global descriptor is computed by applying a bank of 3-D spatio-temporal filters on the frequency spectrum of a video sequence; hence, it integrates the information about the motion and scene structure. We tested our approach on three datasets, KTH (Schuldt et al., Proceedings of the 17th international conference on, pattern recognition (ICPR'04), vol. 3, pp. 32-36, 2004), UCF50 (http://vision.eecs.ucf.edu/datasetsActions.html) and HMDB51 (Kuehne et al., HMDB: a large video database for human motion recognition, 2011), and obtained promising results which demonstrate the robustness and the discriminative power of our global video descriptor for classifying videos of various actions. In addition, the combination of our global descriptor and a local descriptor resulted in the highest classification accuracies on UCF50 and HMDB51 datasets.

引用

页码：1473 / 1485

页数：13

共 50 条

[1] Classifying web videos using a global video descriptor
Berkan Solmaz
Shayan Modiri Assari
Mubarak Shah
[J]. Machine Vision and Applications, 2013, 24 : 1473 - 1485
[2] An Extensible, Modular Framework for Classifying YouTube Videos Using Web and Social Media
Alsafrjalani, Mohamad Hammam
[J]. 2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 459 - 462
[3] Automatic Construction of an Action Video Shot Database using Web Videos
Nga, Do Hang
Yanai, Keiji
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 527 - 534
[4] Object matching in videos using Rotational Signal Descriptor
Venkatrayappa, Darshan
Montesinos, Philippe
Diep, Daniel
[J]. THREE-DIMENSIONAL IMAGE PROCESSING, MEASUREMENT (3DIPM), AND APPLICATIONS 2015, 2015, 9393
[5] Video Descriptor Using Attention Mechanism
Ahuja, Stuti
Sheikh, Aftaabahmed
Nadar, Shubhadarshini
Shunmugaperumal, Vanitha
[J]. ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I, 2022, 1613 : 168 - 178
[6] Classifying web metrics using the web quality model
Calero, C
Ruiz, J
Piattini, M
[J]. ONLINE INFORMATION REVIEW, 2005, 29 (03) : 227 - 248
[7] EndoXplore: A Web-based Video Explorer for Endoscopic Videos
Muenzer, Bernd
Schoeffmann, Klaus
Boeszoermenyi, Laszlo
[J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2017, : 366 - 367
[8] Action recognition in depth videos using hierarchical gaussian descriptor
Nguyen, Xuan Son
Mouaddib, Abdel-Illah
Thanh Phuong Nguyen
Jeanpierre, Laurent
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (16) : 21617 - 21652
[9] Action recognition in depth videos using hierarchical gaussian descriptor
Xuan Son Nguyen
Abdel-Illah Mouaddib
Thanh Phuong Nguyen
Laurent Jeanpierre
[J]. Multimedia Tools and Applications, 2018, 77 : 21617 - 21652
[10] PORNOGRAPHY DETECTION USING BOSSANOVA VIDEO DESCRIPTOR
Caetano, Carlos
Avila, Sandra
Guimaraes, Silvio
Araujo, Arnaldo de A.
[J]. 2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 1681 - 1685

← 1 2 3 4 5 →