Classifying web videos using a global video descriptor

Cited by: 62
Authors
Solmaz, Berkan [1]
Assari, Shayan Modiri [1]
Shah, Mubarak [1]
Affiliation
[1] Univ Cent Florida, Orlando, FL 32816 USA
Keywords
Video descriptors; Action recognition; Frequency spectrum; Spatio-temporal analysis; Recognition
DOI
10.1007/s00138-012-0449-x
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Computing descriptors for videos is a crucial task in computer vision. In this paper, we propose a global video descriptor for classification of videos. Our method bypasses the detection of interest points, the extraction of local video descriptors, and the quantization of descriptors into a code book; it represents each video sequence as a single feature vector. Our global descriptor is computed by applying a bank of 3-D spatio-temporal filters on the frequency spectrum of a video sequence; hence, it integrates information about both motion and scene structure. We tested our approach on three datasets: KTH (Schuldt et al., Proceedings of the 17th International Conference on Pattern Recognition (ICPR'04), vol. 3, pp. 32-36, 2004), UCF50 (http://vision.eecs.ucf.edu/datasetsActions.html) and HMDB51 (Kuehne et al., HMDB: a large video database for human motion recognition, 2011). The results are promising and demonstrate the robustness and discriminative power of our global video descriptor for classifying videos of various actions. In addition, the combination of our global descriptor and a local descriptor resulted in the highest classification accuracies on the UCF50 and HMDB51 datasets.
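The abstract gives only the outline of the descriptor (a bank of 3-D spatio-temporal filters applied to the frequency spectrum, yielding one feature vector per video). The following is a minimal sketch of that general idea, not the authors' implementation. The Gaussian band-pass filter shape, the centre frequencies, `sigma`, the grid-average pooling, and the function names `gaussian_bandpass_bank` and `global_descriptor` are all assumptions made for illustration.

```python
import numpy as np


def gaussian_bandpass_bank(shape, centers, sigma=0.1):
    """Build 3-D Gaussian band-pass filters in the frequency domain.

    shape   -- (T, H, W) of the video volume
    centers -- (ft, fy, fx) centre frequencies in cycles/sample (assumed values)
    sigma   -- filter bandwidth (assumed value)
    """
    ft = np.fft.fftfreq(shape[0])[:, None, None]
    fy = np.fft.fftfreq(shape[1])[None, :, None]
    fx = np.fft.fftfreq(shape[2])[None, None, :]
    bank = []
    for ct, cy, cx in centers:
        dist2 = (ft - ct) ** 2 + (fy - cy) ** 2 + (fx - cx) ** 2
        bank.append(np.exp(-dist2 / (2.0 * sigma ** 2)))
    return bank


def global_descriptor(video, bank, grid=(2, 2, 2)):
    """Filter the 3-D spectrum of a video and pool responses into one vector.

    Pooling by averaging over a coarse spatio-temporal grid is an assumption;
    the abstract does not say how responses are summarised.
    """
    spectrum = np.fft.fftn(video)  # 3-D frequency spectrum of the clip
    features = []
    for filt in bank:
        # Apply the filter in the frequency domain, then return to the
        # space-time domain and keep the response magnitude.
        response = np.abs(np.fft.ifftn(spectrum * filt))
        for t_blk in np.array_split(response, grid[0], axis=0):
            for y_blk in np.array_split(t_blk, grid[1], axis=1):
                for x_blk in np.array_split(y_blk, grid[2], axis=2):
                    features.append(x_blk.mean())
    return np.asarray(features)


# Synthetic grayscale clip: 16 frames of 32 x 32 pixels (placeholder data).
rng = np.random.default_rng(0)
video = rng.random((16, 32, 32))
centers = [(0.0, 0.0, 0.25), (0.0, 0.25, 0.0),
           (0.25, 0.0, 0.0), (0.25, 0.25, 0.25)]
bank = gaussian_bandpass_bank(video.shape, centers)
desc = global_descriptor(video, bank)
print(desc.shape)  # (32,) = 4 filters x 8 grid cells
```

A vector of this kind could then be passed to any standard classifier. No interest-point detection or codebook quantization appears in the sketch, which is the property the abstract emphasises.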
Pages: 1473-1485
Number of pages: 13
Related papers
50 in total
  • [1] Classifying web videos using a global video descriptor
    Berkan Solmaz
    Shayan Modiri Assari
    Mubarak Shah
    [J]. Machine Vision and Applications, 2013, 24 : 1473 - 1485
  • [2] An Extensible, Modular Framework for Classifying YouTube Videos Using Web and Social Media
    Alsafrjalani, Mohamad Hammam
    [J]. 2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 459 - 462
  • [3] Automatic Construction of an Action Video Shot Database using Web Videos
    Nga, Do Hang
    Yanai, Keiji
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 527 - 534
  • [4] Object matching in videos using Rotational Signal Descriptor
    Venkatrayappa, Darshan
    Montesinos, Philippe
    Diep, Daniel
    [J]. THREE-DIMENSIONAL IMAGE PROCESSING, MEASUREMENT (3DIPM), AND APPLICATIONS 2015, 2015, 9393
  • [5] Video Descriptor Using Attention Mechanism
    Ahuja, Stuti
    Sheikh, Aftaabahmed
    Nadar, Shubhadarshini
    Shunmugaperumal, Vanitha
    [J]. ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I, 2022, 1613 : 168 - 178
  • [6] Classifying web metrics using the web quality model
    Calero, C
    Ruiz, J
    Piattini, M
    [J]. ONLINE INFORMATION REVIEW, 2005, 29 (03) : 227 - 248
  • [7] EndoXplore: A Web-based Video Explorer for Endoscopic Videos
    Muenzer, Bernd
    Schoeffmann, Klaus
    Boeszoermenyi, Laszlo
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2017, : 366 - 367
  • [8] Action recognition in depth videos using hierarchical gaussian descriptor
    Nguyen, Xuan Son
    Mouaddib, Abdel-Illah
    Thanh Phuong Nguyen
    Jeanpierre, Laurent
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (16) : 21617 - 21652
  • [9] Action recognition in depth videos using hierarchical gaussian descriptor
    Xuan Son Nguyen
    Abdel-Illah Mouaddib
    Thanh Phuong Nguyen
    Laurent Jeanpierre
    [J]. Multimedia Tools and Applications, 2018, 77 : 21617 - 21652
  • [10] PORNOGRAPHY DETECTION USING BOSSANOVA VIDEO DESCRIPTOR
    Caetano, Carlos
    Avila, Sandra
    Guimaraes, Silvio
    Araujo, Arnaldo de A.
    [J]. 2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 1681 - 1685