Recognizing Realistic Actions from Videos "in the Wild"

被引:0
|
作者
Liu, Jingen [1 ]
Luo, Jiebo [2 ]
Shah, Mubarak [1 ]
机构
[1] Univ Cent Florida, Comp Vis Lab, Orlando, FL 32816 USA
[2] Eastman Kodak Co, Kodak Res Lab, Rochester, NY USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a systematic framework for recognizing realistic actions from videos "in the wild." Such unconstrained videos are abundant in personal collections as well as on the web. Recognizing action from such videos has not been addressed extensively, primarily due to the tremendous variations that result from camera motion, background clutter, changes in object appearance, and scale, etc. The main challenge is how to extract reliable and informative features from the unconstrained videos. We extract both motion and static features from the videos. Since the raw features of both types are dense yet noisy, we propose strategies to prune these features. We use motion statistics to acquire stable motion features and clean static features. Furthermore, PageRank is used to mine the most informative static features. In order to further construct compact yet discriminative visual vocabularies, a divisive information-theoretic algorithm is employed to group semantically related features. Finally, AdaBoost is chosen to integrate all the heterogeneous yet complementary features for recognition. We have tested the framework on the KTH dataset and our own dataset consisting of 11 categories of actions collected from YouTube and personal videos, and have obtained impressive results for action recognition and action localization.
引用
收藏
页码:1996 / +
页数:2
相关论文
共 50 条
  • [1] Recognizing actions from videos in the wild via adaptive feature fusion
    Yi, Y. (issyy@mail.sysu.edu.cn), 1600, Science Press (36):
  • [2] Action-Scene Model for Recognizing Human Actions from Background in Realistic Videos
    Qu, Wen
    Zhang, Yifei
    Feng, Shi
    Wang, Daling
    Yu, Ge
    WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 566 - 577
  • [3] Recognizing Actions in Videos from Unseen Viewpoints
    Piergiovanni, A. J.
    Ryoo, Michael S.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4122 - 4130
  • [4] Recognizing Actions in Videos under Domain Shift
    Ricci, Elisa
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 671 - 671
  • [5] Recognizing Emotions Based on Human Actions in Videos
    Wang, Guolong
    Qin, Zheng
    Xu, Kaiping
    MULTIMEDIA MODELING, MMM 2017, PT II, 2017, 10133 : 306 - 317
  • [6] Recognizing Micro-Actions and Reactions from Paired Egocentric Videos
    Yonetani, Ryo
    Kitani, Kris M.
    Sato, Yoichi
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2629 - 2638
  • [7] HOW SCENES IMPLY ACTIONS IN REALISTIC VIDEOS?
    Wang, Hongsong
    Wang, Wei
    Wang, Liang
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1619 - 1623
  • [8] Together Recognizing, Localizing and Summarizing Actions in Egocentric Videos
    Sahu, Abhimanyu
    Chowdhury, Ananda S.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4330 - 4340
  • [9] Recognizing Human Actions From Noisy Videos via Multiple Instance Learning
    Sener, Fadime
    Samet, Nermin
    Duygulu, Pinar
    Ikizler-Cinbis, Nazli
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [10] RECOGNIZING FALL ACTIONS FROM VIDEOS USING RECONSTRUCTION ERROR OF VARIATIONAL AUTOENCODER
    Zhou, Jiaxin
    Komuro, Takashi
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3372 - 3376