Automatic Annotation of Human Actions in Video

被引:105
|
作者
Duchenne, Olivier [1 ]
Laptev, Ivan [1 ]
Sivic, Josef [1 ]
Bach, Francis [1 ]
Ponce, Jean [1 ]
机构
[1] INRIA, Ecole Normale Super, Paris, France
关键词
D O I
10.1109/ICCV.2009.5459279
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the problem of automatic temporal annotation of realistic human actions in video using minimal manual supervision. To this end we consider two associated problems: (a) weakly-supervised learning of action models from readily available annotations, and (b) temporal localization of human actions in test videos. To avoid the prohibitive cost of manual annotation for training, we use movie scripts as a means of weak supervision. Scripts, however, provide only implicit, noisy, and imprecise information about the type and location of actions in video. We address this problem with a kernel-based discriminative clustering algorithm that locates actions in the weakly-labeled training data. Using the obtained action samples, we train temporal action detectors and apply them to locate actions in the raw video data. Our experiments demonstrate that the proposed method for weakly-supervised learning of action models leads to significant improvement in action detection. We present detection results for three action classes in four feature length movies with challenging and realistic video data.
引用
收藏
页码:1491 / 1498
页数:8
相关论文
共 50 条
  • [31] A methodology for image annotation of human actions in videos
    Moomina Waheed
    Shahid Hussain
    Arif Ali Khan
    Mansoor Ahmed
    Bashir Ahmad
    Multimedia Tools and Applications, 2020, 79 : 24347 - 24365
  • [32] Players Tracking and Ball Detection for an Automatic Tennis Video Annotation
    Teachabarikiti, Kosit
    Chalidabhongse, Thanarat H.
    Thammano, Arit
    11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 2491 - 2494
  • [33] Automatic Bharatanatyam Dance Video Annotation Tool Using CNN
    Bhuyan, Himadri
    Das, Partha Pratim
    Tewari, Vishal
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I, 2024, 2009 : 512 - 522
  • [34] Enhanced semi-supervised learning for automatic video annotation
    Wang, Meng
    Hua, Xian-Sheng
    Dai, Li-Rong
    Song, Yan
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1485 - +
  • [35] Anomaly Detection and Knowledge Transfer in Automatic Sports Video Annotation
    Almajai, I.
    Yan, F.
    de Campos, T.
    Khan, A.
    Christmas, W.
    Windridge, D.
    Kittler, J.
    DETECTION AND IDENTIFICATION OF RARE AUDIOVISUAL CUES, 2012, 384 : 109 - 117
  • [36] Automatic face annotation in TV series by video/script alignment
    Zhang, Yifan
    Tang, Zhiqang
    Zhang, Chunjie
    Liu, Jing
    Lu, Hanqing
    NEUROCOMPUTING, 2015, 152 : 316 - 321
  • [37] Automatic target tracking for unmanned aerial vehicle video annotation
    Zhang, SQ
    Karim, MA
    OPTICAL ENGINEERING, 2004, 43 (08) : 1867 - 1873
  • [38] Automatic detection and recognition of athlete actions in diving video
    Li, Haojie
    Wu, Si
    Ba, Shan
    Lin, Shouxun
    Zhang, Yongdong
    ADVANCES IN MULTIMEDIA MODELING, PT 2, 2007, 4352 : 73 - +
  • [39] Human object annotation for surveillance video forensics
    Fraz, Muhammad
    Zafar, Iffat
    Tzanidou, Giounona
    Edirisinghe, Eran A.
    Sarfraz, Muhammad Saquib
    JOURNAL OF ELECTRONIC IMAGING, 2013, 22 (04)
  • [40] Analysis of Human Actions for Video Indexing
    Chen, Zhuoyuan
    Cui, Peng
    Sun, Lifeng
    Yang, Shiqiang
    Advances in Multimedia Information Processing - PCM 2008, 9th Pacific Rim Conference on Multimedia, 2008, 5353 : 635 - 642