Automatic Annotation of Human Actions in Video

Cited by: 105
Authors
Duchenne, Olivier [1]
Laptev, Ivan [1]
Sivic, Josef [1]
Bach, Francis [1]
Ponce, Jean [1]
Affiliations
[1] INRIA, École Normale Supérieure, Paris, France
DOI
10.1109/ICCV.2009.5459279
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
This paper addresses the problem of automatic temporal annotation of realistic human actions in video using minimal manual supervision. To this end we consider two associated problems: (a) weakly-supervised learning of action models from readily available annotations, and (b) temporal localization of human actions in test videos. To avoid the prohibitive cost of manual annotation for training, we use movie scripts as a means of weak supervision. Scripts, however, provide only implicit, noisy, and imprecise information about the type and location of actions in video. We address this problem with a kernel-based discriminative clustering algorithm that locates actions in the weakly-labeled training data. Using the obtained action samples, we train temporal action detectors and apply them to locate actions in the raw video data. Our experiments demonstrate that the proposed method for weakly-supervised learning of action models leads to significant improvement in action detection. We present detection results for three action classes in four feature length movies with challenging and realistic video data.
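The temporal localization step mentioned in the abstract can be illustrated with a generic sliding-window detector followed by temporal non-maximum suppression. This is a minimal sketch under stated assumptions, not the authors' implementation: the per-frame scores, the window length, the stride, and both thresholds are made-up values, and the helper names (`sliding_windows`, `temporal_nms`, `temporal_iou`) are hypothetical. In the paper the scores would come from a trained action classifier; here they are a toy list.

```python
from typing import List, Tuple

# (start_frame, end_frame_exclusive, score); a hypothetical representation.
Window = Tuple[int, int, float]


def temporal_iou(a: Window, b: Window) -> float:
    """Intersection-over-union of two temporal intervals."""
    inter = max(0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


def sliding_windows(frame_scores: List[float], length: int, stride: int) -> List[Window]:
    """Score each fixed-length window by the mean of its per-frame scores."""
    windows = []
    for start in range(0, len(frame_scores) - length + 1, stride):
        end = start + length
        score = sum(frame_scores[start:end]) / length
        windows.append((start, end, score))
    return windows


def temporal_nms(windows: List[Window], threshold: float, max_iou: float) -> List[Window]:
    """Keep high-scoring windows, dropping any that overlap a better one too much."""
    candidates = sorted(
        (w for w in windows if w[2] >= threshold),
        key=lambda w: w[2],
        reverse=True,
    )
    kept: List[Window] = []
    for w in candidates:
        if all(temporal_iou(w, k) <= max_iou for k in kept):
            kept.append(w)
    return kept


# Toy per-frame scores (assumed, not from the paper): one burst around frames 4-7.
scores = [0.0, 0.1, 0.0, 0.2, 0.9, 1.0, 0.8, 0.9, 0.1, 0.0, 0.0, 0.1]
detections = temporal_nms(
    sliding_windows(scores, length=4, stride=1),
    threshold=0.5,
    max_iou=0.3,
)
```

With these toy values, several overlapping windows pass the score threshold, and the suppression step keeps only the best one, covering frames 4 to 7.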
Pages: 1491-1498
Page count: 8