Automatic Annotation of Human Actions in Video

被引：105

作者：

Duchenne, Olivier ^{[1
]}

Laptev, Ivan ^{[1
]}

Sivic, Josef ^{[1
]}

Bach, Francis ^{[1
]}

Ponce, Jean ^{[1
]}

机构：

[1] INRIA, Ecole Normale Super, Paris, France

来源：

2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2009年

关键词：

D O I：

10.1109/ICCV.2009.5459279

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper addresses the problem of automatic temporal annotation of realistic human actions in video using minimal manual supervision. To this end we consider two associated problems: (a) weakly-supervised learning of action models from readily available annotations, and (b) temporal localization of human actions in test videos. To avoid the prohibitive cost of manual annotation for training, we use movie scripts as a means of weak supervision. Scripts, however, provide only implicit, noisy, and imprecise information about the type and location of actions in video. We address this problem with a kernel-based discriminative clustering algorithm that locates actions in the weakly-labeled training data. Using the obtained action samples, we train temporal action detectors and apply them to locate actions in the raw video data. Our experiments demonstrate that the proposed method for weakly-supervised learning of action models leads to significant improvement in action detection. We present detection results for three action classes in four feature length movies with challenging and realistic video data.

引用

页码：1491 / 1498

页数：8

共 50 条

[31] A methodology for image annotation of human actions in videos
Moomina Waheed
Shahid Hussain
Arif Ali Khan
Mansoor Ahmed
Bashir Ahmad
Multimedia Tools and Applications, 2020, 79 : 24347 - 24365
[32] Players Tracking and Ball Detection for an Automatic Tennis Video Annotation
Teachabarikiti, Kosit
Chalidabhongse, Thanarat H.
Thammano, Arit
11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 2491 - 2494
[33] Automatic Bharatanatyam Dance Video Annotation Tool Using CNN
Bhuyan, Himadri
Das, Partha Pratim
Tewari, Vishal
COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I, 2024, 2009 : 512 - 522
[34] Enhanced semi-supervised learning for automatic video annotation
Wang, Meng
Hua, Xian-Sheng
Dai, Li-Rong
Song, Yan
2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1485 - +
[35] Anomaly Detection and Knowledge Transfer in Automatic Sports Video Annotation
Almajai, I.
Yan, F.
de Campos, T.
Khan, A.
Christmas, W.
Windridge, D.
Kittler, J.
DETECTION AND IDENTIFICATION OF RARE AUDIOVISUAL CUES, 2012, 384 : 109 - 117
[36] Automatic face annotation in TV series by video/script alignment
Zhang, Yifan
Tang, Zhiqang
Zhang, Chunjie
Liu, Jing
Lu, Hanqing
NEUROCOMPUTING, 2015, 152 : 316 - 321
[37] Automatic target tracking for unmanned aerial vehicle video annotation
Zhang, SQ
Karim, MA
OPTICAL ENGINEERING, 2004, 43 (08) : 1867 - 1873
[38] Automatic detection and recognition of athlete actions in diving video
Li, Haojie
Wu, Si
Ba, Shan
Lin, Shouxun
Zhang, Yongdong
ADVANCES IN MULTIMEDIA MODELING, PT 2, 2007, 4352 : 73 - +
[39] Human object annotation for surveillance video forensics
Fraz, Muhammad
Zafar, Iffat
Tzanidou, Giounona
Edirisinghe, Eran A.
Sarfraz, Muhammad Saquib
JOURNAL OF ELECTRONIC IMAGING, 2013, 22 (04)
[40] Analysis of Human Actions for Video Indexing
Chen, Zhuoyuan
Cui, Peng
Sun, Lifeng
Yang, Shiqiang
Advances in Multimedia Information Processing - PCM 2008, 9th Pacific Rim Conference on Multimedia, 2008, 5353 : 635 - 642

← 1 2 3 4 5 →