Extraction and Classification of Diving Clips from Continuous Video Footage

被引:15
|
作者
Nibali, Aiden [1 ]
He, Zhen [1 ]
Morgan, Stuart [1 ,2 ]
Greenwood, Daniel [2 ]
机构
[1] La Trobe Univ, Bundoora, Vic, Australia
[2] Australian Inst Sport, Bruce, Australia
关键词
ACTION RECOGNITION;
D O I
10.1109/CVPRW.2017.18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to recent advances in technology, the recording and analysis of video data has become an increasingly common component of athlete training programmes. Today it is incredibly easy and affordable to set up a fixed camera and record athletes in a wide range of sports, such as diving, gymnastics, golf, tennis, etc. However, the manual analysis of the obtained footage is a time-consuming task which involves isolating actions of interest and categorizing them using domain-specific knowledge. In order to automate this kind of task, three challenging sub-problems are often encountered: 1) temporally cropping events/actions of interest from continuous video; 2) tracking the object of interest; and 3) classifying the events/actions of interest. Most previous work has focused on solving just one of the above sub-problems in isolation. In contrast, this paper provides a complete solution to the overall action monitoring task in the context of a challenging real-world exemplar. Specifically, we address the problem of diving classification. This is a challenging problem since the person (diver) of interest typically occupies fewer than 1% of the pixels in each frame. The model is required to learn the temporal boundaries of a dive, even though other divers and bystanders may be in view. Finally, the model must be sensitive to subtle changes in body pose over a large number of frames to determine the classification code. We provide effective solutions to each of the sub-problems which combine to provide a highly functional solution to the task as a whole. The techniques proposed can be easily generalized to video footage recorded from other sports.
引用
收藏
页码:94 / 104
页数:11
相关论文
共 50 条
  • [41] Towards realistic facial behaviour in humanoids - Mapping from video footage to a robot head
    Jaeckel, Peter
    Campbell, Neill
    Melhuish, Chris
    2007 IEEE 10TH INTERNATIONAL CONFERENCE ON REHABILITATION ROBOTICS, VOLS 1 AND 2, 2007, : 833 - +
  • [42] Application of the Visual Fast Count for the quantification of temperate epibenthic communities from video footage
    Strong, James Asa
    Service, Matthew
    Mitchell, Annika Jane
    JOURNAL OF THE MARINE BIOLOGICAL ASSOCIATION OF THE UNITED KINGDOM, 2006, 86 (05) : 939 - 945
  • [43] Using video clips from "The Office" to illustrate organizational behavior concepts
    Cain, Jeff
    Policastri, Anne
    CURRENTS IN PHARMACY TEACHING AND LEARNING, 2013, 5 (06) : 620 - 625
  • [44] SCSampler: Sampling Salient Clips from Video for Efficient Action Recognition
    Korbar, Bruno
    Tran, Du
    Torresani, Lorenzo
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6241 - 6251
  • [45] Scene Extraction for Video Clips based on the Relation of Text, Pointing Region and Temporal Duration of User Comments
    Wakamiya, Shoko
    Kitayama, Daisuke
    Sumiya, Kazutoshi
    PROCEEDINGS OF THE 20TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATION, 2009, : 289 - 294
  • [46] Continuous bangla speech segmentation, classification and feature extraction
    Rahman, Md. Mijanur
    Khan, Md. Farukuzzaman
    Bhuiyan, Md. Al-Amin
    International Journal of Computer Science Issues, 2012, 9 (02): : 67 - 75
  • [47] Fast Search for MPEG Video Clips from Large Video Database Using Combined Histogram Features
    Lee, Feifei
    Kotani, Koji
    Chen, Qiu
    Ohmi, Tadahiro
    WORLD CONGRESS ON ENGINEERING, WCE 2010, VOL I, 2010, : 637 - 640
  • [48] Fast search for MPEG video clips from large video database using combined histogram features
    New Industry Creation Hatchery Center, Tohoku University, Sendai, 980-8579, Japan
    不详
    WCE - World Congr. Eng., 1600, (637-640):
  • [49] Video-Based Cryptanalysis: Extracting Cryptographic Keys from Video Footage of a Device's Power LED Captured by Standard Video Cameras
    Nassi, Ben
    Iluz, Etay
    Cohen, Or
    Vayner, Ofek
    Nassi, Dudi
    Zadov, Boris
    Elovici, Yuval
    45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024, 2024, : 2422 - 2440
  • [50] Preliminary yield estimation of the 2020 Beirut explosion using video footage from social media
    S. E. Rigby
    T. J. Lodge
    S. Alotaibi
    A. D. Barr
    S. D. Clarke
    G. S. Langdon
    A. Tyas
    Shock Waves, 2020, 30 : 671 - 675