Extraction and Classification of Diving Clips from Continuous Video Footage

被引:15
|
作者
Nibali, Aiden [1 ]
He, Zhen [1 ]
Morgan, Stuart [1 ,2 ]
Greenwood, Daniel [2 ]
机构
[1] La Trobe Univ, Bundoora, Vic, Australia
[2] Australian Inst Sport, Bruce, Australia
关键词
ACTION RECOGNITION;
D O I
10.1109/CVPRW.2017.18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to recent advances in technology, the recording and analysis of video data has become an increasingly common component of athlete training programmes. Today it is incredibly easy and affordable to set up a fixed camera and record athletes in a wide range of sports, such as diving, gymnastics, golf, tennis, etc. However, the manual analysis of the obtained footage is a time-consuming task which involves isolating actions of interest and categorizing them using domain-specific knowledge. In order to automate this kind of task, three challenging sub-problems are often encountered: 1) temporally cropping events/actions of interest from continuous video; 2) tracking the object of interest; and 3) classifying the events/actions of interest. Most previous work has focused on solving just one of the above sub-problems in isolation. In contrast, this paper provides a complete solution to the overall action monitoring task in the context of a challenging real-world exemplar. Specifically, we address the problem of diving classification. This is a challenging problem since the person (diver) of interest typically occupies fewer than 1% of the pixels in each frame. The model is required to learn the temporal boundaries of a dive, even though other divers and bystanders may be in view. Finally, the model must be sensitive to subtle changes in body pose over a large number of frames to determine the classification code. We provide effective solutions to each of the sub-problems which combine to provide a highly functional solution to the task as a whole. The techniques proposed can be easily generalized to video footage recorded from other sports.
引用
收藏
页码:94 / 104
页数:11
相关论文
共 50 条
  • [1] Semi-automatic Stereo Extraction from Video Footage
    Guttmann, Moshe
    Wolf, Lior
    Cohen-Or, Daniel
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 136 - 142
  • [2] Key frame extraction from unstructured consumer video clips
    Papin, Christophe
    Luo, Jiebo
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2007, PTS 1 AND 2, 2007, 6508
  • [3] Video Clips Classification with Hierarchical Approach
    Wang Wei
    Li Yuanlei
    Wang Ping
    Zhang Lianfeng
    ADVANCES IN MANUFACTURING TECHNOLOGY, PTS 1-4, 2012, 220-223 : 2413 - 2418
  • [4] Moving objects extraction in diving video
    Li, Y
    Liao, QM
    IMAGE AND VIDEO COMMUNICATIONS AND PROCESSING 2003, PTS 1 AND 2, 2003, 5022 : 1102 - 1110
  • [5] 3D Feature Extraction from Uncalibrated Video Clips
    Donate, Arturo
    Liu, Xiuwen
    PROCEEDINGS OF THE 2010 ACM WORKSHOP ON 3D VIDEO PROCESSING (3DVP'10), 2010, : 31 - 36
  • [6] A Novel Approach for Object Extraction from Video Sequences Based on Continuous Background/Foreground Classification
    Bellardi, Thiago C.
    Rios-Martinez, Jorge
    Vasquez, Dizan
    Laugier, Christian
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010,
  • [7] Crowd Violence Detection from Video Footage
    Gkountakos, Konstantinos
    Ioannidis, Konstantinos
    Tsikrika, Theodora
    Vrochidis, Stefanos
    Kompatsiaris, Ioannis
    2021 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2021, : 231 - 234
  • [8] Looking for the Signs: Identifying Isolated Sign Instances in Continuous Video Footage
    Jiang, Tao
    Camgoz, Necati Cihan
    Bowden, Richard
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [9] From Traditional to Modern: Domain Adaptation for Action Classification in Short Social Video Clips
    Singh, Aditya
    Saini, Saurabh
    Shah, Rajvi
    Narayanan, P. J.
    PATTERN RECOGNITION, GCPR 2016, 2016, 9796 : 245 - 257
  • [10] Leveraging Viewer Comments for Mood Classification of Music Video Clips
    Yamamoto, Takehiro
    Nakamura, Satoshi
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 797 - 800