HARTIV: Human Activity Recognition Using Temporal Information in Videos

被引:4
|
作者
Deotale, Disha [1 ]
Verma, Madhushi [2 ]
Suresh, P. [3 ]
Jangir, Sunil Kumar [4 ]
Kaur, Manjit [2 ]
Idris, Sahar Ahmed [5 ]
Alshazly, Hammam [6 ]
机构
[1] SPPU Univ, GH Raisoni Inst Engn & Technol, CSE Dept, Pune, Maharashtra, India
[2] Bennett Univ, CSE Dept, Greater Noida, India
[3] TML Business Serv Ltd, Pune, Maharashtra, India
[4] Anand Int Coll Engn, CSE Dept, Jaipur, Rajasthan, India
[5] King Khalid Univ, Coll Ind Engn, Abha, Saudi Arabia
[6] South Valley Univ, Fac Comp & Informat, Qena 83523, Egypt
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2022年 / 70卷 / 02期
关键词
Action recognition; human activity recognition; untrimmed video; deep learning; convolutional neural networks;
D O I
10.32604/cmc.2022.020655
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, the most challenging and important problem of computer vision is to detect human activities and recognize the same with temporal information from video data. The video datasets are generated using cameras available in various devices that can be in a static or dynamic position and are referred to as untrimmed videos. Smarter monitoring is a historical necessity in which commonly occurring, regular, and out-of-the-ordinary activities can be automatically identified using intelligence systems and computer vision technology. In a long video, human activity may be present anywhere in the video. There can be a single or multiple human activities present in such videos. This paper presents a deep learning-based methodology to identify the locally present human activities in the video sequences captured by a single wide-view camera in a sports environment. The recognition process is split into four parts: firstly, the video is divided into different set of frames, then the human body part in a sequence of frames is identified, next process is to identify the human activity using a convolutional neural network and finally the time information of the observed postures for each activity is determined with the help of a deep learning algorithm. The proposed approach has been tested on two different sports datasets including ActivityNet and THUMOS. Three sports activities like swimming, cricket bowling and high jump have been considered in this paper and classified with the temporal information i.e., the start and end time for every activity present in the video. The convolutional neural network and long short-term memory are used for feature extraction of temporal action recognition from video data of sports activity. The outcomes show that the proposed method for activity recognition in the sports domain outperforms the existing methods.
引用
收藏
页码:3919 / 3938
页数:20
相关论文
共 50 条
  • [21] GAIT RECOGNITION USING LOW SPATIAL AND TEMPORAL RESOLUTION VIDEOS
    Das Choudhury, Sruti
    Guan, Yu
    Li, Chang-Tsun
    2ND INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS (IWBF2014), 2014,
  • [22] Spatio-Temporal Activity Detection and Recognition in Untrimmed Surveillance Videos
    Gkountakos, Konstantinos
    Touska, Despoina
    Ioannidis, Konstantinos
    Tsikrika, Theodora
    Vrochidis, Stefanos
    Kompatsiaris, Ioannis
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 451 - 455
  • [23] Exploiting Spatio-temporal Information for View Recognition in Cardiac Echo Videos
    Beymer, David
    Syeda-Mahmood, Tanveer
    Wang, Fei
    2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3, 2008, : 432 - 439
  • [24] Feature Aggregation Tree: Capture Temporal Motion Information for Action Recognition in Videos
    Zhu, Bing
    PATTERN RECOGNITION AND COMPUTER VISION, PT III, 2018, 11258 : 316 - 327
  • [25] Human Weapon-Activity Recognition in Surveillance Videos Using Structural-RNN
    Susarla, Praneeth
    Agrawal, Utkarsh
    Jayagopi, Dinesh Babu
    PROCEEDINGS OF THE 2ND MEDITERRANEAN CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (MEDPRAI-2018), 2018, : 101 - 107
  • [26] Spatio-Temporal VLAD Encoding for Human Action Recognition in Videos
    Duta, Ionut C.
    Ionescu, Bogdan
    Aizawa, Kiyoharu
    Sebe, Nicu
    MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 : 365 - 378
  • [27] Human activity recognition using temporal convolutional neural network architecture
    Andrade-Ambriz, Yair A.
    Ledesma, Sergio
    Ibarra-Manzano, Mario-Alberto
    Oros-Flores, Marvella, I
    Almanza-Ojeda, Dora-Luz
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
  • [28] Temporal Learning using Echo State Network for Human Activity Recognition
    Basterrech, Sebastian
    Ojha, Varun Kumar
    2016 THIRD EUROPEAN NETWORK INTELLIGENCE CONFERENCE (ENIC 2016), 2016, : 217 - 223
  • [29] Emotion Recognition from Human Speech Using Temporal Information and Deep Learning
    Kim, John W.
    Saurous, Rif A.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 937 - 940
  • [30] Trajectory-Based Human Activity Recognition from Videos
    Boufama, Boubakeur
    Habashi, Pejman
    Ahmad, Imran Shafiq
    2017 3RD INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2017, : 32 - 36