Object detection and activity recognition in video surveillance using neural networks

被引：6

作者：

Payghode, Vishva ^{[1
]}

Goyal, Ayush ^{[1
]}

Bhan, Anupama ^{[2
]}

Iyer, Sailesh Suryanarayan ^{[3
]}

Dubey, Ashwani Kumar ^{[2
]}

机构：

[1] Texas A&M Univ, Kingsville, TX USA

[2] Amity Univ, Noida, India

[3] Rai Univ, Rai Sch Engn, Dept CSE IT, Ahmadabad, India

来源：

INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS | 2023年 / 19卷 / 3/4期

关键词：

Deep learning; Video surveillance; Object detection; Neural network; Activity recognition;

D O I：

10.1108/IJWIS-01-2023-0006

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

PurposeThis paper aims to implement and extend the You Only Live Once (YOLO) algorithm for detection of objects and activities. The advantage of YOLO is that it only runs a neural network once to detect the objects in an image, which is why it is powerful and fast. Cameras are found at many different crossroads and locations, but video processing of the feed through an object detection algorithm allows determining and tracking what is captured. Video Surveillance has many applications such as Car Tracking and tracking of people related to crime prevention. This paper provides exhaustive comparison between the existing methods and proposed method. Proposed method is found to have highest object detection accuracy. Design/methodology/approachThe goal of this research is to develop a deep learning framework to automate the task of analyzing video footage through object detection in images. This framework processes video feed or image frames from CCTV, webcam or a DroidCam, which allows the camera in a mobile phone to be used as a webcam for a laptop. The object detection algorithm, with its model trained on a large data set of images, is able to load in each image given as an input, process the image and determine the categories of the matching objects that it finds. As a proof of concept, this research demonstrates the algorithm on images of several different objects. This research implements and extends the YOLO algorithm for detection of objects and activities. The advantage of YOLO is that it only runs a neural network once to detect the objects in an image, which is why it is powerful and fast. Cameras are found at many different crossroads and locations, but video processing of the feed through an object detection algorithm allows determining and tracking what is captured. For video surveillance of traffic cameras, this has many applications, such as car tracking and person tracking for crime prevention. In this research, the implemented algorithm with the proposed methodology is compared against several different prior existing methods in literature. The proposed method was found to have the highest object detection accuracy for object detection and activity recognition, better than other existing methods. FindingsThe results indicate that the proposed deep learning-based model can be implemented in real-time for object detection and activity recognition. The added features of car crash detection, fall detection and social distancing detection can be used to implement a real-time video surveillance system that can help save lives and protect people. Such a real-time video surveillance system could be installed at street and traffic cameras and in CCTV systems. When this system would detect a car crash or a fatal human or pedestrian fall with injury, it can be programmed to send automatic messages to the nearest local police, emergency and fire stations. When this system would detect a social distancing violation, it can be programmed to inform the local authorities or sound an alarm with a warning message to alert the public to maintain their distance and avoid spreading their aerosol particles that may cause the spread of viruses, including the COVID-19 virus. Originality/valueThis paper proposes an improved and augmented version of the YOLOv3 model that has been extended to perform activity recognition, such as car crash detection, human fall detection and social distancing detection. The proposed model is based on a deep learning convolutional neural network model used to detect objects in images. The model is trained using the widely used and publicly available Common Objects in Context data set. The proposed model, being an extension of YOLO, can be implemented for real-time object and activity recognition. The proposed model had higher accuracies for both large-scale and all-scale object detection. This proposed model also exceeded all the other previous methods that were compared in extending and augmenting the object detection to activity recognition. The proposed model resulted in the highest accuracy for car crash detection, fall detection and social distancing detection.

引用

页码：123 / 138

页数：16

共 50 条

[1] Novelty detection in video surveillance using hierarchical neural networks
Owens, J
Hunter, A
Fletcher, E
[J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 1249 - 1254
[2] Human activity detection and recognition for video surveillance
Niu, W
Long, J
Han, D
Wang, YF
[J]. 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 719 - 722
[3] Automated Daily Human Activity Recognition for Video Surveillance Using Neural Network
Babiker, Mohanad
Khalifa, Othman O.
Htike, Kyaw Kyaw
Hassan, Aisha
Zaharadeen, Muhamed
[J]. 2017 IEEE 4TH INTERNATIONAL CONFERENCE ON SMART INSTRUMENTATION, MEASUREMENT AND APPLICATION (ICSIMA 2017), 2017,
[4] IMPROVED OBJECT DETECTION IN VIDEO SURVEILLANCE USING DEEP CONVOLUTIONAL NEURAL NETWORK LEARNING
Dhiyanesh, B.
Kanna, Rajesh K.
Rajkumar, S.
Radha, R.
[J]. PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 913 - 920
[5] Deep Neural Networks for Moving Object Classification in Video Surveillance Applications
Rebai, Rania
Fendri, Emna
Hammami, Mohamed
[J]. FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021), 2022, 12084
[6] Object recognition by indexing using Neural Networks
Villela, PR
Azuela, JHS
[J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS: PATTERN RECOGNITION AND NEURAL NETWORKS, 2000, : 1001 - 1004
[7] Moving object detection for surveillance video frames using two stage multi-scale residual convolution neural networks
Khairwa, Anshul
Thangavelu, Arunkumar
[J]. INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2024, 15 (3-4)
[8] Tracking and Abnormal Behavior Detection in Video Surveillance using Optical Flow and Neural Networks
Rasheed, Nida
Khan, Shoab A.
Khalid, Adnan
[J]. 2014 28TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2014, : 61 - 66
[9] Activity recognition in video surveillance
Fang Shuai
Cao Yang
Wang Han
[J]. PROCEEDINGS OF THE 2007 CHINESE CONTROL AND DECISION CONFERENCE, 2007, : 249 - 252
[10] Object Detection from Video Tubelets with Convolutional Neural Networks
Kang, Kai
Ouyang, Wanli
Li, Hongsheng
Wang, Xiaogang
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 817 - 825

← 1 2 3 4 5 →