Detection, identification and alert of wild animals in surveillance videos using deep learning

被引：1

作者：

Jartarghar, Harish A. ^{[1
]}

Kruthi, M. N. ^{[1
]}

Karuntharaka, B. ^{[1
]}

Nasreen, Azra ^{[1
]}

Shankar, T. ^{[2
]}

Kumar, Ramakanth ^{[1
]}

Sreelakshmi, K. ^{[3
]}

机构：

[1] RV Coll Engn, Dept Comp Sci & Engn, Bangalore, India

[2] IISc, Bengaluru, India

[3] RV Coll Engn, Bangalore, India

来源：

CURRENT SCIENCE | 2024年 / 127卷 / 04期

关键词：

CCTV; Convolutional neural network; F1; score; Mean average precision (mAP); VGG-16;

D O I：

10.1108/IJIUS-09-2022-0125

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

PurposeWith the rapid advancement of lifestyle and technology, human lives are becoming increasingly threatened. Accidents, exposure to dangerous substances and animal strikes are all possible threats. Human lives are increasingly being harmed as a result of attacks by wild animals. Further investigation into the cases reported revealed that such events can be detected early on. Techniques such as machine learning and deep learning will be used to solve this challenge. The upgraded VGG-16 model with deep learning-based detection is appropriate for such real-time applications because it overcomes the low accuracy and poor real-time performance of traditional detection methods and detects medium- and long-distance objects more accurately. Many organizations use various safety and security measures, particularly CCTV/video surveillance systems, to address physical security concerns. CCTV/video monitoring systems are quite good at visually detecting a range of attacks associated with suspicious behavior on the premises and in the workplace. Many have indeed begun to use automated systems such as video analytics solutions such as motion detection, object/perimeter detection, face recognition and artificial intelligence/machine learning, among others. Anomaly identification can be performed with the data collected from the CCTV cameras. The camera surveillance can generate enormous quantities of data, which is laborious and expensive to screen for the species of interest. Many cases have been recorded where wild animals enter public places, causing havoc and damaging lives and property. There are many cases where people have lost their lives to wild attacks. The conventional approach of sifting through images by eye can be expensive and risky. Therefore, an automated wild animal detection system is required to avoid these circumstances.Design/methodology/approachThe proposed system consists of a wild animal detection module, a classifier and an alarm module, for which video frames are fed as input and the output is prediction results. Frames extracted from videos are pre-processed and then delivered to the neural network classifier as filtered frames. The classifier module categorizes the identified animal into one of the several categories. An email or WhatsApp notice is issued to the appropriate authorities or users based on the classifier outcome.FindingsEvaluation metrics are used to assess the quality of a statistical or machine learning model. Any system will include a review of machine learning models or algorithms. A number of evaluation measures can be performed to put a model to the test. Among them are classification accuracy, logarithmic loss, confusion matrix and other metrics. The model must be evaluated using a range of evaluation metrics. This is because a model may perform well when one measurement from one evaluation metric is used but perform poorly when another measurement from another evaluation metric is used. We must utilize evaluation metrics to guarantee that the model is running correctly and optimally.Originality/valueThe output of conv5 3 will be of size 7*7*512 in the ImageNet VGG-16 in Figure 4, which operates on images of size 224*224*3. Therefore, the parameters of fc6 with a flattened input size of 7*7*512 and an output size of 4,096 are 4,096, 7*7*512. With reshaped parameters of dimensions 4,096*7*7*512, the comparable convolutional layer conv6 has a 7*7 kernel size and 4,096 output channels. The parameters of fc7 with an input size of 4,096 (i.e. the output size of fc6) and an output size of 4,096 are 4,096, 4,096. The input can be thought of as a one-of-a-kind image with 4,096 input channels. With reshaped parameters of dimensions 4,096*1*1*4,096, the comparable convolutional layer conv7 has a 1*1 kernel size and 4,096 output channels. It is clear that conv6 has 4,096 filters, each with dimensions 7*7*512, and conv7 has 4,096 filters, each with dimensions 1*1*4,096. These filters are numerous, large and computationally expensive. To remedy this, the authors opt to reduce both their number and the size of each filter by subsampling parameters from the converted convolutional layers. Conv6 will use 1,024 filters, each with dimensions 3*3*512. Therefore, the parameters are subsampled from 4,096*7*7*512 to 1,024*3*3*512. Conv7 will use 1,024 filters, each with dimensions 1*1*1,024. Therefore, the parameters are subsampled from 4,096*1*1*4,096 to 1,024*1*1*1,024.

引用

页数：4

共 50 条

[1] Anomaly Detection in Traffic Surveillance Videos Using Deep Learning
Khan, Sardar Waqar
Hafeez, Qasim
Khalid, Muhammad Irfan
Alroobaea, Roobaea
Hussain, Saddam
Iqbal, Jawaid
Almotiri, Jasem
Ullah, Syed Sajid
SENSORS, 2022, 22 (17)
[2] Violence Detection From Industrial Surveillance Videos Using Deep Learning
Khan, Hamza
Yuan, Xiaohong
Qingge, Letu
Roy, Kaushik
IEEE ACCESS, 2025, 13 : 15363 - 15375
[3] Violence Detection in Surveillance Videos with Deep Network using Transfer Learning
Mumtaz, Aqib
Sargano, Allah Bux
Habib, Zulfiqar
2018 2ND EUROPEAN CONFERENCE ON ELECTRICAL ENGINEERING AND COMPUTER SCIENCE (EECS 2018), 2018, : 558 - 563
[4] Splash Detection in Fish Plants Surveillance Videos Using Deep Learning
Jovanovic, Vedran
Svendsen, Eirik
Risojevic, Vladimir
Babic, Zdenka
2018 14TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2018,
[5] Identification of animals and recognition of their actions in wildlife videos using deep learning techniques
Schindler, Frank
Steinhage, Volker
ECOLOGICAL INFORMATICS, 2021, 61
[6] Application of Deep Learning for Weapons Detection in Surveillance Videos
Hashmi, Tufail Sajjad Shah
Ul Haq, Nazeef
Fraz, Muhammad Moazam
Shahzad, Muhammad
2021 INTERNATIONAL CONFERENCE ON DIGITAL FUTURES AND TRANSFORMATIVE TECHNOLOGIES (ICODT2), 2021,
[7] Anomaly detection in surveillance videos using deep autoencoder
Mishra S.
Jabin S.
International Journal of Information Technology, 2024, 16 (2) : 1111 - 1122
[8] A Deep Learning Based Technique for Anomaly Detection in Surveillance Videos
Singh, Prakhar
Pankajakshan, Vinod
2018 TWENTY FOURTH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2018,
[9] Deep Learning Based Fire Detection System for Surveillance Videos
Wang, Hao
Pan, Zhiying
Zhang, Zhifei
Song, Hongzhang
Zhang, Shaobo
Zhang, Jianhua
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT II, 2019, 11741 : 318 - 328
[10] A Scalable and Generalised Deep Learning Framework for Anomaly Detection in Surveillance Videos
Jebur, Sabah Abdulazeez
Alzubaidi, Laith
Saihood, Ahmed
Hussein, Khalid A.
Hoomod, Haider Kadhim
Gu, Yuantong
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2025, 2025 (01)

← 1 2 3 4 5 →