Detection, identification and alert of wild animals in surveillance videos using deep learning

Cited by: 1
Authors
Jartarghar, Harish A. [1]
Kruthi, M. N. [1]
Karuntharaka, B. [1]
Nasreen, Azra [1]
Shankar, T. [2]
Kumar, Ramakanth [1]
Sreelakshmi, K. [3]
Affiliations
[1] RV Coll Engn, Dept Comp Sci & Engn, Bangalore, India
[2] IISc, Bengaluru, India
[3] RV Coll Engn, Bangalore, India
Source
CURRENT SCIENCE | 2024 / Vol. 127 / Issue 04
Keywords
CCTV; Convolutional neural network; F1 score; Mean average precision (mAP); VGG-16;
DOI
10.1108/IJIUS-09-2022-0125
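Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract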
Purpose: With the rapid advancement of lifestyle and technology, human lives are increasingly under threat. Accidents, exposure to dangerous substances and animal strikes are all possible threats, and attacks by wild animals are harming a growing number of people. Further investigation of the reported cases revealed that such events can be detected early. Machine learning and deep learning techniques are used to address this challenge. The upgraded VGG-16 model with deep learning-based detection is appropriate for such real-time applications because it overcomes the low accuracy and poor real-time performance of traditional detection methods and detects medium- and long-distance objects more accurately. Many organizations use various safety and security measures, particularly CCTV/video surveillance systems, to address physical security concerns. CCTV/video monitoring systems are quite good at visually detecting a range of attacks associated with suspicious behavior on the premises and in the workplace. Many organizations have begun to use automated video analytics solutions such as motion detection, object/perimeter detection, face recognition and artificial intelligence/machine learning. Anomaly identification can be performed with the data collected from the CCTV cameras. However, camera surveillance can generate enormous quantities of data, which are laborious and expensive to screen for the species of interest. Many cases have been recorded in which wild animals entered public places, causing havoc and damaging lives and property, and in many cases people have lost their lives to wild animal attacks. The conventional approach of sifting through images by eye can be expensive and risky.
Therefore, an automated wild animal detection system is required to avoid these circumstances.
Design/methodology/approach: The proposed system consists of a wild animal detection module, a classifier and an alarm module; video frames are fed as input, and the output is the prediction result. Frames extracted from videos are pre-processed and then delivered to the neural network classifier as filtered frames. The classifier module assigns the identified animal to one of several categories. Based on the classifier outcome, an email or WhatsApp notification is sent to the appropriate authorities or users.
Findings: Evaluation metrics are used to assess the quality of a statistical or machine learning model, and evaluating the machine learning models or algorithms is part of any such system. A number of evaluation measures can be used to put a model to the test, among them classification accuracy, logarithmic loss and the confusion matrix. The model must be evaluated using a range of metrics, because a model may perform well under one evaluation metric but poorly under another. Evaluation metrics are needed to verify that the model is running correctly and optimally.
Originality/value: The output of conv5_3 is of size 7*7*512 in the ImageNet VGG-16 in Figure 4, which operates on images of size 224*224*3. Therefore, fc6, with a flattened input size of 7*7*512 and an output size of 4,096, has parameters of dimensions 4,096 by 7*7*512. With reshaped parameters of dimensions 4,096*7*7*512, the equivalent convolutional layer conv6 has a 7*7 kernel size and 4,096 output channels. Likewise, fc7, with an input size of 4,096 (i.e. the output size of fc6) and an output size of 4,096, has parameters of dimensions 4,096 by 4,096. Its input can be thought of as a 1*1 image with 4,096 input channels.
With reshaped parameters of dimensions 4,096*1*1*4,096, the equivalent convolutional layer conv7 has a 1*1 kernel size and 4,096 output channels. Thus conv6 has 4,096 filters, each with dimensions 7*7*512, and conv7 has 4,096 filters, each with dimensions 1*1*4,096. These filters are numerous, large and computationally expensive. To remedy this, the authors reduce both the number of filters and the size of each filter by subsampling parameters from the converted convolutional layers. Conv6 uses 1,024 filters, each with dimensions 3*3*512, so its parameters are subsampled from 4,096*7*7*512 to 1,024*3*3*512. Conv7 uses 1,024 filters, each with dimensions 1*1*1,024, so its parameters are subsampled from 4,096*1*1*4,096 to 1,024*1*1*1,024.
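The conversion described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' code. The abstract does not say how the parameters are subsampled, so keeping every m-th index along a dimension (regular-stride decimation) is an assumption here, as are the helper names fc_to_conv, decimate and subsampled_shape and the channel-first layout (filters, channels, height, width), which differs from the 4,096*7*7*512 ordering written above. Small tensors stand in for the real layers to keep the example light.

```python
import numpy as np

def fc_to_conv(fc_weight, in_channels, kernel_size):
    """Reshape an FC weight matrix (out, in_channels*k*k) into conv filters (out, in_channels, k, k)."""
    out_features = fc_weight.shape[0]
    return fc_weight.reshape(out_features, in_channels, kernel_size, kernel_size)

def decimate(tensor, steps):
    """Keep every steps[d]-th index along dimension d; None leaves that dimension untouched.
    Assumption: the abstract only says 'subsampling', not which indices are kept."""
    for dim, step in enumerate(steps):
        if step is not None:
            tensor = np.take(tensor, np.arange(0, tensor.shape[dim], step), axis=dim)
    return tensor

def subsampled_shape(shape, steps):
    """Shape that decimate() would return, computed without allocating the tensor."""
    return tuple(n if s is None else -(-n // s) for n, s in zip(shape, steps))

# Scaled-down stand-in for fc6: 8 outputs, 4 input channels, 7*7 feature map.
rng = np.random.default_rng(0)
fc6_small = rng.standard_normal((8, 4 * 7 * 7))
x = rng.standard_normal((4, 7, 7))

conv6_small = fc_to_conv(fc6_small, in_channels=4, kernel_size=7)

# The FC layer on the flattened input matches one 7*7 convolution position per filter.
fc_out = fc6_small @ x.reshape(-1)
conv_out = (conv6_small * x).sum(axis=(1, 2, 3))
equivalent = np.allclose(fc_out, conv_out)

# Keep every 4th filter and every 3rd kernel row/column (indices 0, 3, 6).
conv6_sub = decimate(conv6_small, steps=(4, None, 3, 3))

# The same steps applied to the full-size shapes quoted in the abstract.
conv6_full = subsampled_shape((4096, 512, 7, 7), (4, None, 3, 3))
conv7_full = subsampled_shape((4096, 4096, 1, 1), (4, 4, None, None))
```

With these steps, conv6_full is (1024, 512, 3, 3) and conv7_full is (1024, 1024, 1, 1), matching the 1,024*3*3*512 and 1,024*1*1*1,024 sizes quoted above up to axis order.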
Pages: 4