Object of Interest and Unsupervised Learning-based Framework for an Effective Video Summarization Using Deep Learning

被引:2
|
作者
Negi, Alok [1 ]
Kumar, Krishan [1 ]
Saini, Parul [1 ]
机构
[1] Natl Inst Technol, Srinagar 246174, Uttarakhand, India
关键词
Keyframe extraction; K-means; Object of interest (OoI); Pearson correlation coefficient (PCC); Principal component analysis (PCA); ResNet-50; Video summarization (VS); VGG-16; RECOGNITION; SPARSE;
D O I
10.1080/03772063.2023.2220693
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
During this digital era, a large amount of visual data is being generated by various multimedia sources. A technique is urgently required to provide a clear and accurate summary of the original video to highlight the most informative segments of the video content. However, the varying video resolution, multi-dimensional feature representation, and massive storage create difficulties in key frame extraction techniques. As a result, many unnecessary features of the videos must be dropped to count their unique qualities. Therefore, a novel deep learning-based approach is proposed where frames are first retrieved using 25 frames per second; then, objects are detected on extracted frames using YOLOv5, and the frames with the target object only are processed further to overcome time consumption and high-speed computing hardware limitations. Further features are obtained using VGG-16 and object of Interest (OoI) based ResNet-50, respectively, and comparisons are performed to find the best solution. The extracted features are compressed using Principal Component Analysis (PCA) based on unsupervised Learning, which may efficiently minimize information loss, and potentially reduce dimension. By performing a comprehensive evaluation to obtain the best value of K using the Silhouette score, Candidate frames are extracted with the maximum mean and standard deviation from each K-means algorithm-based cluster. Pearson Correlation Coefficient (PCC) is used as a post- processing step to remove the redundant frames from the candidate frames and final keyframes extraction. The experiment was performed on the benchmark office dataset from industrial surveillance, which outperforms the state-of-the-art models in terms of recall score.
引用
收藏
页码:5019 / 5030
页数:12
相关论文
共 50 条
  • [41] An effective framework for detecting the object from the video sequences by utilizing deep learning with hybrid technology
    Chaturvedi, Ravi Prakash
    Ghose, Udayan
    [J]. JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2023, 44 (01): : 113 - 126
  • [42] Deep Learning-Based Multi-class Multiple Object Tracking in UAV Video
    Micheal, A. Ancy
    Vani, K.
    [J]. JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2022, 50 (12) : 2543 - 2552
  • [43] A Deep Learning-Based Real-Time Video Object Contextualizing and Archiving System
    Pham, Dinh-Lam
    Yoon, Byeongnam
    Vu, Viet-Vu
    Kim, Joo-Chang
    Ahn, Sang-Eun
    Chang, Jeong-Hyun
    Yoo, Hyun
    Sun, Kyonghee
    Kim, Kyong-Sook
    Kim, Kwanghoon Pio
    [J]. 2023 25TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY, ICACT, 2023, : 137 - 144
  • [44] Plant Counting of Cotton from UAS Imagery Using Deep Learning-Based Object Detection Framework
    Oh, Sungchan
    Chang, Anjin
    Ashapure, Akash
    Jung, Jinha
    Dube, Nothabo
    Maeda, Murilo
    Gonzalez, Daniel
    Landivar, Juan
    [J]. REMOTE SENSING, 2020, 12 (18)
  • [45] Deep Learning-Based Multi-class Multiple Object Tracking in UAV Video
    A. Ancy Micheal
    K. Vani
    [J]. Journal of the Indian Society of Remote Sensing, 2022, 50 : 2543 - 2552
  • [46] Prevention of smombie accidents using deep learning-based object detection
    Kim, Hyun-Seok
    Kim, Geon-Hwan
    Cho, You-Ze
    [J]. ICT EXPRESS, 2022, 8 (04): : 618 - 625
  • [47] A Deep Learning-Based Framework for Automatic Brain Tumors Classification Using Transfer Learning
    Rehman, Arshia
    Naz, Saeeda
    Razzak, Muhammad Imran
    Akram, Faiza
    Imran, Muhammad
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (02) : 757 - 775
  • [48] A Deep Learning-Based Framework for Automatic Brain Tumors Classification Using Transfer Learning
    Arshia Rehman
    Saeeda Naz
    Muhammad Imran Razzak
    Faiza Akram
    Muhammad Imran
    [J]. Circuits, Systems, and Signal Processing, 2020, 39 : 757 - 775
  • [49] Unsupervised Online Learning in Deep Learning-Based Massive MIMO CSI Feedback
    Cui, Yiming
    Guo, Jiajia
    Wen, Chao-Kai
    Jin, Shi
    Han, Shuangfeng
    [J]. IEEE COMMUNICATIONS LETTERS, 2022, 26 (09) : 2086 - 2090
  • [50] On Understanding Biosonar Deformations Using Deep Learning-Based Video Interpolation
    Gao, Li
    He, Weikai
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 6058 - 6059