Object of Interest and Unsupervised Learning-based Framework for an Effective Video Summarization Using Deep Learning

被引:2
|
作者
Negi, Alok [1 ]
Kumar, Krishan [1 ]
Saini, Parul [1 ]
机构
[1] Natl Inst Technol, Srinagar 246174, Uttarakhand, India
关键词
Keyframe extraction; K-means; Object of interest (OoI); Pearson correlation coefficient (PCC); Principal component analysis (PCA); ResNet-50; Video summarization (VS); VGG-16; RECOGNITION; SPARSE;
D O I
10.1080/03772063.2023.2220693
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
During this digital era, a large amount of visual data is being generated by various multimedia sources. A technique is urgently required to provide a clear and accurate summary of the original video to highlight the most informative segments of the video content. However, the varying video resolution, multi-dimensional feature representation, and massive storage create difficulties in key frame extraction techniques. As a result, many unnecessary features of the videos must be dropped to count their unique qualities. Therefore, a novel deep learning-based approach is proposed where frames are first retrieved using 25 frames per second; then, objects are detected on extracted frames using YOLOv5, and the frames with the target object only are processed further to overcome time consumption and high-speed computing hardware limitations. Further features are obtained using VGG-16 and object of Interest (OoI) based ResNet-50, respectively, and comparisons are performed to find the best solution. The extracted features are compressed using Principal Component Analysis (PCA) based on unsupervised Learning, which may efficiently minimize information loss, and potentially reduce dimension. By performing a comprehensive evaluation to obtain the best value of K using the Silhouette score, Candidate frames are extracted with the maximum mean and standard deviation from each K-means algorithm-based cluster. Pearson Correlation Coefficient (PCC) is used as a post- processing step to remove the redundant frames from the candidate frames and final keyframes extraction. The experiment was performed on the benchmark office dataset from industrial surveillance, which outperforms the state-of-the-art models in terms of recall score.
引用
收藏
页码:5019 / 5030
页数:12
相关论文
共 50 条
  • [1] An Effective Video Summarization Framework Based on the Object of Interest Using Deep Learning
    Ul Haq, Hafiz Burhan
    Asif, Muhammad
    Ahmad, Maaz Bin
    Ashraf, Rehan
    Mahmood, Toqeer
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [2] Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
    Yoon, Ui Nyoung
    Hong, Myung Duk
    Jo, Geun-Sik
    [J]. SENSORS, 2023, 23 (07)
  • [3] Unsupervised Learning-Based Framework for Deepfake Video Detection
    Zhang, Li
    Qiao, Tong
    Xu, Ming
    Zheng, Ning
    Xie, Shichuang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4785 - 4799
  • [4] Deep Learning-based Framework for Changeable Target-of-Interest Object Tracking using AMR
    Kwak, Jeonghoon
    Yang, Kyon-Mo
    Koo, Jaewan
    Seo, Kap-Ho
    [J]. Journal of Institute of Control, Robotics and Systems, 2022, 28 (12): : 1140 - 1146
  • [5] An Optimized Deep Learning Method for Video Summarization Based on the User of Interest
    Ul Haq, Hafiz Burhan
    Suwansantisuk, Watcharapan
    Chamnongthai, Kosin
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 244 - 256
  • [6] Text summarization using unsupervised deep learning
    Yousefi-Azar, Mahmood
    Hamey, Len
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 68 : 93 - 105
  • [7] Unsupervised Video Object Segmentation for Deep Reinforcement Learning
    Goel, Vik
    Weng, Jameson
    Poupart, Pascal
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [8] Lecture Video Summarization Using Deep Learning
    Khetarpaul, Sonia
    Jain, Lakshay
    Goyal, Kush
    Tej, P. Vishnu
    [J]. RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, ACIIDS 2024, 2024, 2145 : 94 - 105
  • [9] Soccer Video Summarization using Deep Learning
    Agyeman, Rockson
    Muhammad, Rafiq
    Choi, Gyu Sang
    [J]. 2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 270 - 273
  • [10] A novel deep unsupervised learning-based framework for optimization of truss structures
    Mai, Hau T.
    Lieu, Qui X.
    Kang, Joowon
    Lee, Jaehong
    [J]. ENGINEERING WITH COMPUTERS, 2023, 39 (04) : 2585 - 2608