Auto encoder with mode-based learning for keyframe extraction in video summarization

被引:0
|
作者
Shambharkar, Prashant Giridhar [1 ]
Goel, Ruchi [1 ,2 ]
机构
[1] Delhi Technol Univ, Dept Comp Sci & Engn, Delhi, India
[2] Delhi Technol Univ, Dept Comp Sci & Engn, Delhi 110042, India
关键词
computer vision; keyframe extraction; NLP; spatiotemporal features; video summarization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The exponential increase in video consumption has created new difficulties for browsing and navigating through video more effectively and efficiently. Researchers are interested in video summarization because it offers a brief but instructive video version that helps users and systems save time and effort when looking for and comprehending relevant content. Key frame extraction is a method of video summarization that only chooses the most important frames from a given video. In this article, a novel supervised learning method 'TC-CLSTM Auto Encoder with Mode-based Learning' using temporal and spatial features is proposed for automatically choosing keyframes or important sub-shots from videos. The method was able to achieve an average F-score of 84.35 on TVSum dataset. Extensive tests on benchmark data sets show that the suggested methodology outperforms state-of-the-art methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Dual Attention Mechanisms Based Auto-Encoder for Video Anomaly Detection
    Gu, Jiatao
    Zeng, Jing
    Ji, Genlin
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I, 2022, 13338 : 153 - 165
  • [42] Key Frames Extraction Based on Local Features for Efficient Video Summarization
    Gharbi, Hana
    Massaoudi, Mohamed
    Bahroun, Sahbi
    Zagrouba, Ezzeddine
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2016, 2016, 10016 : 275 - 285
  • [43] Hierarchical Keyframe-based Video Summarization Using QR-Decomposition and Modified k-Means Clustering
    Amiri, Ali
    Fathy, Mahmood
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
  • [44] Unsupervised Intrusion Detection Based on Asymmetric Auto-Encoder Feature Extraction
    Liu, Chunbo
    Wang, Liyin
    Zhang, Zhikai
    Xiang, Chunmiao
    Gu, Zhaojun
    Wang, Zhi
    Wang, Shuang
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (09) : 1161 - 1173
  • [45] Transfer learning based video summarization in wireless capsule endoscopy
    Raut V.
    Gunjan R.
    International Journal of Information Technology, 2022, 14 (4) : 2183 - 2190
  • [46] Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
    Yoon, Ui Nyoung
    Hong, Myung Duk
    Jo, Geun-Sik
    SENSORS, 2023, 23 (07)
  • [47] Endoscopy Video Summarization based on Unsupervised Learning and Feature Discrimination
    Ben Ismail, M. Maher
    Bchir, Ouiem
    Emam, Ahmed Z.
    2013 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP 2013), 2013,
  • [48] NIRS feature extraction based on deep auto-encoder neural network
    Liu, Ting
    Li, Zhongren
    Yu, Chunxia
    Qin, Yuhua
    INFRARED PHYSICS & TECHNOLOGY, 2017, 87 : 124 - 128
  • [49] Sliding mode-based learning control for positioning of flying pickup head
    Liu, TS
    Wu, WC
    PROCEEDINGS OF THE 2001 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS (CCA'01), 2001, : 345 - 350
  • [50] Sliding Mode-based Integral Reinforcement Learning Event Triggered Control
    Jia, Chao
    Li, Xinyu
    Wang, Hongkun
    Song, Zijian
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2025, 23 (01) : 315 - 331