Robust Clustering-Based Automated Video Shot Boundary Detection Using Handcrafted and Deep Feature Fusion

Cited by: 0
Authors
Mishra, Ravi [1]
Chopkar, Priyanka Nandkishor [1]
Moyal, Vishal [2]
Marotkar, Devashree Shrish [3]
Kapur, Vivek Rajkumar [1]
Affiliations
[1] G H Raisoni Inst Engn & Technol, Nagpur 440028, Maharashtra, India
[2] SVKMs Inst Technol, Dhule 424001, Maharashtra, India
[3] G H Raisoni Inst Engn & Technol, Elect & Telecommun Engn, Nagpur 440028, Maharashtra, India
Keywords
Shot boundary detection; color histogram differences; Morlet wavelet-assisted modified stacked autoencoder; robust deep k-means map clustering
DOI
10.1142/S0218213024500106
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Video shot boundary detection (VSBD) plays a key role in analyzing, summarizing, indexing and retrieving video content. Many artificial intelligence (AI)-based techniques have recently been introduced to detect gradual transitions between video frames. However, those techniques fail to distinguish the subclasses of gradual transition, such as fade-in, fade-out and dissolve. In addition, existing techniques incur high computational complexity when detecting transitions. This research introduces a clustering-based technique that classifies gradual transitions into fade-in, fade-out and dissolve. First, the color histogram difference (CHD) technique is used to detect abrupt transitions, and the identified abrupt transitions are removed from the video frames. The remaining frames are then segmented to isolate the gradual transitions, and handcrafted and deep features are extracted from the segmented gradual-transition frames. The deep features are extracted using the Morlet wavelet-assisted modified stacked autoencoder (MW-MSAE). The handcrafted and deep features are concatenated to obtain fused feature vectors, which are fed to the robust deep k-means map clustering (RDKMM) method to group frames with similar features. To measure similarity, a similarity-based correlation calculation (SBCC) is performed on adjacent frames to determine gradual shot transitions such as fade-in, fade-out and dissolve. The dataset used in this research is the TREC Video Retrieval Evaluation (TRECVID) 2021 dataset. In the experiments, the method achieves an accuracy of 95.5%, a sensitivity of 95.3%, a specificity of 96.9%, a precision of 93.9%, an F-measure of 94.6% and a Matthews correlation coefficient (MCC) of 92.4%.
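The color histogram difference step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the bin count, distance measure or threshold, so everything below is an assumption made for illustration: the function names (`color_histogram`, `histogram_differences`, `detect_abrupt_cuts`), the 8 bins per channel, the L1 distance between normalised histograms, and the threshold of 0.5 are hypothetical and are not taken from the paper.

```python
import numpy as np


def color_histogram(frame, bins=8):
    """Concatenated per-channel histogram of an HxWx3 uint8 frame, normalised to sum to 1."""
    hists = [
        np.histogram(frame[..., c], bins=bins, range=(0, 256))[0]
        for c in range(3)
    ]
    h = np.concatenate(hists).astype(np.float64)
    return h / h.sum()


def histogram_differences(frames, bins=8):
    """L1 distance between histograms of consecutive frames (assumed distance measure)."""
    hists = np.stack([color_histogram(f, bins) for f in frames])
    return np.abs(np.diff(hists, axis=0)).sum(axis=1)


def detect_abrupt_cuts(frames, threshold=0.5, bins=8):
    """Return indices i where the difference between frame i-1 and frame i exceeds the threshold."""
    chd = histogram_differences(frames, bins)
    return [int(i) + 1 for i in np.flatnonzero(chd > threshold)]


if __name__ == "__main__":
    # Synthetic clip: three black frames followed by three white frames.
    black = np.zeros((4, 4, 3), dtype=np.uint8)
    white = np.full((4, 4, 3), 255, dtype=np.uint8)
    clip = [black] * 3 + [white] * 3
    print(detect_abrupt_cuts(clip))
```

With normalised histograms the L1 difference lies between 0 and 2, so a fixed threshold is at least comparable across frame sizes. A gradual transition such as a dissolve spreads its change over many frames and would typically stay under such a threshold, which is consistent with the abstract handling gradual transitions in a separate, later stage.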
Pages: 27