A DCA-based sparse coding for video summarization with MCP

被引:0
|
作者
Li, Yujie [1 ]
Li, Zhenni [2 ,3 ]
Tan, Benying [1 ]
Ding, Shuxue [1 ]
机构
[1] Guilin Univ Elect Technol, Guangxi Coll & Univ Key Lab AI Algorithm Engn, Sch Artificial Intelligence, Jinji Rd, Guilin, Peoples R China
[2] Guangdong Univ Technol, Sch Automat, Guangdong Key Lab IoT Informat Technol, Guangzhou, Peoples R China
[3] Guangdong Univ Technol, Sch Automat, Guangdong Key Lab IoT Informat Technol, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
computer vision; image processing; image segmentation; signal processing; sparse matrices; video signal processing; VISUAL-ATTENTION; MINIMIZATION; EXTRACTION; FRAMEWORK; MODEL;
D O I
10.1049/ipr2.12738
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video summarization offers a summary version that conveys the primary information of a longer video. The main challenges of video summarization are related to keyframe extraction and saliency mapping. Thus, this work proposes a sparse coding model for keyframe extraction and saliency mapping applications. Specifically, the minimax concave penalty (MCP) is utilized as a sparse regularization scheme and the regularized non-convex MCP problem is solved by decomposing MCP into two convex functions and the convex function's algorithm difference is relied on to solve the resulting sub-problems. The experimental results demonstrate higher compressed keyframes and saliency maps than current state-of-the-art algorithms. In particular, the model attains a lower summary length of 34% and 19% compared to sparse modeling representation selection (SMRS) and sparse modeling using the determinant sparsity measure (SC-det), respectively. In addition, the developed scheme has a shorter computation time, requiring 82% and 33% less time than the ITTI and the dense and sparse reconstruction (DSR) methods.
引用
收藏
页码:1564 / 1577
页数:14
相关论文
共 50 条
  • [41] DCA-based unimodal feature-level fusion of orthogonal moments for Indian sign language dataset
    Joshi, Garima
    Vig, Renu
    Singh, Sukhwinder
    IET COMPUTER VISION, 2018, 12 (05) : 570 - 577
  • [42] JOINT WEIGHTED SPARSE REPRESENTATION BASED MEDIAN FILTER FOR DEPTH VIDEO CODING
    Hu, Jinhui
    Hu, Ruimin
    Chen, Yu
    Liao, Liang
    Xiao, Jing
    Ruan, Ruolin
    2015 DATA COMPRESSION CONFERENCE (DCC), 2015, : 450 - 450
  • [43] Scalable video summarization via sparse dictionary learning and selection simultaneously
    Etezadifar, Pouriya
    Farsi, Hassan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (06) : 7947 - 7971
  • [44] Scalable video summarization via sparse dictionary learning and selection simultaneously
    Pouriya Etezadifar
    Hassan Farsi
    Multimedia Tools and Applications, 2017, 76 : 7947 - 7971
  • [45] Resource Restricted On-line Video Summarization with Minimum Sparse Reconstruction
    Mei, Shaohui
    Wang, Zhiyong
    He, Mingyi
    Feng, Dagan
    2015 PICTURE CODING SYMPOSIUM (PCS) WITH 2015 PACKET VIDEO WORKSHOP (PV), 2015, : 139 - 143
  • [46] Evaluating DCA-based method performances for RNA contact prediction by a well-curated data set
    Pucci, Fabrizio
    Zerihun, Mehari B.
    Peter, Emanuel K.
    Schug, Alexander
    RNA, 2020, 26 (07) : 794 - 802
  • [47] Sparse coded handcrafted and deep features for colon capsule video summarization
    Mohammed, Ahmed
    Yildirim, Sule
    Pedersen, Marius
    Hovde, Oistein
    Cheikh, Faouzi
    2017 IEEE 30TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2017, : 728 - 733
  • [48] Hierarchical video summarization based on video structure and highlight
    Geng, Yuliang
    Xu, De
    Feng, Songhe
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2006, 4109 : 226 - 234
  • [49] Smart Surveillance Based on Video Summarization
    Thomas, Sinnu Susan
    Gupta, Sumana
    Subramanian, Venkatesh K.
    2017 IEEE REGION 10 INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR SMART CITIES (IEEE TENSYMP 2017), 2017,
  • [50] Gesture-based video summarization
    Kosmopoulos, D
    Doulamis, A
    Doulamis, N
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 3213 - 3216