Interactive video summarization with human intentions

被引:3
|
作者
Liu, Huaping [1 ]
Sun, Fuchun [1 ]
Zhang, Xinyu [2 ]
Fang, Bin [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, BNRist, State Key Lab Intelligent Technol & Syst, Beijing, Peoples R China
[2] Tsinghua Univ, State Key Lab Automot Safety & Energy, Beijing, Peoples R China
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
Interactive action summarization; Video summarization; Human-machine interaction; Non-negative matrix factorization;
D O I
10.1007/s11042-018-6305-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic video summarization, which is a typical cognitive-inspired task and attempts to select a small set of the most representative images or video clips for a specific video sequence, is therefore vital for enabling many tasks. In this work, we develop an interactive Non-negative Matrix Factorization (NMF) method for representative action video discovery. The original video is first evenly segmented into short clips, and the bag-of-words model is used to describe each clip. A temporally consistent NMF model is subsequently used for clustering and action segmentation. Because the clustering and segmentation results may not satisfy user intention, the user-controlled operations MERGE and ADD are developed to permit the user to adjust the results in line with expectations. The newly developed interactive NMF method can therefore generate personalized results.Experimental results on the public Weizman dataset demonstrate that our approach provides satisfactory action discovery and segmentation results.
引用
收藏
页码:1737 / 1755
页数:19
相关论文
共 50 条
  • [21] Video Interactive Captioning with Human Prompts
    Wu, Aming
    Han, Yahong
    Yang, Yi
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 961 - 967
  • [22] Social Interactive Human Video Synthesis
    Okwechime, Dumebi
    Ong, Eng-Jon
    Gilbert, Andrew
    Bowden, Richard
    COMPUTER VISION-ACCV 2010, PT I, 2011, 6492 : 256 - 270
  • [23] Echocardiogram video summarization
    Ebadollahi, S
    Chang, SF
    Wu, H
    Takoma, S
    MEDICAL IMAGING 2001: ULTRASONIC IMAGING AND SIGNAL PROCESSING, 2001, 4325 : 492 - 501
  • [24] VIDEO ANALYSIS BASED ON HUMAN POSE FOR UNSUPERVISED SUMMARIZATION AND RETRIEVAL
    Santiago, C.
    Alves, D. M.
    Ferreira, B. Q.
    Carvalho, J.
    Messina, A.
    Costeira, J. P.
    2019 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2019,
  • [25] Dynamic video summarization of home video
    Lienhart, R
    STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2000, 2000, 3972 : 378 - 389
  • [26] Hierarchical video summarization
    Ratakonda, K
    Sezan, MI
    Crinon, R
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '99, PARTS 1-2, 1998, 3653 : 1531 - 1541
  • [27] Video Summarization Overview
    Otani, Mayu
    Song, Yale
    Wang, Yang
    FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2022, 13 (04): : 284 - 335
  • [28] Video retrieval and summarization
    Sebe, N
    Lew, MS
    Smeulders, AWM
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2003, 92 (2-3) : 141 - 146
  • [29] AudioVisual Video Summarization
    Zhao, Bin
    Gong, Maoguo
    Li, Xuelong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 5181 - 5188
  • [30] Video Co-summarization: Video Summarization by Visual Co-occurrence
    Chu, Wen-Sheng
    Song, Yale
    Jaimes, Alejandro
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3584 - 3592