Robust multi-level video representation using mean shift analysis

Cited by: 0
Authors
Gao, H [1]
Yu, XD [1]
Wang, L [1]
Xue, P [1]
Tian, Q [1]
Affiliation
[1] Nanyang Technol Univ, Sch EEE, Singapore 639798, Singapore
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a robust method for multi-level video representation based on mean shift analysis (MSA) of low-level visual features. Tuning the MSA bandwidth yields representations ranging from coarse to fine. This representation provides a flexible scheme for content-based video analysis tasks such as summarization, classification, and retrieval. Compared with the conventional k-means or fuzzy c-means algorithms, the method adjusts the resolution of the representation in a more straightforward way, and it is more robust because it does not require initializing the cluster centers.
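The bandwidth effect described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a Gaussian kernel, a simple "merge modes closer than one bandwidth" rule, and synthetic 2-D points standing in for per-frame low-level visual features; the function name `mean_shift`, its parameters, and the toy data are all hypothetical.

```python
import numpy as np

def mean_shift(features, bandwidth, n_iter=50, tol=1e-5):
    """Gaussian-kernel mean shift (illustrative sketch). Returns (modes, labels)."""
    points = np.asarray(features, dtype=float)
    shifted = points.copy()
    for _ in range(n_iter):
        # Squared distances from every current estimate to every data point.
        d2 = ((shifted[:, None, :] - points[None, :, :]) ** 2).sum(axis=2)
        weights = np.exp(-d2 / (2.0 * bandwidth ** 2))
        # Move each estimate to the kernel-weighted mean of the data.
        updated = weights @ points / weights.sum(axis=1, keepdims=True)
        converged = np.abs(updated - shifted).max() < tol
        shifted = updated
        if converged:
            break
    # Assumed merge rule: estimates within one bandwidth share a mode.
    modes, labels = [], np.empty(len(points), dtype=int)
    for i, p in enumerate(shifted):
        for k, m in enumerate(modes):
            if np.linalg.norm(p - m) < bandwidth:
                labels[i] = k
                break
        else:
            modes.append(p)
            labels[i] = len(modes) - 1
    return np.array(modes), labels

# Hypothetical "frame features": two nearby groups and one distant group.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [1.0, 0.0], [10.0, 0.0]])
frames = np.vstack([c + 0.05 * rng.standard_normal((20, 2)) for c in centers])

# Number of modes found at each bandwidth, from fine to coarse.
levels = {}
for bw in (0.3, 3.0, 20.0):
    modes, labels = mean_shift(frames, bw)
    levels[bw] = len(modes)
    print(f"bandwidth={bw}: {len(modes)} mode(s)")
```

On this toy data the number of modes shrinks as the bandwidth grows, which is the coarse-to-fine behaviour the abstract describes. No initial cluster centers are supplied: every data point serves as its own starting estimate, in contrast to k-means, where the number of clusters and their initial centers must be chosen up front.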
Pages: 627-630
Page count: 4