Robust multi-level video representation using mean shift analysis

被引：0

作者：

Gao, H ^{[1
]}

Yu, XD ^{[1
]}

Wang, L ^{[1
]}

Xue, P ^{[1
]}

Tian, Q ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch EEE, Singapore 639798, Singapore

来源：

2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3 | 2004年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A robust method for multi-level video representation based on the mean shift analysis (MSA) of low-level visual features is proposed in this paper. By tuning the bandwidth of MSA, video representation from the coarse level to the fine level can be achieved. This representation form provides a flexible scheme for content-based video analysis such as summarization, classification, and retrieval. Compared with the conventional k-means or fuzzy c-means algorithms, our method can adjust the resolution of representation in a more straight/forward way, and is more robust since it does not need to initialize the cluster centers.

引用

页码：627 / 630

页数：4

共 50 条

[1] Multi-level video representation with application to keyframe extraction
Yu, XD
Wang, L
Tian, Q
Xue, P
[J]. 10TH INTERNATIONAL MULTIMEDIA MODELLING CONFERENCE, PROCEEDINGS, 2004, : 117 - 123
[2] Multi-level analysis of sports video sequences
Han, JG
Farin, D
de With, PHN
[J]. MULTIMEDIA CONTENT ANALYSIS, MANAGEMENT, AND RETRIEVAL 2006, 2006, 6073
[3] Multi-level semantic analysis for sports video
Tjondronegoro, DW
Chen, YPP
[J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2005, 3682 : 24 - 30
[4] Multi-level molecular representation
Olivier, P
Nakata, K
Landon, M
[J]. ARTIFICIAL INTELLIGENCE IN DESIGN '96, 1996, : 3 - 20
[5] Multi-Level Representation Learning with Semantic Alignment for Referring Video Object Segmentation
Wu, Dongming
Dong, Xingping
Shao, Ling
Shen, Jianbing
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4986 - 4995
[6] Multi-Level Visual Representation with Semantic-Reinforced Learning for Video Captioning
Dong, Chengbo
Chen, Xinru
Chen, Aozhu
Hu, Fan
Wang, Zihan
Li, Xirong
[J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4750 - 4754
[7] Multi-Level Fusion for Robust RGBT Tracking via Enhanced Thermal Representation
Tang, Zhangyong
Xu, Tianyang
Wu, Xiao-Jun
Kittler, Josef
[J]. ACM Transactions on Multimedia Computing, Communications and Applications, 2024, 20 (10)
[8] Text Representation Using Multi-level Latent Dirichlet Allocation
Razavi, Amir H.
Inkpen, Diana
[J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, CANADIAN AI 2014, 2014, 8436 : 215 - 226
[9] Multi-level lecture video classification using text content
Agziyagli, Veysel Sercan
Ogul, Hasan
[J]. 2020 IEEE 14TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2020), 2020,
[10] Multi-level Video Segmentation Using Visual Semantic Units
Shih, Huang-Chia
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2013, : 37 - 38

← 1 2 3 4 5 →