Video representation and retrieval using spatio-temporal descriptors and region relations

被引:0
|
作者
Chatzis, Sotirios [1 ]
Doulamis, Anastasios
Kosmopoulos, Dimitrios
Varvarigou, Theodora
机构
[1] Natl Tech Univ Athens, Dept Elect & Comp Engn, Athens, Greece
[2] Demokritos Natl Ctr Sci Res, Inst Informat & Telecommun, GR-15310 Athens, Greece
关键词
spatio-temporal; graph matching; region; machine learning; ARVQ;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a novel methodology for video summarization and representation. The video shots are processed in space-time as 3D volumes of pixels. Pixel regions with consistent color and motion properties are extracted from these 3D volumes by a space-time segmentation technique based on a novel machine learning algorithm. Each region is then described by a high-dimensional point whose components represent the average position, motion velocity and color of the region. Subsequently, the spatio-temporal relations of the regions are deduced and a concise, graph-based description of them is generated. This graph-based description of the video shot's content, along with the region centroids, comprises a concise yet powerful description of the video-shot and is used for retrieval applications. The retrieval problem is formulated as an inexact graph matching problem between the data video shots and the query input which is also a video segment. Experimental results on action recognition and video retrieval are illustrated and discussed.
引用
收藏
页码:94 / 103
页数:10
相关论文
共 50 条
  • [1] Compressed spatio-temporal descriptors for video matching and retrieval
    Alatas, O
    Javed, O
    Shah, M
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 882 - 885
  • [2] Video retrieval of near-duplicates using κ-nearest neighbor retrieval of spatio-temporal descriptors
    DeMenthon, Daniel
    Doermann, David
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2006, 30 (03) : 229 - 253
  • [3] Video retrieval of near-duplicates using κ-nearest neighbor retrieval of spatio-temporal descriptors
    Daniel DeMenthon
    David Doermann
    [J]. Multimedia Tools and Applications, 2006, 30 : 229 - 253
  • [4] Spatio-temporal video search using the object based video representation
    Zhong, D
    Chang, SF
    [J]. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL I, 1997, : 21 - 24
  • [5] A spatio-temporal pyramid matching for video retrieval
    Choi, Jaesik
    Wang, Ziyu
    Lee, Sang-Chul
    Jeon, Won J.
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (06) : 660 - 669
  • [6] Feature Pooling Using Spatio-Temporal Constrain for Video Summarization and Retrieval
    Ren, Jie
    Ren, Jinchang
    [J]. ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING: FUTURETECH & MUE, 2016, 393 : 381 - 387
  • [7] Unsupervised Learning of Spatio-Temporal Representation with Multi-Task Learning for Video Retrieval
    Kumar, Vidit
    [J]. 2022 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2022, : 118 - 123
  • [8] Motion trajectory clustering for video retrieval using spatio-temporal approximations
    Khalid, S
    Naftel, A
    [J]. VISUAL INFORMATION AND INFORMATION SYSTEMS, 2006, 3736 : 60 - 70
  • [9] An efficient approach for video retrieval by spatio-temporal features
    Kumar, G. S. Naveen
    Reddy, V. S. K.
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2019, 23 (04) : 311 - 316
  • [10] Video region segmentation by spatio-temporal watersheds
    El Saban, MA
    Manjunath, BS
    [J]. 2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 1, PROCEEDINGS, 2003, : 349 - 352