Local Features and a Two-Layer Stacking Architecture for Semantic Concept Detection in Video

被引:8
|
作者
Markatopoulou, Foteini [1 ,2 ]
Mezaris, Vasileios [1 ]
Pittaras, Nikiforos [1 ]
Patras, Ioannis [2 ,3 ]
机构
[1] Inst Informat Technol, Ctr Res & Technol Hellas, Thermi 57001, Greece
[2] Queen Mary Univ London, London E1 4NS, England
[3] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
关键词
Content analysis and indexing; semantic concept detection; concept correlation; stacking; multi-label classification; video feature extraction; binary descriptors; semantic video annotation; FUSION;
D O I
10.1109/TETC.2015.2418714
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we deal with the problem of extending and using different local descriptors, as well as exploiting concept correlations, toward improved video semantic concept detection. We examine how the state-of-the-art binary local descriptors can facilitate concept detection, we propose color extensions of them inspired by previously proposed color extensions of scale invariant feature transform, and we show that the latter color extension paradigm is generally applicable to both binary and nonbinary local descriptors. In order to use them in conjunction with a state-of-the-art feature encoding, we compact the above color extensions using PCA and we compare two alternatives for doing this. Concerning the learning stage of concept detection, we perform a comparative study and propose an improved way of employing stacked models, which capture concept correlations, using multilabel classification algorithms in the last layer of the stack. We examine and compare the effectiveness of the above algorithms in both semantic video indexing within a large video collection and in the somewhat different problem of individual video annotation with semantic concepts, on the extensive video data set of the 2013 TRECVID Semantic Indexing Task. Several conclusions are drawn from these experiments on how to improve the video semantic concept detection.
引用
收藏
页码:193 / 204
页数:12
相关论文
共 50 条
  • [1] TWO-LAYER VIDEO FINGERPRINTING STRATEGY FOR NEAR-DUPLICATE VIDEO DETECTION
    Nie, Xiushan
    Jing, Weizhen
    Ma, Lin Yuan
    Cui, Chaoran
    Yin, Yilong
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [2] Two-layer coordination architecture HIF detection with μPMU data
    Wang, Xiaojun
    Zhang, Yongjie
    Luo, Yiping
    He, Jinghan
    Ling, Ping
    Fang, Chen
    JOURNAL OF ENGINEERING-JOE, 2018, (15): : 1033 - 1037
  • [3] A two-layer graphical model for combined video shot and scene boundary detection
    Al-Hames, Marc
    Zettl, Stefan
    Wallhoff, Frank
    Reiter, Stephan
    Schuller, Bjoern
    Rigoll, Gerhard
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 261 - +
  • [4] Region Trajectories for Video Semantic Concept Detection
    Ye, Yuancheng
    Rong, Xuejian
    Yang, Xiaodong
    Tian, Yingli
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 255 - 259
  • [5] A Novel Semantic Model for Video Concept Detection
    Zhu, Songhao
    Liu, Yuncai
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 1837 - +
  • [6] A two-layer backdoor detection algorithm
    Horng, Shi-Jinn
    Su, Ming-Yang
    Tsai, Ja-Ga
    WMSCI 2005: 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Vol 8, 2005, : 107 - 112
  • [7] Two-layer image retrieval method based on wavelet and local color spatial features
    Zhao, Me
    Yan, Dong-Ming
    Zhang, Ying-Kang
    2007 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1-4, PROCEEDINGS, 2007, : 254 - 259
  • [8] An efficient Video Forgery Detection using Two-Layer Hybridized Deep CNN classifier
    Ugale, Meena
    Midhunchakkaravarthy, J.
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2025, 12 (01):
  • [9] Semantic Audiovisual Features in Video Scene Detection
    Abdullah, Lili Nurliyana
    Noah, Shahrul Azman Mohd
    Sembok, Tengku Mohd Tengku
    2009 INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT AND ENGINEERING, PROCEEDINGS, 2009, : 745 - +
  • [10] Two-layer motion estimation algorithm for video coding
    Paramkusam, A. V.
    Reddy, V. S. K.
    ELECTRONICS LETTERS, 2014, 50 (04) : 276 - 277