Local Features and a Two-Layer Stacking Architecture for Semantic Concept Detection in Video

被引:8
|
作者
Markatopoulou, Foteini [1 ,2 ]
Mezaris, Vasileios [1 ]
Pittaras, Nikiforos [1 ]
Patras, Ioannis [2 ,3 ]
机构
[1] Inst Informat Technol, Ctr Res & Technol Hellas, Thermi 57001, Greece
[2] Queen Mary Univ London, London E1 4NS, England
[3] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
关键词
Content analysis and indexing; semantic concept detection; concept correlation; stacking; multi-label classification; video feature extraction; binary descriptors; semantic video annotation; FUSION;
D O I
10.1109/TETC.2015.2418714
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we deal with the problem of extending and using different local descriptors, as well as exploiting concept correlations, toward improved video semantic concept detection. We examine how the state-of-the-art binary local descriptors can facilitate concept detection, we propose color extensions of them inspired by previously proposed color extensions of scale invariant feature transform, and we show that the latter color extension paradigm is generally applicable to both binary and nonbinary local descriptors. In order to use them in conjunction with a state-of-the-art feature encoding, we compact the above color extensions using PCA and we compare two alternatives for doing this. Concerning the learning stage of concept detection, we perform a comparative study and propose an improved way of employing stacked models, which capture concept correlations, using multilabel classification algorithms in the last layer of the stack. We examine and compare the effectiveness of the above algorithms in both semantic video indexing within a large video collection and in the somewhat different problem of individual video annotation with semantic concepts, on the extensive video data set of the 2013 TRECVID Semantic Indexing Task. Several conclusions are drawn from these experiments on how to improve the video semantic concept detection.
引用
收藏
页码:193 / 204
页数:12
相关论文
共 50 条
  • [21] ATM multipeer communication using a two-layer architecture
    Dresler, S
    23RD ANNUAL CONFERENCE ON LOCAL COMPUTER NETWORKS - PROCEEDINGS, 1998, : 182 - 189
  • [22] Crash injury severity analysis using a two-layer Stacking framework
    Tang, Jinjun
    Liang, Jian
    Han, Chunyang
    Li, Zhibin
    Huang, Helai
    ACCIDENT ANALYSIS AND PREVENTION, 2019, 122 : 226 - 238
  • [23] Robust Semantic Concept Detection in Large Video Collections
    Shen, Jialie
    Tao, Dacheng
    Li, Xuelong
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 635 - +
  • [24] VIDEO SEMANTIC CONCEPT DETECTION VIA ASSOCIATIVE CLASSIFICATION
    Lin, Lin
    Shyu, Mei-Ling
    Ravitz, Guy
    Chen, Shu-Ching
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 418 - +
  • [25] Improving Automatic Video Retrieval with Semantic Concept Detection
    Koskela, Markus
    Sjoberg, Mats
    Laaksonen, Jorma
    IMAGE ANALYSIS, PROCEEDINGS, 2009, 5575 : 480 - 489
  • [26] AHP: A new strategy for the semantic concept detection in video
    Ding, Dayong
    Zhang, Bo
    Wu, Jinglan
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1974 - +
  • [27] Two-Layer Architecture for Signature-Based Attacks Detection over Encrypted Network Traffic
    Tahmi, Omar
    Talhi, Chamseddine
    Challal, Yacine
    FOUNDATIONS AND PRACTICE OF SECURITY, FPS 2022, 2023, 13877 : 423 - 440
  • [28] Two-layer modeling for local area networks.
    Murata, Masayuki
    Takagi, Hideaki
    IEEE Transactions on Communications, 1988, 36 (09): : 1022 - 1034
  • [29] Home video structuring with a two-layer shot clustering approach
    Mang, Yu-Jin
    Jiang, Fan
    2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 500 - 504
  • [30] Two-layer hierarchical coding for MPEG-2 video
    Garzelli, A
    ELECTRONICS LETTERS, 2000, 36 (20) : 1696 - 1697