Local Features and a Two-Layer Stacking Architecture for Semantic Concept Detection in Video

被引：8

作者：

Markatopoulou, Foteini ^{[1
,2
]}

Mezaris, Vasileios ^{[1
]}

Pittaras, Nikiforos ^{[1
]}

Patras, Ioannis ^{[2
,3
]}

机构：

[1] Inst Informat Technol, Ctr Res & Technol Hellas, Thermi 57001, Greece

[2] Queen Mary Univ London, London E1 4NS, England

[3] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England

来源：

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING | 2015年 / 3卷 / 02期

关键词：

Content analysis and indexing; semantic concept detection; concept correlation; stacking; multi-label classification; video feature extraction; binary descriptors; semantic video annotation; FUSION;

D O I：

10.1109/TETC.2015.2418714

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we deal with the problem of extending and using different local descriptors, as well as exploiting concept correlations, toward improved video semantic concept detection. We examine how the state-of-the-art binary local descriptors can facilitate concept detection, we propose color extensions of them inspired by previously proposed color extensions of scale invariant feature transform, and we show that the latter color extension paradigm is generally applicable to both binary and nonbinary local descriptors. In order to use them in conjunction with a state-of-the-art feature encoding, we compact the above color extensions using PCA and we compare two alternatives for doing this. Concerning the learning stage of concept detection, we perform a comparative study and propose an improved way of employing stacked models, which capture concept correlations, using multilabel classification algorithms in the last layer of the stack. We examine and compare the effectiveness of the above algorithms in both semantic video indexing within a large video collection and in the somewhat different problem of individual video annotation with semantic concepts, on the extensive video data set of the 2013 TRECVID Semantic Indexing Task. Several conclusions are drawn from these experiments on how to improve the video semantic concept detection.

引用

页码：193 / 204

页数：12

共 50 条

[21] ATM multipeer communication using a two-layer architecture
Dresler, S
23RD ANNUAL CONFERENCE ON LOCAL COMPUTER NETWORKS - PROCEEDINGS, 1998, : 182 - 189
[22] Crash injury severity analysis using a two-layer Stacking framework
Tang, Jinjun
Liang, Jian
Han, Chunyang
Li, Zhibin
Huang, Helai
ACCIDENT ANALYSIS AND PREVENTION, 2019, 122 : 226 - 238
[23] Robust Semantic Concept Detection in Large Video Collections
Shen, Jialie
Tao, Dacheng
Li, Xuelong
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 635 - +
[24] VIDEO SEMANTIC CONCEPT DETECTION VIA ASSOCIATIVE CLASSIFICATION
Lin, Lin
Shyu, Mei-Ling
Ravitz, Guy
Chen, Shu-Ching
ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 418 - +
[25] Improving Automatic Video Retrieval with Semantic Concept Detection
Koskela, Markus
Sjoberg, Mats
Laaksonen, Jorma
IMAGE ANALYSIS, PROCEEDINGS, 2009, 5575 : 480 - 489
[26] AHP: A new strategy for the semantic concept detection in video
Ding, Dayong
Zhang, Bo
Wu, Jinglan
2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1974 - +
[27] Two-Layer Architecture for Signature-Based Attacks Detection over Encrypted Network Traffic
Tahmi, Omar
Talhi, Chamseddine
Challal, Yacine
FOUNDATIONS AND PRACTICE OF SECURITY, FPS 2022, 2023, 13877 : 423 - 440
[28] Two-layer modeling for local area networks.
Murata, Masayuki
Takagi, Hideaki
IEEE Transactions on Communications, 1988, 36 (09): : 1022 - 1034
[29] Home video structuring with a two-layer shot clustering approach
Mang, Yu-Jin
Jiang, Fan
2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 500 - 504
[30] Two-layer hierarchical coding for MPEG-2 video
Garzelli, A
ELECTRONICS LETTERS, 2000, 36 (20) : 1696 - 1697

← 1 2 3 4 5 →