Multimedia Event Detection Using A Classifier-Specific Intermediate Representation

被引:55
|
作者
Ma, Zhigang [1 ]
Yang, Yi [2 ]
Sebe, Nicu [1 ]
Zheng, Kai [3 ]
Hauptmann, Alexander G. [2 ]
机构
[1] Univ Trento, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[3] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
关键词
Intermediate representation; multimedia event detection; p-norm; VIDEO RETRIEVAL; PROJECTIONS; FEATURES;
D O I
10.1109/TMM.2013.2264928
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multimedia event detection (MED) plays an important role in many applications such as video indexing and retrieval. Current event detection works mainly focus on sports and news event detection or abnormality detection in surveillance videos. Differently, our research aims to detect more complicated and generic events within a longer video sequence. In the past, researchers have proposed using intermediate concept classifiers with concept lexica to help understand the videos. Yet it is difficult to judge how many and what concepts would be sufficient for the particular video analysis task. Additionally, obtaining robust semantic concept classifiers requires a large number of positive training examples, which in turn has high human annotation cost. In this paper, we propose an approach that exploits the external concepts-based videos and event-based videos simultaneously to learn an intermediate representation from video features. Our algorithm integrates the classifier inference and latent intermediate representation into a joint framework. The joint optimization of the intermediate representation and the classifier makes them mutually beneficial and reciprocal. Effectively, the intermediate representation and the classifier are tightly correlated. The classifier dependent intermediate representation not only accurately reflects the task semantics but is also more suitable for the specific classifier. Thus we have created a discriminative semantic analysis framework based on a tightly coupled intermediate representation. Extensive experiments on multimedia event detection using real-world videos demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:1628 / 1637
页数:10
相关论文
共 50 条
  • [1] Comparison of classifier-specific feature selection algorithms
    Kudo, M
    Somol, P
    Pudil, P
    Shimbo, M
    Sklansky, J
    [J]. ADVANCES IN PATTERN RECOGNITION, 2000, 1876 : 677 - 686
  • [2] PAFS - An Efficient Method for Classifier-Specific Feature Selection
    Pham Quang Huy
    Ngom, Alioune
    Rueda, Luis
    [J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [3] An evaluation of classifier-specific filter measure performance for feature selection
    Freeman, Cecille
    Kulic, Dana
    Basir, Otman
    [J]. PATTERN RECOGNITION, 2015, 48 (05) : 1812 - 1826
  • [4] Combining multi-representation for multimedia event detection using co-training
    Bin, Yi
    Yang, Yang
    Shen, Fumin
    Xu, Xing
    [J]. NEUROCOMPUTING, 2016, 217 : 11 - 18
  • [5] Bi-Level Semantic Representation Analysis for Multimedia Event Detection
    Chang, Xiaojun
    Ma, Zhigang
    Yang, Yi
    Zeng, Zhiqiang
    Hauptmann, Alexander G.
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (05) : 1180 - 1197
  • [6] Intermediate representation for vision and multimedia applications
    Yan, Yan
    Han, Yahong
    Radeva, Petia
    Tian, Qi
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 44 : 227 - 228
  • [7] Multimedia Event Detection using Visual Concept Signatures
    Younessian, Ehsan
    Quinn, Michael
    Mitamura, Teruko
    Hauptmann, Alex
    [J]. MULTIMEDIA CONTENT AND MOBILE DEVICES, 2013, 8667
  • [8] Multimedia classification and event detection using double fusion
    Zhen-zhong Lan
    Lei Bao
    Shoou-I Yu
    Wei Liu
    Alexander G. Hauptmann
    [J]. Multimedia Tools and Applications, 2014, 71 : 333 - 347
  • [9] Multimedia classification and event detection using double fusion
    Lan, Zhen-zhong
    Bao, Lei
    Yu, Shoou-I
    Liu, Wei
    Hauptmann, Alexander G.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 71 (01) : 333 - 347
  • [10] MULTIMEDIA EVENT DETECTION USING GMM SUPERVECTORS AND SVMS
    Kamishima, Yusuke
    Inoue, Nakamasa
    Shinoda, Koichi
    Sato, Shunsuke
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 3089 - 3092