Evaluating multimedia features and fusion for example-based event detection

被引:0
|
作者
Gregory K. Myers
Ramesh Nallapati
Julien van Hout
Stephanie Pancoast
Ramakant Nevatia
Chen Sun
Amirhossein Habibian
Dennis C. Koelma
Koen E. A. van de Sande
Arnold W. M. Smeulders
Cees G. M. Snoek
机构
[1] SRI International (SRI),Institute for Robotics and Intelligent Systems
[2] University of Southern California (USC),undefined
[3] University of Amsterdam (UvA),undefined
[4] IBM Thomas J Watson Research Center,undefined
来源
关键词
Multimedia event detection; Video retrieval; Content extraction; Difference coding; Late fusion;
D O I
暂无
中图分类号
学科分类号
摘要
Multimedia event detection (MED) is a challenging problem because of the heterogeneous content and variable quality found in large collections of Internet videos. To study the value of multimedia features and fusion for representing and learning events from a set of example video clips, we created SESAME, a system for video SEarch with Speed and Accuracy for Multimedia Events. SESAME includes multiple bag-of-words event classifiers based on single data types: low-level visual, motion, and audio features; high-level semantic visual concepts; and automatic speech recognition. Event detection performance was evaluated for each event classifier. The performance of low-level visual and motion features was improved by the use of difference coding. The accuracy of the visual concepts was nearly as strong as that of the low-level visual features. Experiments with a number of fusion methods for combining the event detection scores from these classifiers revealed that simple fusion methods, such as arithmetic mean, perform as well as or better than other, more complex fusion methods. SESAME’s performance in the 2012 TRECVID MED evaluation was one of the best reported.
引用
收藏
页码:17 / 32
页数:15
相关论文
共 50 条
  • [41] Example-Based Facial Rigging
    Li, Hao
    Weise, Thibaut
    Pauly, Mark
    ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04):
  • [42] EXAMPLE-BASED MOTION MANIPULATION
    Su, Pin-Ching
    Chen, Hwann-Tzong
    Cheng, Chia-Ming
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4647 - 4651
  • [43] Example-Based Damping Design
    Xu, Hongyi
    Barbic, Jernej
    ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04):
  • [44] Event Detection Based on Hierarchical Event Fusion
    Xiao, Xiaoling
    Zhang, Xiang
    2009 INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND INFORMATION APPLICATION TECHNOLOGY, VOL II, PROCEEDINGS, 2009, : 483 - +
  • [45] Example-Based Treebank Querying
    Augustinus, Liesbeth
    Vandeghinste, Vincent
    Van Eynde, Frank
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 3161 - 3167
  • [46] Example-based curve synthesis
    Merrell, Paul
    Manocha, Dinesh
    COMPUTERS & GRAPHICS-UK, 2010, 34 (04): : 304 - 311
  • [47] Example-based motion cloning
    Park, MJ
    Shin, SY
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2004, 15 (3-4) : 245 - 257
  • [48] Example-Based Fractured Appearance
    Glondu, L.
    Muguercia, L.
    Marchal, M.
    Bosch, C.
    Rushmeier, H.
    Dumont, G.
    Drettakis, G.
    COMPUTER GRAPHICS FORUM, 2012, 31 (04) : 1547 - 1556
  • [49] Event Detection Model Based on the Fusion of Hierarchical Syntactic and Type Semantic Features
    Rao, Guozheng
    Cong, Qing
    Zhang, Li
    Tian, Kaijia
    2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
  • [50] Audio based event detection for multimedia surveillance
    Atrey, Pradeep K.
    Maddage, Namunu C.
    Kankanhalli, Mohan S.
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 5671 - 5674