A generic approach to semantic video indexing using adaptive fusion of multimodal classifiers

被引:4
|
作者
Kim, Dae-Jin [1 ]
Frigui, Hichem [1 ]
Fadeev, Aleksey [1 ]
机构
[1] Univ Louisville, Multimedia Res Lab, CECS Dept, Louisville, KY 40292 USA
基金
美国国家科学基金会;
关键词
video summary; semantic indexing; algorithm fusion; low-level descriptors;
D O I
10.1002/ima.20147
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a novel method for fusing the results of multiple semantic video indexing algorithms that use different types of feature descriptors and different classification methods. This method, called Context-Dependent Fusion (CDF), is motivated by the fact that the relative performance of different semantic indexing methods can vary significantly depending on the video type, context information, and the high-level concept of the video segment to be labeled. The training part of CDF has two main components: context extraction and algorithm fusion. In context extraction, the low-level multimodal descriptors extracted by the different classification algorithms are combined and used to partition the feature space into clusters of similar video shots, or contexts. The algorithm fusion component assigns aggregation weights to the individual classifiers within each context based on their relative performance in that context. Results on the TRECVID-2002 data collections show that the proposed method can identify meaningful and coherent clusters and that the performance of the different labeling algorithms can vary significantly across different clusters. Our initial experiments have indicated that the context-dependent fusion outperforms the individual algorithms and the global fusion of those algorithms. We also show that using standard multimodal descriptors and a simple k-NN classifier, the CDF approach provides results that are comparable to the state-of-the-art methods in semantic indexing. (c) 2008 Wiley Periodicals, Inc.
引用
收藏
页码:124 / 136
页数:13
相关论文
共 50 条
  • [1] A Generic Approach for Video Indexing
    Gayathri, N.
    Mahesh, K.
    [J]. PROCEEDING OF THE INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS, BIG DATA AND IOT (ICCBI-2018), 2020, 31 : 701 - 708
  • [2] COMMUNITY-DRIVEN HIERARCHICAL FUSION OF NUMEROUS CLASSIFIERS: APPLICATION TO VIDEO SEMANTIC INDEXING
    Bredin, Herve
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2329 - 2332
  • [3] The semantic pathfinder for generic news video indexing
    Snoek, C. G. M.
    Worring, M.
    Geusebroek, J. M.
    Koelma, D. C.
    Seinstra, F. J.
    Smeulders, A. W. M.
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1469 - +
  • [4] Combining hierarchical classifiers with video semantic indexing systems
    Zhou, WS
    Dao, SK
    [J]. ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 78 - 85
  • [5] Semantic video indexing using context-dependent fusion
    Kim, Dae-Jin
    Frigui, Hichem
    Fadeev, Aleksey
    [J]. MULTIMEDIA CONTENT ACCESS: ALGORITHMS AND SYSTEMS II, 2008, 6820
  • [6] Multimodal approach for summarizing and indexing news video
    Kim, JG
    Chang, HS
    Kim, YT
    Kang, K
    Kim, M
    Kim, J
    Kim, HM
    [J]. ETRI JOURNAL, 2002, 24 (01) : 1 - 11
  • [7] Multimodal Information Fusion for Semantic Video Analysis
    Gulen, Elvan
    Yilmaz, Turgay
    Yazici, Adnan
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2012, 3 (04): : 52 - 74
  • [8] Semantic indexing of news video sequences: A multimodal hierarchical approach based on hidden Markov model
    Kolekar, M. H.
    Sengupta, S.
    [J]. TENCON 2005 - 2005 IEEE REGION 10 CONFERENCE, VOLS 1-5, 2006, : 2647 - 2652
  • [9] Semantic Browsing of Video Surveillance Databases through Online Generic Indexing
    Marraud, Denis
    Cepas, Benjamin
    Reithler, Livier
    [J]. 2009 THIRD ACM/IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS, 2009, : 342 - 349
  • [10] Classifier fusion: Combination methods for semantic indexing in video content
    Benmokhtar, Rachid
    Huet, Benoit
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2006, PT 2, 2006, 4132 : 65 - 74