Video Event Recognition Leveraging Hierarchy of Semantic Concepts

被引:0
|
作者
Soltanian, Mohammad [1 ,2 ]
Ghaemmaghami, Shahrokh [1 ,2 ]
机构
[1] Sharif Univ Technol, Elect Engn Dept, Tehran, Iran
[2] Sharif Univ Technol, Elect Res Inst, Tehran, Iran
关键词
Wordnet tree; convolutional neural network; Columbia Consumer Video dataset; average pooling; max pooling; mean average precision; FEATURES;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A new method for exploiting the semantic hierarchical structure of visual concepts in video event recognition task is proposed in this paper. The visual concepts are detected using the readily available Convolutional Neural Network (CNN) structures which make the recognition system extremely efficient in cases with limited hardware resources. The employed CNNs assign scores to each of the predetermined visual concepts in each video frame and the resulting concept scores are fed to the proposed hierarchical post-processing scheme. Our post-processing module takes advantage of the semantic hierarchy of the concepts to enhance the recognition accuracy of event recognition. The hierarchical post-processing works based on the relative shortest distance of concepts specified in Wordnet concept tree and results in a tangible alleviation of uncertainty of the concept scores at the CNN output. The post-processed scores are then delivered to the fine-tuned support vector machine (SVM) classifier to discriminate between the visual event classes. The proposed scheme improves the event recognition accuracy in terms of mean Average Precision (mAP) as demonstrated by the experiments on Columbia Consumer Video (CCV) dataset.
引用
收藏
页码:1549 / 1553
页数:5
相关论文
共 50 条
  • [1] Event detection and recognition for semantic annotation of video
    Lamberto Ballan
    Marco Bertini
    Alberto Del Bimbo
    Lorenzo Seidenari
    Giuseppe Serra
    [J]. Multimedia Tools and Applications, 2011, 51 : 279 - 302
  • [2] Event detection and recognition for semantic annotation of video
    Ballan, Lamberto
    Bertini, Marco
    Del Bimbo, Alberto
    Seidenari, Lorenzo
    Serra, Giuseppe
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2011, 51 (01) : 279 - 302
  • [3] Leveraging Weak Semantic Relevance for Complex Video Event Classification
    Li, Chao
    Cao, Jiewei
    Huang, Zi
    Zhu, Lei
    Shen, Heng Tao
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3667 - 3676
  • [4] Mapping query to semantic concepts: Leveraging semantic indices for automatic and interactive video retrieval
    Wang, Dong
    Wang, Zhikun
    Li, Xirong
    Liu, Xiaobing
    Li, Jianmin
    Zhang, Bo
    [J]. ICSC 2007: INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, PROCEEDINGS, 2007, : 313 - +
  • [5] Video Event Recognition with Fuzzy Semantic Petri Nets
    Szwed, Piotr
    [J]. MAN-MACHINE INTERACTIONS 3, 2014, 242 : 431 - 439
  • [6] Semantic Model Vectors for Complex Video Event Recognition
    Merler, Michele
    Huang, Bert
    Xie, Lexing
    Hua, Gang
    Natsev, Apostol
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (01) : 88 - 101
  • [7] Leveraging the Video-Level Semantic Consistency of Event for Audio-Visual Event Localization
    Jiang, Yuanyuan
    Yin, Jianqin
    Dang, Yonghao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4617 - 4627
  • [8] EXPLORING AUDIO SEMANTIC CONCEPTS FOR EVENT-BASED VIDEO RETRIEVAL
    Wang, Yipei
    Rawat, Shourabh
    Metze, Florian
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [9] Semantic Event Fusion of Different Visual Modality Concepts for Activity Recognition
    Crispim-Junior, Carlos F.
    Buso, Vincent
    Avgerinakis, Konstantinos
    Meditskos, Georgios
    Briassouli, Alexia
    Benois-Pineau, Jenny
    Kompatsiaris, Ioannis
    Bremond, Francois
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (08) : 1598 - 1611
  • [10] Object Tracking and Video Event Recognition with Fuzzy Semantic Petri Nets
    Szwed, Piotr
    Komorkiewicz, Mateusz
    [J]. 2013 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2013, : 167 - 174