Video Event Recognition Leveraging Hierarchy of Semantic Concepts

被引:0
|
作者
Soltanian, Mohammad [1 ,2 ]
Ghaemmaghami, Shahrokh [1 ,2 ]
机构
[1] Sharif Univ Technol, Elect Engn Dept, Tehran, Iran
[2] Sharif Univ Technol, Elect Res Inst, Tehran, Iran
关键词
Wordnet tree; convolutional neural network; Columbia Consumer Video dataset; average pooling; max pooling; mean average precision; FEATURES;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A new method for exploiting the semantic hierarchical structure of visual concepts in video event recognition task is proposed in this paper. The visual concepts are detected using the readily available Convolutional Neural Network (CNN) structures which make the recognition system extremely efficient in cases with limited hardware resources. The employed CNNs assign scores to each of the predetermined visual concepts in each video frame and the resulting concept scores are fed to the proposed hierarchical post-processing scheme. Our post-processing module takes advantage of the semantic hierarchy of the concepts to enhance the recognition accuracy of event recognition. The hierarchical post-processing works based on the relative shortest distance of concepts specified in Wordnet concept tree and results in a tangible alleviation of uncertainty of the concept scores at the CNN output. The post-processed scores are then delivered to the fine-tuned support vector machine (SVM) classifier to discriminate between the visual event classes. The proposed scheme improves the event recognition accuracy in terms of mean Average Precision (mAP) as demonstrated by the experiments on Columbia Consumer Video (CCV) dataset.
引用
下载
收藏
页码:1549 / 1553
页数:5
相关论文
共 50 条
  • [21] Inferring semantic concepts for video indexing and retrieval
    Naphade, MR
    Huang, TS
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 766 - 769
  • [22] Enhancing Video Event Recognition Using Automatically Constructed Semantic-Visual Knowledge Base
    Zhang, Xishan
    Yang, Yang
    Zhang, Yongdong
    Luan, Huanbo
    Li, Jintao
    Zhang, Hanwang
    Chua, Tat-Seng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (09) : 1562 - 1575
  • [23] Learning Rules for Semantic Video Event Annotation
    Bertini, Marco
    Del Bimbo, Alberto
    Serra, Giuseppe
    VISUAL INFORMATION SYSTEMS: WEB-BASED VISUAL INFORMATION SEARCH AND MANAGEMENT, VISUAL 2008, 2008, 5188 : 192 - 203
  • [24] Semantic Feature Mining for Video Event Understanding
    Yang, Xiaoshan
    Zhang, Tianzhu
    Xu, Changsheng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2016, 12 (04)
  • [25] Semantic trajectory based video event detection
    Wang X.-F.
    Zhang D.-P.
    Wang F.
    Shi Z.-Z.
    Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (10): : 1845 - 1858
  • [26] Suspicious Network Event Recognition Leveraging on Machine Learning
    Sartiano, Daniele
    Attardi, Giuseppe
    Deri, Luca
    Martinelli, Maurizio
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 5915 - 5920
  • [27] Video Segmentation Descriptors for Event Recognition
    Trichet, Remi
    Nevatia, Ramakant
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1940 - 1945
  • [28] Event Recognition and Classification in Sports Video
    Ellappan, Vijayan
    Rajasekaran, Rajkumar
    2017 SECOND INTERNATIONAL CONFERENCE ON RECENT TRENDS AND CHALLENGES IN COMPUTATIONAL MODELS (ICRTCCM), 2017, : 182 - 187
  • [29] Machine learning for video event recognition
    Avola, Danilo
    Cascio, Marco
    Cinque, Luigi
    Foresti, Gian Luca
    Pannone, Daniele
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2021, 28 (03) : 309 - 332
  • [30] PERIODIC EVENT DETECTION AND RECOGNITION IN VIDEO
    Vivek, E. P.
    Pogalin, Erik
    Smeulders, Arnold W. M.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3537 - 3540