Scene Recognition in Short Video with Multi-Resolution CNNs

被引:0
|
作者
Dong, Xu [1 ]
Tan, Li [1 ]
Zhou, Lina [1 ]
Song, Yanyan [1 ]
机构
[1] Beijing Technol & Business Univ, Sch Comp & Informat Engn, Beijing, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
scene recognition; deep learning; deep fusion network;
D O I
10.1109/icaibd.2019.8837029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to solve the problems of scene recognition in short videos, this paper proposes a deep fusion network based on VGGNet. Firstly, VGGNet16 is used to learn global features, and VGGNet19 is used to learn images in details. After that the learning features are fused by means of weighted averaging; In the public dataset 2017-AI-Challenger-scene-classification, the result of top3 is 92.2% and the top3 of the Charades short video dataset has achieved 78.9%, which proves that the proposed method has a good performance in scene recognition.
引用
收藏
页码:419 / 422
页数:4
相关论文
共 50 条
  • [1] Encoding Multi-resolution Two-Stream CNNs for Action Recognition
    Xue, Weichen
    Zhao, Haohua
    Zhang, Liqing
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2016, PT III, 2016, 9949 : 564 - 571
  • [2] Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs
    Wang, Limin
    Guo, Sheng
    Huang, Weilin
    Xiong, Yuanjun
    Qiao, Yu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (04) : 2055 - 2068
  • [3] Blurring Scene Recognition in Short Video
    Tan, Li
    Song, Yanyan
    Dong, Xu
    Zhou, Lina
    [J]. 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 75 - 78
  • [4] A design framework for multi-resolution video servers
    Cho, J
    Sung, MY
    Shin, H
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2003, 20 (03) : 237 - 262
  • [5] Replica striping for multi-resolution video servers
    Song, M
    Shin, H
    [J]. PROTOCOLS AND SYSTEMS FOR INTERACTIVE DISTRIBUTED MULTIMEDIA, PROCEEDINGS, 2002, 2515 : 300 - 312
  • [6] Temporal multi-resolution analysis for video segmentation
    Lin, Y
    Kankanhalli, MS
    Chua, TS
    [J]. STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2000, 2000, 3972 : 494 - 505
  • [7] A Design Framework for Multi-Resolution Video Servers
    Jinsung Cho
    Minyoung Sung
    Heonshik Shin
    [J]. Multimedia Tools and Applications, 2003, 20 : 237 - 262
  • [8] Capturing and presenting shared multi-resolution video
    Kimber, D
    Qiong, L
    Foote, J
    Wilcox, L
    [J]. INTERNET MULTIMEDIA MANAGEMENT SYSTEMS III, 2002, 4862 : 261 - 271
  • [9] Iris recognition basing on multi-resolution analysis
    Pan, L. L.
    Xie, M.
    [J]. 2006 1ST IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-3, 2006, : 1236 - +
  • [10] Face recognition using multi-resolution transform
    Arivazhagan, S.
    Mumtaj, J.
    Ganesan, L.
    [J]. ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 301 - +