Deep feature extraction and motion representation for satellite video scene classification

被引:0
|
作者
Yanfeng GU [1 ]
Huan LIU [1 ]
Tengfei WANG [1 ]
Shengyang LI [2 ,3 ]
Guoming GAO [1 ]
机构
[1] School of Electronics and Information Engineering, Harbin Institute of Technology
[2] Technology and Engineering Center for Space Utilization, Chinese Academy of Sciences
[3] The Key Laboratory of Space Utilization, Chinese Academy of Sciences
基金
中国国家自然科学基金;
关键词
satellite videos; classification; convolutional neural network; CNN; long and short term memory; LSTM; motion representation;
D O I
暂无
中图分类号
TP751 [图像处理方法];
学科分类号
081002 ;
摘要
Satellite video scene classification(SVSC) is an advanced topic in the remote sensing field, which refers to determine the video scene categories from satellite videos. SVSC is an important and fundamental step for satellite video analysis and understanding, which provides priors for the presence of objects and dynamic events. In this paper, a two-stage framework is proposed to extract spatial features and motion features for SVSC. More specifically, the first stage is designed to extract spatial features for satellite videos.Representative frames are firstly selected based on the blur detection and spatial activity of satellite videos.Then the fine-tuned visual geometry group network(VGG-Net) is transferred to extract spatial features based on spatial content. The second stage is designed to build motion representation for satellite videos.The motion representation of moving targets in satellite videos is first built by the second temporal principal component of principal component analysis(PCA). Second, features from the first fully connected layer of VGG-Net are used as high-level spatial representation for moving targets. Third, a small network of long and short term memory(LSTM) is further designed for encoding temporal information. Two-stage features respectively characterize spatial and temporal patterns of satellite scenes, which are finally fused for SVSC.A satellite video dataset is built for video scene classification, including 7209 video segments and covering 8 scene categories. These satellite videos are from Jilin-1 satellites and Urthecast. The experimental results show the efficiency of our proposed framework for SVSC.
引用
收藏
页码:97 / 111
页数:15
相关论文
共 50 条
  • [1] Deep feature extraction and motion representation for satellite video scene classification
    Gu, Yanfeng
    Liu, Huan
    Wang, Tengfei
    Li, Shengyang
    Gao, Guoming
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (04)
  • [2] Deep feature extraction and motion representation for satellite video scene classification
    Yanfeng Gu
    Huan Liu
    Tengfei Wang
    Shengyang Li
    Guoming Gao
    [J]. Science China Information Sciences, 2020, 63
  • [3] Unsupervised feature extraction for the representation and recognition of lip motion video
    Lee, Michelle Jeungeun
    Lee, Kyungsuk David
    Lee, Soo-Young
    [J]. COMPUTATIONAL INTELLIGENCE AND BIOINFORMATICS, PT 3, PROCEEDINGS, 2006, 4115 : 741 - 746
  • [4] HIERARCHICAL DEEP FEATURE REPRESENTATION FOR HIGH-RESOLUTION SCENE CLASSIFICATION
    Bian, Xiaoyong
    Chen, Chunfang
    Deng, Chunhua
    Liu, Ruiyao
    Du, Qian
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 517 - 520
  • [5] A Deep Scene Representation for Aerial Scene Classification
    Zheng, Xiangtao
    Yuan, Yuan
    Lu, Xiaoqiang
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (07): : 4799 - 4809
  • [6] Feature Extraction and Scene Classification for Remote Sensing Image Based on Sparse Representation
    Guo, Youliang
    Zhang, Junping
    Zhong, Shengwei
    [J]. ALGORITHMS, TECHNOLOGIES, AND APPLICATIONS FOR MULTISPECTRAL AND HYPERSPECTRAL IMAGERY XXV, 2019, 10986
  • [7] Deep Feature Extraction and Feature Fusion for Bi-Temporal Satellite Image Classification
    Asokan, Anju
    Anitha, J.
    Patrut, Bogdan
    Danciulescu, Dana
    Hemanth, D. Jude
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 66 (01): : 373 - 388
  • [8] Audio feature extraction & analysis for scene classification
    Liu, Z
    Huang, JC
    Wang, Y
    Chen, TH
    [J]. 1997 IEEE FIRST WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1997, : 343 - 348
  • [9] Deep Feature Embedding and Hierarchical Classification for Audio Scene Classification
    Pham, Lam
    McLoughlin, Ian
    Phan, Huy
    Palaniappan, R.
    Merlins, Alfred
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [10] Feature Extraction of Binaural Recordings for Acoustic Scene Classification
    Zielinski, Slawomir K.
    Lee, Hyunkook
    [J]. PROCEEDINGS OF THE 2018 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2018, : 585 - 588