Low-Complexity Video Classification using Recurrent Neural Networks

被引:0
|
作者
Abramovich, Ifat [1 ]
Ben-Yehuda, Tomer [1 ]
Cohen, Rami [1 ]
机构
[1] Technion Israel Inst Technol, Andrew & Erna Viterbi Fac Elect Engn, SIPL, IL-3200003 Haifa, Israel
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep learning has led to great successes in computer vision tasks such as image classification. This is mostly attributed to the availability of large image datasets such as ImageNet. However, the progress in video classification has been slower, especially due to the small size of available video datasets and larger computational and memory demands. To promote innovation and advancement in this field, Google announced the YouTube-8M dataset in 2016, which is a public video dataset containing about 8-million tagged videos. In this paper, we train several deep neural networks for video classification on a subset of YouTube-8M. Our approach is based on extracting frame level features using the Inception-v3 network, which are later used by recurrent neural networks with LSTM/BiLSTM units for video classification. We focus on network architectures with low computational requirements and present a detailed performance comparison. We show that for 5 categories, more than 96% of the videos are labeled correctly, where for 10 categories more than 89% of the videos are labeled correctly. We demonstrate that transfer learning leads to substantial saving in training time, while offering good results.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Classification of Bird Sound Using High-and Low-Complexity Convolutional Neural Networks
    Saad, Aymen
    Ahmed, Javed
    Elaraby, Ahmed
    [J]. TRAITEMENT DU SIGNAL, 2022, 39 (01) : 187 - 193
  • [2] Low-Complexity Deep Neural Networks for Image Object Classification and Detection
    Hsiao, Shen-Fu
    Zhan, Jing-Fu
    Lin, Chih-Chien
    [J]. 2019 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2019), 2019, : 313 - 316
  • [3] LOW-COMPLEXITY SCALER BASED ON CONVOLUTIONAL NEURAL NETWORKS FOR ADAPTIVE VIDEO STREAMING
    Kim, Jaehwan
    Kim, Dongkyu
    Park, Min Woo
    Lee, Chaeeun
    Park, Youngo
    Choi, Kwang Pyo
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 131 - 135
  • [4] Video Genre Classification using Convolutional Recurrent Neural Networks
    Lakshmi, K. Prasanna
    Solanki, Mihir
    Dara, Jyothi Swaroop
    Kompalli, Avinash Bhargav
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (03) : 170 - 176
  • [5] Low-Complexity Approximate Convolutional Neural Networks
    Cintra, Renato J.
    Duffner, Stefan
    Garcia, Christophe
    Leite, Andre
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (12) : 5981 - 5992
  • [6] High-performance low-complexity wordspotting using neural networks
    Chang, EI
    Lippmann, RP
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1997, 45 (11) : 2864 - 2870
  • [7] Low-complexity video compression for wireless sensor networks
    Magli, E
    Mancin, M
    Merello, L
    [J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 585 - 588
  • [8] Compact Recurrent Neural Networks for Acoustic Event Detection on Low-Energy Low-Complexity Platforms
    Cerutti, Gianmarco
    Prasad, Rahul
    Brutti, Alessio
    Farella, Elisabetta
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (04) : 654 - 664
  • [9] A study of low-complexity tools for semantic classification of mobile video
    Mariappan, A
    Igarta, M
    Taskiran, C
    Gandhi, B
    Delp, EJ
    [J]. MULTIMEDIA ON MOBILE DEVICES II, 2006, 6074
  • [10] Low-Complexity Neural Networks for Baseband Signal Processing
    Larue, Guillaume
    Dhiflaoui, Mona
    Dufrene, Louis-Adrien
    Lampin, Quentin
    Chollet, Paul
    Ghauch, Hadi
    Rekaya, Ghaya
    [J]. 2020 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2020,