VideoToVecs: a new video representation based on deep learning techniques for video classification and clustering

被引:4
|
作者
Ibrahim, Zein Al Abidin [1 ]
Saab, Marwa [1 ]
Sbeity, Ihab [1 ]
机构
[1] Lebanese Univ, Fac Sci, Comp Sci Dept, Hadat Campus, Beirut, Lebanon
关键词
Video; Representation; Classification; Clustering; Similarity measure; Deep learning;
D O I
10.1007/s42452-019-0573-6
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
With the recent revolution in the field of multimedia technology, video data have become much easier and straight-forward to be created, stored and transferred on a huge scale with small costs. The big amount of created data pushed the research community to delve into various study areas to aid the huge proliferation of multimedia content such as video structuring, video classification and clustering, events and objects detection, video recommendation and many other video content analysis techniques. The key success of any analysis technique relies on the audiovisual features extracted from the video. Motivated by the appearance and efficiency of deep learning techniques, we propose in this paper a new deep-learning-based features representation of videos. We depend on image-based features extracted from the sequence of frames in the video using deep learning techniques. A mapping approach named VideoToVecs is then applied to transform the extracted features into a matrix in which each row contains features of the same type. This matrix is named deep features video matrix. The efficiency of the representation is tested on 5261-video dataset for classification and clustering, and the obtained results were very promising as we will see in the paper.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Video Deep Learning Classification for Autonomous Vehicle Navigation
    Salem, Fathi M.
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 1014 - 1017
  • [42] A novel infrared video surveillance system using deep learning based techniques
    Huaizhong Zhang
    Chunbo Luo
    Qi Wang
    Matthew Kitchin
    Andrew Parmley
    Jesus Monge-Alvarez
    Pablo Casaseca-de-la-Higuera
    [J]. Multimedia Tools and Applications, 2018, 77 : 26657 - 26676
  • [43] A novel infrared video surveillance system using deep learning based techniques
    Zhang, Huaizhong
    Luo, Chunbo
    Wang, Qi
    Kitchin, Matthew
    Parmley, Andrew
    Monge-Alvarez, Jesus
    Casaseca-de-la-Higuera, Pablo
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (20) : 26657 - 26676
  • [44] VIDEO CAPTIONING BASED ON JOINT IMAGE-AUDIO DEEP LEARNING TECHNIQUES
    Wang, Chien-Yao
    Liaw, Pei-Sin
    Liang, Kai-Wen
    Wang, Jai-Ching
    Chang, Pao-Chi
    [J]. 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE-BERLIN), 2019, : 127 - 131
  • [45] An Enhanced Deep Learning-Based DeepFake Video Detection and Classification System
    Awotunde, Joseph Bamidele
    Jimoh, Rasheed Gbenga
    Imoize, Agbotiname Lucky
    Abdulrazaq, Akeem Tayo
    Li, Chun-Ta
    Lee, Cheng-Chi
    [J]. ELECTRONICS, 2023, 12 (01)
  • [46] Big Data and Deep Learning-Based Video Classification Model for Sports
    Wang, Lin
    Zhang, Haiyan
    Yuan, Guoliang
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [47] Research on video classification method of key pollution sources based on deep learning
    Zhao, Kunrong
    He, Tingting
    Wu, Shuang
    Wang, Songling
    Dai, Bilan
    Yang, Qifan
    Lei, Yutao
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 59 : 283 - 291
  • [48] A Deep Learning Based Video Classification System Using Multimodality Correlation Approach
    Lee, Jungheon
    Koh, Youngsan
    Yang, Jihoon
    [J]. 2017 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2017, : 2021 - 2025
  • [49] Self-Supervised Video Representation Learning Using Improved Instance-Wise Contrastive Learning and Deep Clustering
    Zhu, Yisheng
    Shuai, Hui
    Liu, Guangcan
    Liu, Qingshan
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6741 - 6752
  • [50] Large Scale Video Representation Learning via Relational Graph Clustering
    Lee, Hyodong
    Lee, Joonseok
    Ng, Joe Yue-Hei
    Natsev, Paul
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6806 - 6815