VideoToVecs: a new video representation based on deep learning techniques for video classification and clustering

被引:4
|
作者
Ibrahim, Zein Al Abidin [1 ]
Saab, Marwa [1 ]
Sbeity, Ihab [1 ]
机构
[1] Lebanese Univ, Fac Sci, Comp Sci Dept, Hadat Campus, Beirut, Lebanon
关键词
Video; Representation; Classification; Clustering; Similarity measure; Deep learning;
D O I
10.1007/s42452-019-0573-6
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
With the recent revolution in the field of multimedia technology, video data have become much easier and straight-forward to be created, stored and transferred on a huge scale with small costs. The big amount of created data pushed the research community to delve into various study areas to aid the huge proliferation of multimedia content such as video structuring, video classification and clustering, events and objects detection, video recommendation and many other video content analysis techniques. The key success of any analysis technique relies on the audiovisual features extracted from the video. Motivated by the appearance and efficiency of deep learning techniques, we propose in this paper a new deep-learning-based features representation of videos. We depend on image-based features extracted from the sequence of frames in the video using deep learning techniques. A mapping approach named VideoToVecs is then applied to transform the extracted features into a matrix in which each row contains features of the same type. This matrix is named deep features video matrix. The efficiency of the representation is tested on 5261-video dataset for classification and clustering, and the obtained results were very promising as we will see in the paper.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] VideoToVecs: a new video representation based on deep learning techniques for video classification and clustering
    Zein Al Abidin Ibrahim
    Marwa Saab
    Ihab Sbeity
    [J]. SN Applied Sciences, 2019, 1
  • [2] Multimodal deep representation learning for video classification
    Tian, Haiman
    Tao, Yudong
    Pouyanfar, Samira
    Chen, Shu-Ching
    Shyu, Mei-Ling
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (03): : 1325 - 1341
  • [3] Multimodal deep representation learning for video classification
    Haiman Tian
    Yudong Tao
    Samira Pouyanfar
    Shu-Ching Chen
    Mei-Ling Shyu
    [J]. World Wide Web, 2019, 22 : 1325 - 1341
  • [4] Deep Learning Based Video Event Classification
    Gencaslan, Serim
    Utku, Anil
    Akcayol, M. Ali
    [J]. JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2023, 26 (03): : 1155 - 1165
  • [5] Deep video representation learning: a survey
    Ravanbakhsh, Elham
    Liang, Yongqing
    Ramanujam, J.
    Li, Xin
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (20) : 59195 - 59225
  • [6] Analysis of Watermarked Video Optimization and Training Based on Classification Using Deep Learning Techniques
    Muthulakshmi K.
    Valarmathi K.
    [J]. SN Computer Science, 5 (2)
  • [7] Deep Learning Based Sports Video Classification Research
    Li W.
    [J]. Appl. Math. Nonlinear Sci., 1
  • [8] A Deep Metric Learning Based Video Classification Method
    Zhi Hongxin
    Yu Hongtao
    Li Shaomei
    Gao Chao
    Wang Yanchuan
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (11) : 2562 - 2569
  • [9] Video Emotional Classification Based on Deep Reinforcement Learning
    Yuan, Tingting
    Yuan, Yuyu
    [J]. 2023 3RD ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS TECHNOLOGY AND COMPUTER SCIENCE, ACCTCS, 2023, : 168 - 171
  • [10] Vehicle Representation and Classification of Surveillance Video Based on Sparse Learning
    Chen Xiangjun
    Ruan Yaduan
    Zhang Peng
    Chen Qimei
    Zhang Xinggan
    [J]. CHINA COMMUNICATIONS, 2014, 11 (01) : 135 - 141