Multi-layer multi-view topic model for classifying advertising video

被引:32
|
作者
Hou, Sujuan [1 ,5 ]
Chen, Ling [2 ]
Tao, Dacheng [2 ]
Zhou, Shangbo [3 ]
Liu, Wenjie [4 ]
Zheng, Yuanjie [1 ,5 ,6 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250014, Peoples R China
[2] Univ Technol Sydney, FEIT, Ctr Quantum Computat & Intelligent Syst, Sydney, NSW, Australia
[3] Chongqing Univ, Coll Comp Sci, Chongqing 400030, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing, Jiangsu, Peoples R China
[5] Shandong Normal Univ, Inst Life Sci, Jinan 250014, Peoples R China
[6] Shandong Normal Univ, Key Lab Intelligent Informat Proc, Jinan 250014, Peoples R China
关键词
Video representation; Ad video classification; Multi-layer; Multi-view; Topic model; CLASSIFICATION;
D O I
10.1016/j.patcog.2017.03.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent proliferation of advertising (ad) videos has driven the research in multiple applications, ranging from video analysis to video indexing and retrieval. Among them, classifying ad video is a key task because it allows automatic organization of videos according to categories or genres, and this further enables ad video indexing and retrieval. However, classifying ad video is challenging compared to other types of video classification because of its unconstrained content. While many studies focus on embedding ads relevant to videos, to our knowledge, few focus on ad video classification. In order to classify ad video, this paper proposes a novel ad video representation that aims to sufficiently capture the latent semantics of video content from multiple views in an unsupervised manner. In particular, we represent ad videos from four views, including bag-of-feature (BOF), vector of locally aggregated descriptors (VLAD), fisher vector (FV) and object bank (OB). We then devise a multi-layer multi-view topic model, mlmv_LDA, which models the topics of videos from different views. A topical representation for video, supporting category-related task, is finally achieved by the proposed method. Our empirical classification results on 10,111 real-world ad videos demonstrate that the proposed approach effectively differentiate ad videos. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:66 / 81
页数:16
相关论文
共 50 条
  • [1] A multi-layer multi-view stacking model for credit risk assessment
    Han, Wenfang
    Gu, Xiao
    Jian, Ling
    [J]. INTELLIGENT DATA ANALYSIS, 2023, 27 (05) : 1457 - 1475
  • [2] A multi-layer multi-view architecture for software engineering environments
    Kempkens, R
    Rösch, P
    Scott, L
    Zettel, J
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2000, 42 (02) : 141 - 149
  • [3] Reciprocal Multi-Layer Subspace Learning for Multi-View Clustering
    Li, Ruihuang
    Zhang, Changqing
    Fu, Huazhu
    Peng, Xi
    Zhou, Tianyi
    Hu, Qinghua
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8171 - 8179
  • [4] MLRMV: Multi-layer representation for multi-view action recognition
    Liu, Zhigang
    Yin, Ziyang
    Wu, Yin
    [J]. IMAGE AND VISION COMPUTING, 2021, 116
  • [5] Cross-view multi-layer perceptron for incomplete multi-view learning
    Wang, Zhi
    Zhou, Heng
    Zhong, Ping
    Zou, Hui
    [J]. APPLIED SOFT COMPUTING, 2024, 157
  • [6] Multi-Layer Frame Rate Up-Conversion Using Trilateral Filtering for Multi-View Video
    Yang, Yoonmo
    Hong, Jin Hyung
    Oh, Byung Tae
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (02) : 283 - 292
  • [7] Multi-Layer Multi-View Classification for Alzheimer's Disease Diagnosis
    Zhang, Changqing
    Adeli, Ehsan
    Zhou, Tao
    Chen, Xiaobo
    Shen, Dinggang
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4406 - 4413
  • [8] Cross-view and multi-view gait recognitions based on view transformation model using multi-layer perceptron
    Kusakunniran, Worapan
    Wu, Qiang
    Zhang, Jian
    Li, Hongdong
    [J]. PATTERN RECOGNITION LETTERS, 2012, 33 (07) : 882 - 889
  • [9] MULTI-VIEW METRIC LEARNING FOR MULTI-VIEW VIDEO SUMMARIZATION
    Wang, Linbo
    Fang, Xianyong
    Guo, Yanwen
    Fu, Yanwei
    [J]. 2016 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2016, : 179 - 182
  • [10] Interaction Style Recognition Based on Multi-Layer Multi-View Profile Representation
    Wei, Wen-Li
    Lin, Jen-Chun
    Wu, Chung-Hsien
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2017, 8 (03) : 355 - 368