Multi-label movie genre classification based on multimodal fusion

Authors
Zihui Cai
Hongwei Ding
Jinlu Wu
Ying Xi
Xuemeng Wu
Xiaohui Cui
Affiliations
[1] Wuhan University, Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education, School of Cyber Science and Engineering
Keywords
Multi-label; Movie genre classification; Multimodal fusion; Deep learning
DOI
Not available
Abstract
Determining a movie's genres from its associated information is a challenging multi-label classification task. Previous studies tended to classify movies using only one or two modalities, ignoring other valuable ones. We therefore propose a multimodal movie genre classification framework that jointly considers data from several modalities: the audio, the poster, the plot, and frame sequences from the video. Specifically, the framework processes each modality with deep learning techniques and fuses them through decision-level fusion and intermediate fusion, the latter including concatenation and element-wise sum; this improves classification performance by exploiting the complementary information across modalities. We train and evaluate the proposed framework on the LMTD-9 dataset. The results show that our best multimodal model outperforms state-of-the-art methods by 8.6% in AU(PRC) and 5.3% in AU(PRC)w, indicating that multimodal fusion can effectively improve movie genre classification.
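The three fusion strategies named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the feature size (128), the label count (9), the random stand-in features and logits, the use of a plain average as the decision-level rule, and the 0.5 threshold are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def concat_fusion(features):
    """Intermediate fusion: join per-modality feature vectors end to end."""
    return np.concatenate(features, axis=-1)

def sum_fusion(features):
    """Intermediate fusion: element-wise sum; all vectors must share one shape."""
    return np.sum(np.stack(features, axis=0), axis=0)

def decision_fusion(logits_per_modality):
    """Decision-level fusion: average the per-modality genre probabilities.
    Averaging is an assumed rule; the abstract does not state which one is used."""
    probs = [sigmoid(z) for z in logits_per_modality]
    return np.mean(np.stack(probs, axis=0), axis=0)

rng = np.random.default_rng(0)
modalities = ["audio", "poster", "plot", "frames"]
feat_dim, num_genres = 128, 9  # hypothetical sizes

# Random stand-ins for the outputs of per-modality deep networks.
features = [rng.standard_normal(feat_dim) for _ in modalities]
logits = [rng.standard_normal(num_genres) for _ in modalities]

fused_concat = concat_fusion(features)  # one long vector, 4 * feat_dim entries
fused_sum = sum_fusion(features)        # same size as a single modality vector
genre_probs = decision_fusion(logits)   # one probability per genre

# Multi-label prediction: each genre is thresholded independently.
predicted = genre_probs >= 0.5
```

Concatenation keeps every modality's features separate at the cost of a larger input to the classifier, while the element-wise sum keeps the dimension fixed but requires all modalities to be projected to the same size first. Decision-level fusion instead combines the outputs of classifiers that were each fed a single modality.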
Pages: 36823-36840
Number of pages: 17