Multi-label movie genre classification based on multimodal fusion

被引:0
|
作者
Zihui Cai
Hongwei Ding
Jinlu Wu
Ying Xi
Xuemeng Wu
Xiaohui Cui
机构
[1] Wuhan University,Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education, School of Cyber Science and Engineering
来源
关键词
Multi-label; Movie genre classification; Multimodal fusion; Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Determining the genre of a movie based on its relevant information is a challenging multi-label classification task. Previous studies tended to classify movies based on only one or two modalities, ignoring some valuable modalities. Considering this, we propose a multimodal movie genre classification framework which comprehensively considers the data from different modalities including the audio, poster, plot and frame sequences from video. To be specific, it processes the data from various modalities with the help of deep learning technologies, and fuses them in the way of decision-level fusion and intermediate fusion including concatenation and element-wise sum, which can improve the classification performance due to making full use of the information complementarity between multiple modalities. We train and evaluate the proposed framework on the LMTD-9 dataset. The results show that our best multimodal model outperforms state-of-the-art methods by 8.6% improvement in AU(PRC) and 5.3% improvement in AU(PRC)w. It can be seen that the performance of movie genre classification can be effectively improved by means of multimodal fusion.
引用
收藏
页码:36823 / 36840
页数:17
相关论文
共 50 条
  • [41] A Multi-Instance Multi-Label Scene Classification Method based on Multi-Kernel Fusion
    Chen Tong-tong
    Liu Chan-juan
    Zou Hai-lin
    Zhou Shu-sen
    Liu Ying
    Ding Xin-miao
    2015 SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2015, : 782 - 787
  • [42] Label Relevance Based Multi-Label Scratch Classification Algorithm
    Peng C.
    Sun Y.
    Qi P.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2019, 42 (06): : 134 - 141
  • [43] Gradient-Based Label Binning in Multi-label Classification
    Rapp, Michael
    Mencia, Eneldo Loza
    Furnkranz, Johannes
    Hullermeier, Eyke
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III, 2021, 12977 : 462 - 477
  • [44] Multi-Label Text Classification Based on DistilBERT and Label Correlation
    Wang, Xuyang
    Geng, Liuqing
    Zhang, Xin
    Computer Engineering and Applications, 2024, 60 (23) : 168 - 175
  • [45] Optimal Fusion Rules for Multi-label Fusion of Independent Classification System Families
    Fitch, James A.
    Oxley, Mark E.
    Kabban, Christine M. Schubert
    SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXIV, 2015, 9474
  • [46] Unconstrained Multimodal Multi-Label Learning
    Huang, Yan
    Wang, Wei
    Wang, Liang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (11) : 1923 - 1935
  • [47] DBMF-Net: A Dual-Branch Multimodal Fusion Network for Multi-label Sewer Defect Classification
    Chen, Ziyang
    Wan, Lin
    PATTERN RECOGNITION AND COMPUTER VISION, PT IX, PRCV 2024, 2025, 15039 : 437 - 451
  • [48] Research on Micro-video Multi-Label Classification Based on Deep Multimodal Association Learning
    Li, Yun
    Lu, Zhixiang
    Liu, Shuyi
    Wang, Su
    Lü, Zimin
    Jing, Peiguang
    Data Analysis and Knowledge Discovery, 2024, 8 (07) : 77 - 88
  • [49] MLCE: A Multi-Label Crotch Ensemble Method for Multi-Label Classification
    Yao, Yuan
    Li, Yan
    Ye, Yunming
    Li, Xutao
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (04)
  • [50] A multi-stage multi-modal learning algorithm with adaptive multimodal fusion for improving multi-label skin lesion classification
    Zuo, Lihan
    Wang, Zizhou
    Wang, Yan
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2025, 162