Multi-label movie genre classification based on multimodal fusion

被引：0

作者：

Zihui Cai

Hongwei Ding

Jinlu Wu

Ying Xi

Xuemeng Wu

Xiaohui Cui

机构：

[1] Wuhan University,Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education, School of Cyber Science and Engineering

来源：

Multimedia Tools and Applications | 2024年 / 83卷

关键词：

Multi-label; Movie genre classification; Multimodal fusion; Deep learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Determining the genre of a movie based on its relevant information is a challenging multi-label classification task. Previous studies tended to classify movies based on only one or two modalities, ignoring some valuable modalities. Considering this, we propose a multimodal movie genre classification framework which comprehensively considers the data from different modalities including the audio, poster, plot and frame sequences from video. To be specific, it processes the data from various modalities with the help of deep learning technologies, and fuses them in the way of decision-level fusion and intermediate fusion including concatenation and element-wise sum, which can improve the classification performance due to making full use of the information complementarity between multiple modalities. We train and evaluate the proposed framework on the LMTD-9 dataset. The results show that our best multimodal model outperforms state-of-the-art methods by 8.6% improvement in AU(PRC) and 5.3% improvement in AU(PRC)w. It can be seen that the performance of movie genre classification can be effectively improved by means of multimodal fusion.

引用

页码：36823 / 36840

页数：17

共 50 条

[41] A Multi-Instance Multi-Label Scene Classification Method based on Multi-Kernel Fusion
Chen Tong-tong
Liu Chan-juan
Zou Hai-lin
Zhou Shu-sen
Liu Ying
Ding Xin-miao
2015 SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2015, : 782 - 787
[42] Label Relevance Based Multi-Label Scratch Classification Algorithm
Peng C.
Sun Y.
Qi P.
Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2019, 42 (06): : 134 - 141
[43] Gradient-Based Label Binning in Multi-label Classification
Rapp, Michael
Mencia, Eneldo Loza
Furnkranz, Johannes
Hullermeier, Eyke
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III, 2021, 12977 : 462 - 477
[44] Multi-Label Text Classification Based on DistilBERT and Label Correlation
Wang, Xuyang
Geng, Liuqing
Zhang, Xin
Computer Engineering and Applications, 2024, 60 (23) : 168 - 175
[45] Optimal Fusion Rules for Multi-label Fusion of Independent Classification System Families
Fitch, James A.
Oxley, Mark E.
Kabban, Christine M. Schubert
SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXIV, 2015, 9474
[46] Unconstrained Multimodal Multi-Label Learning
Huang, Yan
Wang, Wei
Wang, Liang
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (11) : 1923 - 1935
[47] DBMF-Net: A Dual-Branch Multimodal Fusion Network for Multi-label Sewer Defect Classification
Chen, Ziyang
Wan, Lin
PATTERN RECOGNITION AND COMPUTER VISION, PT IX, PRCV 2024, 2025, 15039 : 437 - 451
[48] Research on Micro-video Multi-Label Classification Based on Deep Multimodal Association Learning
Li, Yun
Lu, Zhixiang
Liu, Shuyi
Wang, Su
Lü, Zimin
Jing, Peiguang
Data Analysis and Knowledge Discovery, 2024, 8 (07) : 77 - 88
[49] MLCE: A Multi-Label Crotch Ensemble Method for Multi-Label Classification
Yao, Yuan
Li, Yan
Ye, Yunming
Li, Xutao
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (04)
[50] A multi-stage multi-modal learning algorithm with adaptive multimodal fusion for improving multi-label skin lesion classification
Zuo, Lihan
Wang, Zizhou
Wang, Yan
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2025, 162

← 1 2 3 4 5 →