Movie genre classification: A multi-label approach based on convolutions through time

被引:54
|
作者
Wehrmann, Jonatas [1 ]
Barros, Rodrigo C. [1 ]
机构
[1] Pontificia Univ Catolica Rio Grande do Sul, Machine Intelligence & Robot Res Grp, Av Ipiranga 6681, BR-90619900 Porto Alegre, RS, Brazil
关键词
Movie genre classification; Convolutional neural networks; Convolutions through time; Multi-label classification; FEATURES;
D O I
10.1016/j.asoc.2017.08.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of labeling movies according to their corresponding genre is a challenging classification problem, having in mind that genre is an immaterial feature that cannot be directly pinpointed in any of the movie frames. Hence, off-the-shelf image classification approaches are not capable of handling this task in a straightforward fashion. Moreover, movies may belong to multiple genres at the same time, making movie genre assignment a typical multi-label classification problem, which is per se much more challenging than standard single-label classification. In this paper, we propose a novel deep neural architecture based on convolutional neural networks(ConvNets) for performing multi-label movie-trailer genre classification. It encapsulates an ultra-deep ConvNet with residual connections, and it makes use of a special convolutional layer to extract temporal information from image-based features prior to performing the mapping of movie trailers to genres. We compare the proposed approach with the current state-of-theart methods for movie classification that employ well-known image descriptors and other low-level handcrafted features. Results show that our method substantially outperforms the state-of-the-art for this task, improving classification performance for all movie genres. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:973 / 982
页数:10
相关论文
共 50 条
  • [1] A multimodal approach for multi-label movie genre classification
    Rafael B. Mangolin
    Rodolfo M. Pereira
    Alceu S. Britto
    Carlos N. Silla
    Valéria D. Feltrim
    Diego Bertolini
    Yandre M. G. Costa
    Multimedia Tools and Applications, 2022, 81 : 19071 - 19096
  • [2] A multimodal approach for multi-label movie genre classification
    Mangolin, Rafael B.
    Pereira, Rodolfo M.
    Britto, Alceu S., Jr.
    Silla, Carlos N., Jr.
    Feltrim, Valeria D.
    Bertolini, Diego
    Costa, Yandre M. G.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (14) : 19071 - 19096
  • [3] Multi-label movie genre classification based on multimodal fusion
    Cai, Zihui
    Ding, Hongwei
    Wu, Jinlu
    Xi, Ying
    Wu, Xuemeng
    Cui, Xiaohui
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 36823 - 36840
  • [4] Multi-label movie genre classification based on multimodal fusion
    Zihui Cai
    Hongwei Ding
    Jinlu Wu
    Ying Xi
    Xuemeng Wu
    Xiaohui Cui
    Multimedia Tools and Applications, 2024, 83 : 36823 - 36840
  • [5] A multi-label movie genre classification scheme based on the movie’s subtitles
    Nikhil Kumar Rajput
    Bhavya Ahuja Grover
    Multimedia Tools and Applications, 2022, 81 : 32469 - 32490
  • [6] A multi-label movie genre classification scheme based on the movie's subtitles
    Rajput, Nikhil Kumar
    Grover, Bhavya Ahuja
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 32469 - 32490
  • [7] Evaluating multimodal strategies for multi-label movie genre classification
    Paulino, Marco Aurelio D.
    Costa, Yandre M. G.
    Feltrim, Valeria D.
    2022 29TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 2022,
  • [8] A Turkish Topic Modeling Dataset For Multi-label Classification of Movie Genre
    Jabrayilzade, Elgun
    Arslan, Algin Poyraz
    Para, Hasan
    Polatbilek, Ozan
    Sezerer, Erhan
    Tekir, Selma
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [9] Video Representation Fusion Network For Multi-Label Movie Genre Classification
    Bi, Tianyu
    Jarnikov, Dmitri
    Lukkien, Johan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9386 - 9391
  • [10] Enhancing Multi-Label Music Genre Classification Through Ensemble Techniques
    Sanden, Chris
    Zhang, John Z.
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 705 - 714