Multimodal movie genre classification using recurrent neural network

被引:0
|
作者
Tina Behrouzi
Ramin Toosi
Mohammad Ali Akhaee
机构
[1] University of Toronto,Department of Electrical and Computer Engineering
[2] College of Engineering,School of Electrical and Computer Engineering
[3] University of Tehran,undefined
来源
关键词
Movie genre detection; Multi-label classification; Gated recurrent unit (GRU); Long short-term memory (LSTM); 1D Convolutional neural network (1D_Conv);
D O I
暂无
中图分类号
学科分类号
摘要
Genre is one of the features of a movie that defines its structure and type of audience. The number of streaming companies interested in automatically deriving movies’ genres is rapidly increasing. Genre categorization of trailers is a challenging problem because of the conceptual nature of the genre, which is not presented physically within a frame and can only be perceived by the whole trailer. Moreover, several genres may appear in the movie at the same time. The multi-label learning algorithms have not been improved as significantly as the single-label classification models, which causes the genre categorization problem to be highly complicated. In this paper, we propose a novel multi-modal deep recurrent model for movie genre classification. A new structure based on Gated Recurrent Unit (GRU) is designed to derive spatial-temporal features of movie frames. The video features are then concatenated with the audio features to predict the final genres of the movie. The proposed design outperforms the state-of-art models based on accuracy and computational cost and substantially improves the movie genre classifier system’s performance.
引用
收藏
页码:5763 / 5784
页数:21
相关论文
共 50 条
  • [1] Multimodal movie genre classification using recurrent neural network
    Behrouzi, Tina
    Toosi, Ramin
    Akhaee, Mohammad Ali
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (04) : 5763 - 5784
  • [2] Multimodal PLSA for Movie Genre Classification
    Hong, Hao-Zhi
    Hwang, Jen-Ing G.
    [J]. MULTIPLE CLASSIFIER SYSTEMS (MCS 2015), 2015, 9132 : 159 - 167
  • [3] Movie Poster Genre Classification with Convolutional Neural Network
    Marcellus
    Herwindiati, Dyah Erny
    Hendryli, Janson
    [J]. 2021 IEEE SEVENTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2021), 2021, : 74 - 77
  • [4] Music Genre Classification Using Independent Recurrent Neural Network
    Wu, Wenli
    Song, Guangxiao
    Wang, Zhijie
    Han, Fang
    [J]. 2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 192 - 195
  • [5] Movie trailer genre classification using multimodal pretrained features
    Sulun, Serkan
    Viana, Paula
    Davies, Matthew E. P.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [6] Movie Genre Classification with Convolutional Neural Networks
    Simoes, Gabriel S.
    Wehrmann, Jonatas
    Barros, Rodrigo C.
    Ruiz, Duncan D.
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 259 - 266
  • [7] A multimodal approach for multi-label movie genre classification
    Rafael B. Mangolin
    Rodolfo M. Pereira
    Alceu S. Britto
    Carlos N. Silla
    Valéria D. Feltrim
    Diego Bertolini
    Yandre M. G. Costa
    [J]. Multimedia Tools and Applications, 2022, 81 : 19071 - 19096
  • [8] A multimodal approach for multi-label movie genre classification
    Mangolin, Rafael B.
    Pereira, Rodolfo M.
    Britto, Alceu S., Jr.
    Silla, Carlos N., Jr.
    Feltrim, Valeria D.
    Bertolini, Diego
    Costa, Yandre M. G.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (14) : 19071 - 19096
  • [9] Video Genre Classification using Convolutional Recurrent Neural Networks
    Lakshmi, K. Prasanna
    Solanki, Mihir
    Dara, Jyothi Swaroop
    Kompalli, Avinash Bhargav
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (03) : 170 - 176
  • [10] Evaluating multimodal strategies for multi-label movie genre classification
    Paulino, Marco Aurelio D.
    Costa, Yandre M. G.
    Feltrim, Valeria D.
    [J]. 2022 29TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 2022,