Popularity prediction of movies: from statistical modeling to machine learning techniques

Authors
Syed Muhammad Raza Abidi
Yonglin Xu
Jianyue Ni
Xiangmeng Wang
Wu Zhang
Affiliations
[1] Shanghai University, School of Computer Engineering and Science
[2] Shanghai University, Shanghai Institute of Applied Mathematics and Mechanics
Source
Multimedia Tools and Applications | 2020 / Vol. 79
Keywords
Movie popularity; Machine learning; Movie success; Regression; IMDb; Supervised learning;
DOI
Not available
Abstract
Film industries around the world rapidly produce several hundred movies and attract audiences of all ages. Every movie producer is keenly interested in knowing which movies are likely to succeed or flop at the box office, so early prediction of a movie's popularity is of great importance to the film industry. In this study, we examine the factors and hidden patterns that make a movie popular. Past studies applied machine learning techniques to blog articles, social networks, and social media to predict the success of a movie. Those works focused on which algorithms predict movie success best, and paid less attention to the data and attributes related to the movie itself, considered from various directions. In this paper, we examine this perspective, which may be relevant to the prediction results. Data were collected from the publicly available Internet Movie Database (IMDb). We implemented five machine learning algorithms, namely Generalized Linear Model (GLM), Deep Learning (DL), Decision Tree (DT), Random Forest (RF), and Gradient Boosted Tree (GBT), using Root Mean Squared Error (RMSE) as the performance metric, and obtained performance scores of GLM: 47.9%, DL: 51.1%, DT: 54.5%, RF: 50.0%, and GBT: 49.5%, respectively. We found that GLM is the best-performing regression model because it has the lowest RMSE, and a lower RMSE is better.
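The model comparison described in the abstract rests on RMSE, where the model with the smallest error is preferred. The following is a minimal sketch of that selection rule, not the authors' implementation. The helper names `rmse` and `best_model` are hypothetical, and treating the reported percentages as error scores where lower is better is an assumption drawn from the abstract's closing sentence.

```python
import math


def rmse(y_true, y_pred):
    """Root Mean Squared Error between two equal-length sequences."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def best_model(scores):
    """Return the model name with the lowest error score (hypothetical helper)."""
    return min(scores, key=scores.get)


# Scores as listed in the abstract, assumed here to be error values
# (lower is better), following the abstract's own conclusion.
reported = {"GLM": 47.9, "DL": 51.1, "DT": 54.5, "RF": 50.0, "GBT": 49.5}
```

Under that assumption, `best_model(reported)` selects GLM, which matches the conclusion stated in the abstract.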
Pages: 35583-35617
Page count: 34