Machine learning-based new approach to films review

被引:3
|
作者
Jassim, Mustafa Abdalrassual [1 ,2 ,3 ]
Abd, Dhafar Hamed [4 ]
Omri, Mohamed Nazih [1 ]
机构
[1] Univ Sousse, MARS Res Lab, Sousse, Tunisia
[2] Univ Monastir, Monastir Fac Sci, Monastir, Tunisia
[3] Al Muthanna Univ, Samawah, Iraq
[4] Al Maaref Univ Coll, Dept Comp Sci, Alanbar, Iraq
关键词
Sentiment analysis; Movie review; Machine learning; Word selection; Decision-making; Text analysis; Data science; SENTIMENT ANALYSIS; FUZZY TOPSIS; SELECTION;
D O I
10.1007/s13278-023-01042-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The main purpose of Sentiment Analysis (SA) is to derive useful insights from large amounts of unstructured data compiled from various sources. This analysis helps to interpret and classify textual data using different techniques applied in machine learning (ML) models. In this paper, we compared simple and ensemble ML methods as classifiers for SA: Random Forest, K-Nearest Neighbor, Artificial Neural Network, Gradient Boosting, Support Vector Machine (SVM), AdaBoost, Extreme Gradient Boosting, Decision Tree, Light GBM, Stochastic Gradient Descent and Bagging. For this, we considered a test set database of 50,000 movie reviews, of which 25,000 were rated positive and 25,000 negatives. We have chosen 20,000 words that have an impact on the feelings of the documents. This work aims to propose a new rating prediction approach based on a textual customer review. We consider term frequency characteristics and term frequency-inverse document frequency from the large-scale and serial trials to compare the results obtained by various classifiers using feature extraction techniques. For the decision phase, we applied the Fuzzy Decision by Opinion Score Method, one of the most recent methods for multi-criteria decision-making. To evaluate and quantify the performance of the different ML methods we considered, we apply six standard measures namely precision, accuracy, recall, F-score, AUC, and Kappa-measure. The results we obtained, at the end of the experimental work that we conducted, indicated that the SVM classier is the best with 88,333% as a precision rate followed by the FDOSM method, with 0.800 for the same measurement.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Machine learning-based new approach to films review
    Mustafa Abdalrassual Jassim
    Dhafar Hamed Abd
    Mohamed Nazih Omri
    [J]. Social Network Analysis and Mining, 13
  • [2] Inspection by exception: A new machine learning-based approach for multistage manufacturing
    Papananias, Moschos
    McLeay, Thomas E.
    Obajemu, Olusayo
    Mahfouf, Mahdi
    Kadirkamanathan, Visakan
    [J]. APPLIED SOFT COMPUTING, 2020, 97
  • [3] Machine learning-based approach to GPS antijamming
    Wang, Cheng-Zhen
    Kong, Ling-Wei
    Jiang, Junjie
    Lai, Ying-Cheng
    [J]. GPS SOLUTIONS, 2021, 25 (03)
  • [4] A Machine Learning-based Approach for Groundwater Mapping
    Zzaman, Rashed Uz
    Nowreen, Sara
    Khan, Irtesam Mahmud
    Islam, Md Rajibul
    Ibtehaz, Nabil
    Rahman, M. Saifur
    Zahid, Anwar
    Farzana, Dilruba
    Sharmin, Afroza
    Rahman, M. Sohel
    [J]. NATURAL RESOURCES RESEARCH, 2022, 31 (01) : 281 - 299
  • [5] A Machine Learning-based Approach for Groundwater Mapping
    Rashed Uz Zzaman
    Sara Nowreen
    Irtesam Mahmud Khan
    Md. Rajibul Islam
    Nabil Ibtehaz
    M. Saifur Rahman
    Anwar Zahid
    Dilruba Farzana
    Afroza Sharmin
    M. Sohel Rahman
    [J]. Natural Resources Research, 2022, 31 : 281 - 299
  • [6] Machine learning-based approach to GPS antijamming
    Cheng-Zhen Wang
    Ling-Wei Kong
    Junjie Jiang
    Ying-Cheng Lai
    [J]. GPS Solutions, 2021, 25
  • [7] New Results on Machine Learning-Based Distinguishers
    Baksi, Anubhab
    Breier, Jakub
    Dasu, Vishnu Asutosh
    Hou, Xiaolu
    Kim, Hyunji
    Seo, Hwajeong
    [J]. IEEE ACCESS, 2023, 11 : 54175 - 54187
  • [8] A New Machine Learning-Based Complementary Approach for Screening of NAFLD (Hepatic Steatosis)
    Panigrahi, Suranjan
    Deo, Ridhi
    Liechty, Edward A.
    [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 2343 - 2346
  • [9] A New Approach for Machine Learning-Based Fault Detection and Classification in Power Systems
    Tokel, Mil Alper
    Al Halaseh, Rana
    Alirezaei, Gholamreza
    Mathar, Rudolf
    [J]. 2018 IEEE POWER & ENERGY SOCIETY INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE (ISGT), 2018,
  • [10] Machine Learning-Based Approach for Fake News Detection
    Gururaj H.L.
    Lakshmi H.
    Soundarya B.C.
    Flammini F.
    Janhavi V.
    [J]. Journal of ICT Standardization, 2022, 10 (04): : 509 - 530