Leveraging textual information for social media news categorization and sentiment analysis

被引:0
|
作者
Hasan, Mahmudul [1 ]
Ahmed, Tanver [2 ]
Islam, Md. Rashedul [1 ]
Uddin, Md. Palash [1 ]
机构
[1] Hajee Mohammad Danesh Sci & Technol Univ, Dept Comp Sci & Engn, Dinajpur, Bangladesh
[2] Varendra Univ, Dept Comp Sci & Engn, Rajshahi, Bangladesh
来源
PLOS ONE | 2024年 / 19卷 / 07期
关键词
ALGORITHM;
D O I
10.1371/journal.pone.0307027
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The rise of social media has changed how people view connections. Machine Learning (ML)-based sentiment analysis and news categorization help understand emotions and access news. However, most studies focus on complex models requiring heavy resources and slowing inference times, making deployment difficult in resource-limited environments. In this paper, we process both structured and unstructured data, determining the polarity of text using the TextBlob scheme to determine the sentiment of news headlines. We propose a Stochastic Gradient Descent (SGD)-based Ridge classifier (RC) for blending SGDR with an advanced string processing technique to effectively classify news articles. Additionally, we explore existing supervised and unsupervised ML algorithms to gauge the effectiveness of our SGDR classifier. The scalability and generalization capability of SGD and L2 regularization techniques in RCs to handle overfitting and balance bias and variance provide the proposed SGDR with better classification capability. Experimental results highlight that our string processing pipeline significantly boosts the performance of all ML models. Notably, our ensemble SGDR classifier surpasses all state-of-the-art ML algorithms, achieving an impressive 98.12% accuracy. McNemar's significance tests reveal that our SGDR classifier achieves a 1% significance level improvement over K-Nearest Neighbor, Decision Tree, and AdaBoost and a 5% significance level improvement over other algorithms. These findings underscore the superior proficiency of linear models in news categorization compared to tree-based and nonlinear counterparts. This study contributes valuable insights into the efficacy of the proposed methodology, elucidating its potential for news categorization and sentiment analysis.
引用
收藏
页数:28
相关论文
共 50 条
  • [41] Sentiment Analysis for Social Media Images
    Wang, Yilin
    Li, Baoxin
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 1584 - 1591
  • [42] Monitoring travel-related information on Social Media through sentiment analysis
    Gonzalez-Rodriguez, M. R.
    Martinez-Torres, M. R.
    Toral, S. L.
    [J]. 2014 IEEE/ACM 7TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2014, : 636 - 641
  • [43] Developing Turkish Sentiment Lexicon for Sentiment Analysis Using Online News Media
    Saglam, Fatih
    Sever, Hayri
    Genc, Burkay
    [J]. 2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [44] Leveraging News Sentiment to Improve Microblog Sentiment Classification in the Financial Domain
    Daudert, Tobias
    Buitelaar, Paul
    Negi, Sapna
    [J]. ECONOMICS AND NATURAL LANGUAGE PROCESSING (ECONLP 2018), 2018, : 49 - 54
  • [45] Sentiment Analysis as a Service: A social media based sentiment analysis framework
    Ali, Kashif
    Dong, Hai
    Bouguettaya, Athman
    Erradi, Abdelkarim
    Hadjidj, Rachid
    [J]. 2017 IEEE 24TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2017), 2017, : 660 - 667
  • [46] News Media, Inflation, and Sentiment
    Macaulay, Alistair
    Song, Wenting
    [J]. AEA PAPERS AND PROCEEDINGS, 2023, 113 : 172 - 176
  • [47] Predicting Sentiment toward Transportation in Social Media using Visual and Textual Features
    Giancristofaro, Gabriel T.
    Panangadan, Anand
    [J]. 2016 IEEE 19TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2016, : 2113 - 2118
  • [48] Quantifying the effect of sentiment on information diffusion in social media
    Ferrara, Emilio
    Yang, Zeyao
    [J]. PEERJ COMPUTER SCIENCE, 2015,
  • [49] Digital Luxury Fashion Shows: Leveraging Interactive Marketing Opportunities Through Social Media Sentiment Analysis
    Farah, Maya F.
    Ramadan, Zahy
    Sammouri, Wissam
    Tawk, Patricia
    [J]. ADVANCES IN DIGITAL MARKETING AND ECOMMERCE, DMEC 2024, 2024, : 23 - 30
  • [50] Correction to: Fake news detection in social media based on sentiment analysis using classifier techniques
    Sarita V. Balshetwar
    Abilash RS
    Dani Jermisha R
    [J]. Multimedia Tools and Applications, 2023, 82 : 35813 - 35813