Leveraging textual information for social media news categorization and sentiment analysis

Cited by: 0
Authors
Hasan, Mahmudul [1]
Ahmed, Tanver [2]
Islam, Md. Rashedul [1]
Uddin, Md. Palash [1]
Affiliations
[1] Hajee Mohammad Danesh Sci & Technol Univ, Dept Comp Sci & Engn, Dinajpur, Bangladesh
[2] Varendra Univ, Dept Comp Sci & Engn, Rajshahi, Bangladesh
Source
PLOS ONE | 2024 / Vol. 19 / Issue 07
Keywords
ALGORITHM;
DOI
10.1371/journal.pone.0307027
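Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code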
07 ; 0710 ; 09 ;
Abstract
The rise of social media has changed how people view connections. Machine Learning (ML)-based sentiment analysis and news categorization help people understand emotions and access news. However, most studies focus on complex models that require heavy resources and have slow inference times, making deployment difficult in resource-limited environments. In this paper, we process both structured and unstructured data and use the TextBlob scheme to determine the polarity, and hence the sentiment, of news headlines. We propose SGDR, a Stochastic Gradient Descent (SGD)-based Ridge classifier (RC), and blend it with an advanced string processing technique to classify news articles effectively. Additionally, we explore existing supervised and unsupervised ML algorithms to gauge the effectiveness of our SGDR classifier. The scalability and generalization capability of SGD, together with the L2 regularization that RCs use to handle overfitting and balance bias and variance, give the proposed SGDR better classification capability. Experimental results show that our string processing pipeline significantly boosts the performance of all ML models. Notably, our ensemble SGDR classifier surpasses all state-of-the-art ML algorithms, achieving 98.12% accuracy. McNemar's significance tests show that the improvement of our SGDR classifier is significant at the 1% level over K-Nearest Neighbor, Decision Tree, and AdaBoost, and at the 5% level over the other algorithms. These findings underscore the superior proficiency of linear models in news categorization compared to tree-based and nonlinear counterparts. This study contributes insights into the efficacy of the proposed methodology and its potential for news categorization and sentiment analysis.
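As an illustration of the significance testing mentioned in the abstract, the following is a minimal sketch of McNemar's test for comparing two classifiers on the same test set. This record does not say which variant the paper uses, so the continuity-corrected chi-square form is an assumption here, and the function name `mcnemar_test` and the toy predictions are hypothetical, not taken from the paper.

```python
import math


def mcnemar_test(y_true, pred_a, pred_b):
    """McNemar's test with continuity correction (assumed variant).

    Returns (b, c, statistic, p_value), where
      b = samples classifier A gets right and classifier B gets wrong,
      c = samples classifier A gets wrong and classifier B gets right.
    """
    triples = list(zip(y_true, pred_a, pred_b))
    b = sum(1 for t, a, o in triples if a == t and o != t)
    c = sum(1 for t, a, o in triples if a != t and o == t)
    if b + c == 0:
        # The classifiers never disagree on correctness: no evidence of a difference.
        return b, c, 0.0, 1.0
    statistic = (abs(b - c) - 1) ** 2 / (b + c)
    # Survival function of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return b, c, statistic, p_value


# Toy data (hypothetical): 30 samples, all with true label 1.
y_true = [1] * 30
pred_a = [1] * 15 + [0] * 3 + [1] * 12   # classifier A
pred_b = [0] * 15 + [1] * 3 + [1] * 12   # classifier B

b, c, statistic, p_value = mcnemar_test(y_true, pred_a, pred_b)
print(f"b={b}, c={c}, statistic={statistic:.3f}, p={p_value:.4f}")
```

A p-value below 0.01 or 0.05 would correspond to a difference that is significant at the 1% or 5% level, respectively. Only the discordant pairs (b and c) enter the statistic, which is why the test suits paired predictions on a shared test set.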
Pages: 28
Related papers
50 items in total
  • [11] Integrating Visual and Textual Affective Descriptors for Sentiment Analysis of Social Media Posts
    Dai, Shuanglu
    Man, Hong
    [J]. IEEE 1ST CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2018), 2018, : 13 - 18
  • [12] Sensitivity to sentiment: News vs social media
    Gan, Baoqing
    Alexeev, Vitali
    Bird, Ron
    Yeung, Danny
    [J]. INTERNATIONAL REVIEW OF FINANCIAL ANALYSIS, 2020, 67
  • [13] Leveraging semantics for sentiment polarity detection in social media
    Dridi, Amna
    Recupero, Diego Reforgiato
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (08) : 2045 - 2055
  • [15] Combining Textual Cues with Social Clues: Utilizing Social Features to Improve Sentiment Analysis in Social Media
    Ilk, Noyan
    Fan, Shaokun
    [J]. DECISION SCIENCES, 2022, 53 (02) : 320 - 347
  • [16] Hybrid sentiment analysis with textual and interactive information
    Wen, Jiahui
    Huang, Anwen
    Zhong, Mingyang
    Ma, Jingwei
    Wei, Youcai
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [17] A comprehensive review of visual-textual sentiment analysis from social media networks
    Al-Tameemi, Israa Khalaf Salman
    Feizi-Derakhshi, Mohammad-Reza
    Pashazadeh, Saeed
    Asadpour, Mohammad
    [J]. JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2024,
  • [18] Social Sensing and Sentiment Analysis: Using Social Media as Useful Information Source
    Ducange, Pietro
    Fazzolari, Michela
    [J]. PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON SMART SYSTEMS AND TECHNOLOGIES (SST), 2017, : 301 - 306
  • [19] Real estate media sentiment through textual analysis
    Ruscheinsky, Jessica Roxanne
    Lang, Marcel
    Schaefers, Wolfgang
    [J]. JOURNAL OF PROPERTY INVESTMENT & FINANCE, 2018, 36 (05) : 410 - 428
  • [20] Sentiment Analysis for Social Media
    Iglesias, Carlos A.
    Moreno, Antonio
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (23):