Position-context additive transformer-based model for classifying text data on social media

被引:0
|
作者
M. M. Abd-Elaziz [1 ]
Nora El-Rashidy [2 ]
Ahmed Abou Elfetouh [1 ]
Hazem M. El-Bakry [1 ]
机构
[1] Mansoura University,Information Systems Department, Faculty of Computers and Information Sciences
[2] Kaferelshikh University,Machine Learning and Information Retrieval Department, Faculty of Artificial Intelligence
关键词
Social media; Transformer-based model; Word embedding; Bi-LSTM network; Additive attention;
D O I
10.1038/s41598-025-90738-1
中图分类号
学科分类号
摘要
In recent years, the continuous increase in the growth of text data on social media has been a major reason to rely on the pre-training method to develop new text classification models specially transformer-based models that have proven worthwhile in most natural language processing tasks. This paper introduces a new Position-Context Additive transformer-based model (PCA model) that consists of two-phases to increase the accuracy of text classification tasks on social media. Phase I aims to develop a new way to extract text characteristics by paying attention to the position and context of each word in the input layer. This is done by integrating the improved word embedding method (the position) with the developed Bi-LSTM network to increase the focus on the connection of each word with the other words around it (the context). As for phase II, it focuses on the development of a transformer-based model based primarily on improving the additive attention mechanism. The PCA model has been tested for the implementation of the classification of health-related social media texts in 6 data sets. Results showed that performance accuracy was improved by an increase in F1-Score between 0.2 and 10.2% in five datasets compared to the best published results. On the other hand, the performance of PCA model was compared with three transformer-based models that proved high accuracy in classifying texts, and experiments also showed that PCA model overcame the other models in 4 datasets to achieve an improvement in F1-score between 0.1 and 2.1%. The results also led us to conclude a direct correlation between the volume of training data and the accuracy of performance as the increase in the volume of training data positively affects F1-Score improvement.
引用
收藏
相关论文
共 50 条
  • [1] Detection of Depression Severity in Social Media Text Using Transformer-Based Models
    Qasim, Amna
    Mehak, Gull
    Hussain, Nisar
    Gelbukh, Alexander
    Sidorov, Grigori
    Information (Switzerland), 2025, 16 (02)
  • [2] TRANSQL: A Transformer-based Model for Classifying SQL Queries
    Tahmasebi, Shirin
    Payberah, Amir H.
    Paragraph, Ahmet Soylu
    Roman, Dumitru
    Matskin, Mihhail
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 788 - 793
  • [3] Transformer-based deep learning models for the sentiment analysis of social media data
    Kokab, Sayyida Tabinda
    Asghar, Sohail
    Naz, Shehneela
    ARRAY, 2022, 14
  • [4] Transformer-based Context-aware Sarcasm Detection in Conversation Threads from Social Media
    Dong, Xiangjue
    Li, Changmao
    Choi, Jinho D.
    FIGURATIVE LANGUAGE PROCESSING, 2020, : 276 - 280
  • [5] RobuTrans: A Robust Transformer-Based Text-to-Speech Model
    Li, Naihan
    Liu, Yanqing
    Wu, Yu
    Liu, Shujie
    Zhao, Sheng
    Liu, Ming
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8228 - 8235
  • [6] Transformer-Based Extractive Social Media Question Answering on TweetQA
    Butt, Sabur
    Ashraf, Noman
    Fahim, Hammad
    Sidorov, Grigori
    Gelbukh, Alexander
    COMPUTACION Y SISTEMAS, 2021, 25 (01): : 23 - 32
  • [7] Comparison of pretrained transformer-based models for influenza and COVID-19 detection using social media text data in Saskatchewan, Canada
    Tian, Yuan
    Zhang, Wenjing
    Duan, Lujie
    McDonald, Wade
    Osgood, Nathaniel
    FRONTIERS IN DIGITAL HEALTH, 2023, 5
  • [8] A Model for Classifying Emergency Events Based on Social Media Multimodal Data
    Wu, ZhenHua
    Chen, Liangyu
    Song, YuanTao
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2023, PT I, 2023, 14134 : 316 - 327
  • [9] Calibration of Transformer-Based Models for Identifying Stress and Depression in Social Media
    Ilias, Loukas
    Mouzakitis, Spiros
    Askounis, Dimitris
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 1979 - 1990
  • [10] A transformer-based multi-task framework for joint detection of aggression and hate on social media data
    Ghosh, Soumitra
    Priyankar, Amit
    Ekbal, Asif
    Bhattacharyya, Pushpak
    NATURAL LANGUAGE ENGINEERING, 2023, 29 (06) : 1495 - 1515