A deep dive into automated sexism detection using fine-tuned deep learning and large language models

Authors
Vetagiri, Advaitha [1]
Pakray, Partha [1]
Das, Amitava [2, 3]
Affiliations
[1] Natl Inst Technol Silchar, Comp Sci & Engn, Silchar 788010, Assam, India
[2] UofSC, Artificial Intelligence Inst, Columbia, SC USA
[3] Wipro AI Lab, Bangalore, Karnataka, India
Keywords
Online sexism; Sexism classification; MultiHate dataset; Machine learning; Deep learning; Convolutional Neural Networks-Bidirectional Long Short-Term Memory; Generative Pre-trained Transformer 2; HATE SPEECH DETECTION; ONLINE
DOI
10.1016/j.engappai.2025.110167
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Sexism in online content has recently become a significant concern. With the growing number of online interactions and the rise of social media platforms, automated techniques to identify and classify sexism are more critical than ever. This paper addresses the problem by fine-tuning deep learning models for sexism classification on "MultiHate", a comprehensive dataset created by curating ten different datasets on sexism. The dataset consists of 1.76 M English texts labelled as sexist or not sexist. Two deep learning models, Convolutional Neural Networks-Bidirectional Long Short-Term Memory and Generative Pre-trained Transformer 2, were fine-tuned on it to detect and classify sexism. A comparative analysis of several machine learning and deep learning models was conducted on the MultiHate dataset. Evaluated with accuracy, precision, recall, and F1 score, the Generative Pre-trained Transformer 2 model outperforms the other models with an accuracy of 92%, while the Convolutional Neural Networks-Bidirectional Long Short-Term Memory model achieves an accuracy of 90%. These results are promising and indicate that automated techniques can classify sexist content effectively. A comprehensive error analysis of the models' performance is presented, highlighting their limitations and challenges. The computational time required to train and test the models remains a significant challenge, especially for larger datasets.
Pages: 17