A deep dive into automated sexism detection using fine-tuned deep learning and large language models

被引:0
|
作者
Vetagiri, Advaitha [1 ]
Pakray, Partha [1 ]
Das, Amitava [2 ,3 ]
机构
[1] Natl Inst Technol Silchar, Comp Sci & Engn, Silchar 7 88010, Assam, India
[2] UofSC, Artificial Intelligence Inst, Columbia, SC USA
[3] Wipro AI Lab, Bangalore, Karnataka, India
关键词
Online sexism; Sexism classification; MultiHate dataset; Machine learning; Deep learning; Convolutional Neural Networks-Bidirectional; Long Short-Term Memory; Generative Pre-trained Transformer 2; HATE SPEECH DETECTION; ONLINE;
D O I
10.1016/j.engappai.2025.110167
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The issue of sexism in online content has recently been a significant concern. With the increasing number of online interactions and the rise of social media platforms, the need for automated techniques to identify and classify sexism has become more critical than ever. This paper addresses this problem by fine-tuning deep-learning models for sexism classification using "MultiHate". It is a comprehensive dataset created by curating ten different datasets on sexism. The dataset consists of 1.76 M English texts labelled as sexist and not sexist, then fine-tuned two deep learning models, Convolutional Neural Networks-Bidirectional Long Short-Term Memory and Generative Pre-trained Transformer 2, which accurately detect and classify sexism. A comparative analysis has been conducted on several machine learning and deep learning models using the MultiHate dataset. Investigation reveals that the Generative Pre-trained Transformer 2 model outperforms other models with an accuracy of 92%, while the Convolutional Neural Networks-Bidirectional Long Short-Term Memory model achieved an accuracy of 90% using precision, recall, and F1 scores as performance metrics. The models' performances are promising, indicating that automated techniques can be employed to classify sexist content effectively. A comprehensive error analysis of the models' performance has been presented, highlighting their limitations and challenges. The computational time required for training and testing the models is a significant challenge, especially for larger datasets.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] The Impact of AUTOGEN and Similar Fine-Tuned Large Language Models on the Integrity of Scholarly Writing
    Resnik, David B.
    Hosseini, Mohammad
    AMERICAN JOURNAL OF BIOETHICS, 2023, 23 (10): : 50 - 52
  • [42] Comparative Analysis of Generic and Fine-Tuned Large Language Models for Conversational Agent Systems
    Villa, Laura
    Carneros-Prado, David
    Dobrescu, Cosmin C.
    Sanchez-Miguel, Adrian
    Cubero, Guillermo
    Hervas, Ramon
    ROBOTICS, 2024, 13 (05)
  • [43] Automated labeling of training data for improved object detection in traffic videos by fine-tuned deep convolutional neural networks
    Garcia-Aguilar, Ivan
    Garcia-Gonzalez, Jorge
    Luque-Baena, Rafael Marcos
    Lopez-Rubio, Ezequiel
    PATTERN RECOGNITION LETTERS, 2023, 167 : 45 - 52
  • [44] A fine-tuned YOLOv5 deep learning approach for real-time house number detection
    Taşyürek M.
    Öztürk C.
    PeerJ Computer Science, 2023, 9
  • [45] A fine-tuned YOLOv5 deep learning approach for real-time house number detection
    Tasyurek, Murat
    Ozturk, Celal
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [46] Empowering COVID-19 detection: Optimizing performance through fine-tuned EfficientNet deep learning architecture
    Talukder, Md. Alamin
    Abu Layek, Md.
    Kazi, Mohsin
    Uddin, Md. Ashraf
    Aryal, Sunil
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 168
  • [47] Simply Fine-Tuned Deep Learning-Based Classification for Breast Cancer with Mammograms
    Mudeng, Vicky
    Jeong, Jin-woo
    Choe, Se-woon
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (03): : 4677 - 4693
  • [48] An Ensemble of Fine-Tuned Deep Learning Networks for Wet-Blue Leather Segmentation
    Aslam, Masood
    Khan, Tariq M.
    Naqvi, Syed Saud
    Holmes, Geoff
    JOURNAL OF THE AMERICAN LEATHER CHEMISTS ASSOCIATION, 2022, 117 (04): : 164 - 170
  • [49] HADE: Exploiting Human Action Recognition Through Fine-Tuned Deep Learning Methods
    Karim, Misha
    Khalid, Shah
    Aleryani, Aliya
    Tairan, Nasser
    Ali, Zafar
    Ali, Farman
    IEEE ACCESS, 2024, 12 : 42769 - 42790
  • [50] Supervised fine-tuned approach for automated detection of diabetic retinopathy
    Kriti Ohri
    Mukesh Kumar
    Multimedia Tools and Applications, 2024, 83 : 14259 - 14280