A deep dive into automated sexism detection using fine-tuned deep learning and large language models

被引：0

作者：

Vetagiri, Advaitha ^{[1
]}

Pakray, Partha ^{[1
]}

Das, Amitava ^{[2
,3
]}

机构：

[1] Natl Inst Technol Silchar, Comp Sci & Engn, Silchar 7 88010, Assam, India

[2] UofSC, Artificial Intelligence Inst, Columbia, SC USA

[3] Wipro AI Lab, Bangalore, Karnataka, India

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2025年 / 145卷

关键词：

Online sexism; Sexism classification; MultiHate dataset; Machine learning; Deep learning; Convolutional Neural Networks-Bidirectional; Long Short-Term Memory; Generative Pre-trained Transformer 2; HATE SPEECH DETECTION; ONLINE;

D O I：

10.1016/j.engappai.2025.110167

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The issue of sexism in online content has recently been a significant concern. With the increasing number of online interactions and the rise of social media platforms, the need for automated techniques to identify and classify sexism has become more critical than ever. This paper addresses this problem by fine-tuning deep-learning models for sexism classification using "MultiHate". It is a comprehensive dataset created by curating ten different datasets on sexism. The dataset consists of 1.76 M English texts labelled as sexist and not sexist, then fine-tuned two deep learning models, Convolutional Neural Networks-Bidirectional Long Short-Term Memory and Generative Pre-trained Transformer 2, which accurately detect and classify sexism. A comparative analysis has been conducted on several machine learning and deep learning models using the MultiHate dataset. Investigation reveals that the Generative Pre-trained Transformer 2 model outperforms other models with an accuracy of 92%, while the Convolutional Neural Networks-Bidirectional Long Short-Term Memory model achieved an accuracy of 90% using precision, recall, and F1 scores as performance metrics. The models' performances are promising, indicating that automated techniques can be employed to classify sexist content effectively. A comprehensive error analysis of the models' performance has been presented, highlighting their limitations and challenges. The computational time required for training and testing the models is a significant challenge, especially for larger datasets.

引用

页数：17

共 50 条

[21] Exploring Memorization in Fine-tuned Language Models
Zeng, Shenglai
Li, Yaxin
Ren, Jie
Liu, Yiding
Xu, Han
He, Pengfei
Xing, Yue
Wang, Shuaiqiang
Tang, Jiliang
Yin, Dawei
PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 3917 - 3948
[22] DEEP LINE ENGINEERING FINE-TUNED BY TRANSMED
BENEDINI, G
BERTI, A
PIPELINE & GAS JOURNAL, 1983, 210 (04) : 46 - &
[23] Fingerprinting Fine-tuned Language Models in the Wild
Diwan, Nirav
Chakravorty, Tanmoy
Shafiq, Zubair
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4652 - 4664
[24] Accelerating aerodynamic simulations with a hybrid fine-tuned deep learning model
Li, Jiahui
Zhang, Xiaoya
Peng, Wei
Liu, Xu
Wang, Wenhui
Yao, Wen
PHYSICS OF FLUIDS, 2024, 36 (11)
[25] Enhancing parasitic organism detection in microscopy images through deep learning and fine-tuned optimizer
Kumar, Yogesh
Garg, Pertik
Moudgil, Manu Raj
Singh, Rupinder
Wozniak, Marcin
Shafi, Jana
Ijaz, Muhammad Fazal
SCIENTIFIC REPORTS, 2024, 14 (01)
[26] Fine-tuned deep neural networks for polyp detection in colonoscopy images
Ellahyani A.
Jaafari I.E.
Charfi S.
Ansari M.E.
Personal and Ubiquitous Computing, 2023, 27 (02) : 235 - 247
[27] Deciphering language disturbances in schizophrenia: A study using fine-tuned language models
Li, Renyu
Cao, Minne
Fu, Dawei
Wei, Wei
Wang, Dequan
Yuan, Zhaoxia
Hu, Ruofei
Deng, Wei
SCHIZOPHRENIA RESEARCH, 2024, 271 : 120 - 128
[28] Using fine-tuned large language models to parse clinical notes in musculoskeletal pain disorders
Vaid, Akhil
Landi, Isotta
Nadkarni, Girish
Nabeel, Ismail
LANCET DIGITAL HEALTH, 2023, 5 (12): : E855 - E858
[29] Mining Insights from Large-Scale Corpora Using Fine-Tuned Language Models
Palakodety, Shriphani
KhudaBukhsh, Ashiqur R.
Carbonell, Jaime G.
ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1890 - 1897
[30] Transfer learning and fine-tuned transfer learning methods' effectiveness analyse in the CNN-based deep learning models
Ozturk, Celal
Tasyurek, Murat
Turkdamar, Mehmet Ugur
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (04):

← 1 2 3 4 5 →