Towards Generalized Offensive Language Identification

被引:0
|
作者
Dmonte, Alphaeus [1 ]
Arya, Tejas [2 ]
Ranasinghe, Tharindu [3 ]
Zampieri, Marcos [1 ]
机构
[1] George Mason Univ, Fairfax, VA 22030 USA
[2] Rochester Inst Technol, Rochester, NY USA
[3] Univ Lancaster, Lancaster, England
关键词
Offensive Language; Large Language Models; Generalizability;
D O I
10.1007/978-3-031-78541-2_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The prevalence of offensive content on the internet, encompassing hate speech and cyberbullying, is a pervasive issue worldwide. Consequently, it has garnered significant attention from the machine learning (ML) and natural language processing (NLP) communities. As a result, numerous systems have been developed to automatically identify potentially harmful content and to mitigate its impact. These systems can follow two approaches; (i) Use publicly available models and application endpoints, including prompting large language models (LLMs) (ii) Annotate datasets and train ML models on them. However, both approaches lack an understanding of how generalizable they are. Furthermore, the applicability of these systems is often questioned in off-domain and practical environments. This paper empirically evaluates the generalizability of offensive language detection models and datasets across a novel generalized benchmark: GenOffense. We answer three research questions on generalizability. Our findings will be useful in creating robust real-world offensive language detection systems.
引用
收藏
页码:271 / 286
页数:16
相关论文
共 50 条
  • [21] An Evaluation of Multilingual Offensive Language Identification Methods for the Languages of India
    Ranasinghe, Tharindu
    Zampieri, Marcos
    INFORMATION, 2021, 12 (08)
  • [22] Towards offensive language detection and reduction in four Software Engineering communities
    Cheriyan, Jithin
    Savarimuthu, Bastin Tony Roy
    Cranefield, Stephen
    PROCEEDINGS OF EVALUATION AND ASSESSMENT IN SOFTWARE ENGINEERING (EASE 2021), 2021, : 254 - 259
  • [23] Towards protocols for describing racially offensive language in UK public archives
    Alicia Chilcott
    Archival Science, 2019, 19 : 359 - 376
  • [24] Towards protocols for describing racially offensive language in UK public archives
    Chilcott, Alicia
    ARCHIVAL SCIENCE, 2019, 19 (04) : 359 - 376
  • [25] Aggressive and Offensive Language Identification in Hindi, Bangla, and English: A Comparative Study
    Kumar R.
    Lahiri B.
    Ojha A.K.
    SN Computer Science, 2021, 2 (1)
  • [26] OLID-BR: offensive language identification dataset for Brazilian Portuguese
    Trajano, Douglas
    Bordini, Rafael H.
    Vieira, Renata
    LANGUAGE RESOURCES AND EVALUATION, 2024, 58 (04) : 1263 - 1289
  • [27] Graph Knowledge Transfer for Offensive Language Identification with Graph Neural Networks
    Huang, Yen-Hao
    Harryyanto, Kevin
    Tsai, Che-Wei
    Pornvattanavichai, Ratana
    Chen, Yi-Shin
    2022 IEEE 23RD INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2022), 2022, : 216 - 221
  • [28] An Automatic Approach for the Identification of Offensive Language in Perso-Arabic Urdu Language: Dataset Creation and Evaluation
    Din, Salah Ud
    Khusro, Shah
    Khan, Farman Ali
    Ahmad, Munir
    Ali, Oualid
    Ghazal, Taher M.
    IEEE ACCESS, 2025, 13 : 19755 - 19769
  • [29] Offensive Language in the General Hospital
    Zimmerman, Daniel J.
    Stern, Theodore A.
    PSYCHOSOMATICS, 2010, 51 (05) : 377 - 385
  • [30] Persian offensive language detection
    Kebriaei, Emad
    Homayouni, Ali
    Faraji, Roghayeh
    Razavi, Armita
    Shakery, Azadeh
    Faili, Heshaam
    Yaghoobzadeh, Yadollah
    MACHINE LEARNING, 2024, 113 (07) : 4359 - 4379