Fighting Offensive Language on Social Media with Unsupervised Text Style Transfer

被引:0
|
作者
dos Santos, Cicero Nogueira [1 ]
Melnyk, Igor [1 ]
Padhi, Inkit [2 ]
机构
[1] IBM Res, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] IBM Watson, TJ Watson Res Ctr, Yorktown Hts, NY USA
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We introduce a new approach to tackle the problem of offensive language in online social media. Our approach uses unsupervised text style transfer to translate offensive sentences into non-offensive ones. We propose a new method for training encoder-decoders using non-parallel data that combines a collaborative classifier, attention and the cycle consistency loss. Experimental results on data from Twitter and Reddit show that our method outperforms a state-of-the-art text style transfer system in two out of three quantitative metrics and produces reliable non-offensive transferred sentences.
引用
收藏
页码:189 / 194
页数:6
相关论文
共 50 条
  • [1] Offensive Language Detection on Social Media Based on Text Classification
    Hajibabaee, Parisa
    Malekzadeh, Masoud
    Ahmadi, Mohsen
    Heidari, Maryam
    Esmaeilzadeh, Armin
    Abdolazimi, Reyhaneh
    Jones, James H., Jr.
    [J]. 2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 92 - 98
  • [2] Unsupervised Text Style Transfer using Language Models as Discriminators
    Yang, Zichao
    Hu, Zhiting
    Dyer, Chris
    Xing, Eric P.
    Berg-Kirkpatrick, Taylor
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [3] Unsupervised Text Style Transfer with Padded Masked Language Models
    Malmi, Eric
    Severyn, Aliaksei
    Rothe, Sascha
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8671 - 8680
  • [4] Offensive Language Recognition in Social Media
    Shushkevich, Elena
    Cardiff, John
    Rosso, Paolo
    Akhtyamova, Liliya
    [J]. COMPUTACION Y SISTEMAS, 2020, 24 (02): : 523 - 532
  • [5] Transductive Learning for Unsupervised Text Style Transfer
    Xiao, Fei
    Pang, Liang
    Lan, Yanyan
    Wang, Yan
    Shen, Huawei
    Cheng, Xueqi
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2510 - 2521
  • [6] Offensive Language Detection in Nepali Social Media
    Niraula, Nobal B.
    Dulal, Saurab
    Koirala, Diwa
    [J]. WOAH 2021: THE 5TH WORKSHOP ON ONLINE ABUSE AND HARMS, 2021, : 67 - 75
  • [7] A Corpus of Turkish Offensive Language on Social Media
    Coltekin, Cagri
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6174 - 6184
  • [8] A Dataset of Offensive Language in Kosovo Social Media
    Ajvazi, Adem
    Hardmeier, Christian
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1860 - 1869
  • [9] A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images
    Shivakumara, Palaiahnakote
    Banerjee, Ayan
    Pal, Umapada
    Nandanwar, Lokesh
    Lu, Tong
    Liu, Cheng-Lin
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3552 - 3566
  • [10] MSSRNet: Manipulating Sequential Style Representation for Unsupervised Text Style Transfer
    Yang, Yazheng
    Zhao, Zhou
    Liu, Qi
    [J]. PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3022 - 3032