The State of Profanity Obfuscation in Natural Language Processing Scientific Publications

被引:0
|
作者
Nozza, Debora [1 ]
Hovy, Dirk [1 ]
机构
[1] Bocconi Univ, Milan, Italy
基金
欧洲研究理事会;
关键词
TABOO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Work on hate speech has made considering rude and harmful examples in scientific publications inevitable. This situation raises various problems, such as whether or not to obscure profanities. While science must accurately disclose what it does, the unwarranted spread of hate speech can harm readers and increases its internet frequency. While maintaining publications' professional appearance, obfuscating profanities makes it challenging to evaluate the content, especially for non-native speakers. Surveying 150 ACL papers, we discovered that obfuscation is usually used for English but not other languages, and even then, quite unevenly. We discuss the problems with obfuscation and suggest a multilingual community resource called PROF with a Python module to standardize profanity obfuscation processes. We believe PROF can help scientific publication policies to make hate speech work accessible and comparable, irrespective of language.
引用
收藏
页码:3897 / 3909
页数:13
相关论文
共 50 条
  • [31] Survey: Finite-state technology in natural language processing
    Maletti, Andreas
    THEORETICAL COMPUTER SCIENCE, 2017, 679 : 2 - 17
  • [32] Measuring Innovation in Speech and Language Processing Publications
    Mariani, Joseph
    Francopoulo, Gil
    Paroubek, Patrick
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1890 - 1895
  • [33] Trends in Computational Science: Natural Language Processing and Network Analysis of 23 Years of ICCS Publications
    Luo, Lijing
    Kovalchuk, Sergey
    Krzhizhanovskaya, Valeria
    Paszynski, Maciej
    de Mulatier, Clelia
    Dongarra, Jack
    Sloot, Peter M. A.
    COMPUTATIONAL SCIENCE, ICCS 2024, PT II, 2024, 14833 : 19 - 33
  • [34] Natural language processing
    Chowdhury, GG
    ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 2003, 37 : 51 - 89
  • [35] Natural language processing
    Martinez, Angel R.
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (03) : 352 - 357
  • [36] Natural language processing
    EDITORIAL: Automatische Sprachverarbeitung
    Hoepel-Man, Jakob, 1600, De Gruyter Oldenbourg (36):
  • [37] Natural language processing
    Anon
    1600, Knowledge Technology Inc. (15):
  • [38] Natural language processing
    Gelbukh, A
    HIS 2005: 5th International Conference on Hybrid Intelligent Systems, Proceedings, 2005, : 6 - 6
  • [39] Semantic Annotation of Data Processing Pipelines in Scientific Publications
    Mesbah, Sepideh
    Fragkeskos, Kyriakos
    Lofi, Christoph
    Bozzon, Alessandro
    Houben, Geert-Jan
    SEMANTIC WEB ( ESWC 2017), PT I, 2017, 10249 : 321 - 336
  • [40] Putting Natural in Natural Language Processing
    Chrupala, Grzegorz
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 7820 - 7827