The State of Profanity Obfuscation in Natural Language Processing Scientific Publications

被引：0

作者：

Nozza, Debora ^{[1
]}

Hovy, Dirk ^{[1
]}

机构：

[1] Bocconi Univ, Milan, Italy

来源：

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023 | 2023年

基金：

欧洲研究理事会;

关键词：

TABOO;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Work on hate speech has made considering rude and harmful examples in scientific publications inevitable. This situation raises various problems, such as whether or not to obscure profanities. While science must accurately disclose what it does, the unwarranted spread of hate speech can harm readers and increases its internet frequency. While maintaining publications' professional appearance, obfuscating profanities makes it challenging to evaluate the content, especially for non-native speakers. Surveying 150 ACL papers, we discovered that obfuscation is usually used for English but not other languages, and even then, quite unevenly. We discuss the problems with obfuscation and suggest a multilingual community resource called PROF with a Python module to standardize profanity obfuscation processes. We believe PROF can help scientific publication policies to make hate speech work accessible and comparable, irrespective of language.

引用

页码：3897 / 3909

页数：13

共 50 条

[1] Potential of natural language processing for metadata extraction fromenvironmental scientific publications
Blanchy, Guillaume
Albrecht, Lukas
Koestel, John
Garre, Sarah
SOIL, 2023, 9 (01) : 155 - 168
[2] A Natural Language Processing System for Extracting Evidence of Drug Repurposing from Scientific Publications
Subramanian, Shivashankar
Baldini, Ioana
Ravichandran, Sushma
Katz-Rogozhnikov, Dmitriy A.
Ramamurthy, Karthikeyan Natesan
Sattigeri, Prasanna
Varshney, Kush R.
Wang, Annmarie
Mangalath, Pradeep
Kleiman, Laura B.
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13376 - 13381
[3] Towards robust tags for scientific publications from natural language processing tools and Wikipedia
Lopuszynski, Michal
Bolikowski, Lukasz
INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2015, 16 (01) : 25 - 36
[4] A Study of Profanity Effect in Sentiment Analysis on Natural Language Processing Using ANN
Kim, Cheong-Ghil
Hwang, Young-Jun
Kamyod, Chayapol
JOURNAL OF WEB ENGINEERING, 2022, 21 (03): : 751 - 766
[5] Tagging Scientific Publications Using Wikipedia and Natural Language Processing Tools Comparison on the ArXiv Dataset
Lopuszynski, Michal
Bolikowski, Lukasz
THEORY AND PRACTICE OF DIGITAL LIBRARIES - TPDL 2013 SELECTED WORKSHOPS, 2014, 416 : 16 - 27
[6] Reuse and plagiarism in Speech and Natural Language Processing publications
Mariani, Joseph
Francopoulo, Gil
Paroubek, Patrick
INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2018, 19 (2-3) : 113 - 126
[7] Scientific Landscape of Publications in Natural Language Processing in the ASEAN Region on COVID-19: A Bibliometric Approach
Roxas, Rachel Edita
Tobias, Rogelio Ruzcko
Minglana, Johanna
2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021, : 379 - 384
[8] LANGUAGE OF SCIENTIFIC PUBLICATIONS
PASQUALINI, CD
MEDICINA-BUENOS AIRES, 1977, 37 (5-6) : 594 - 595
[9] Connectionist natural language processing: The state of the art
Christiansen, MH
Chater, N
COGNITIVE SCIENCE, 1999, 23 (04) : 417 - 437
[10] The use of tentative language in scientific publications
LeCouteur, Richard A.
JOURNAL OF VETERINARY INTERNAL MEDICINE, 2024,

← 1 2 3 4 5 →