The State of Profanity Obfuscation in Natural Language Processing Scientific Publications

Cited by: 0
Authors
Nozza, Debora [1]
Hovy, Dirk [1]
Affiliations
[1] Bocconi Univ, Milan, Italy
Funding
European Research Council;
Keywords
TABOO;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Work on hate speech has made it inevitable to consider rude and harmful examples in scientific publications. This raises several questions, such as whether or not to obscure profanities. While science must accurately disclose what it does, the unwarranted spread of hate speech can harm readers and increase its frequency on the internet. Obfuscating profanities maintains a publication's professional appearance, but it makes the content harder to evaluate, especially for non-native speakers. Surveying 150 ACL papers, we found that obfuscation is usually applied to English but not to other languages, and even then quite unevenly. We discuss the problems with obfuscation and propose PROF, a multilingual community resource with a Python module, to standardize profanity obfuscation processes. We believe PROF can help scientific publication policies make hate speech work accessible and comparable, irrespective of language.
Pages: 3897 - 3909
Page count: 13
Related papers
50 in total
  • [1] Potential of natural language processing for metadata extraction from environmental scientific publications
    Blanchy, Guillaume
    Albrecht, Lukas
    Koestel, John
    Garre, Sarah
    SOIL, 2023, 9 (01) : 155 - 168
  • [2] A Natural Language Processing System for Extracting Evidence of Drug Repurposing from Scientific Publications
    Subramanian, Shivashankar
    Baldini, Ioana
    Ravichandran, Sushma
    Katz-Rogozhnikov, Dmitriy A.
    Ramamurthy, Karthikeyan Natesan
    Sattigeri, Prasanna
    Varshney, Kush R.
    Wang, Annmarie
    Mangalath, Pradeep
    Kleiman, Laura B.
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13376 - 13381
  • [3] Towards robust tags for scientific publications from natural language processing tools and Wikipedia
    Lopuszynski, Michal
    Bolikowski, Lukasz
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2015, 16 (01) : 25 - 36
  • [4] A Study of Profanity Effect in Sentiment Analysis on Natural Language Processing Using ANN
    Kim, Cheong-Ghil
    Hwang, Young-Jun
    Kamyod, Chayapol
    JOURNAL OF WEB ENGINEERING, 2022, 21 (03) : 751 - 766
  • [5] Tagging Scientific Publications Using Wikipedia and Natural Language Processing Tools Comparison on the ArXiv Dataset
    Lopuszynski, Michal
    Bolikowski, Lukasz
    THEORY AND PRACTICE OF DIGITAL LIBRARIES - TPDL 2013 SELECTED WORKSHOPS, 2014, 416 : 16 - 27
  • [6] Reuse and plagiarism in Speech and Natural Language Processing publications
    Mariani, Joseph
    Francopoulo, Gil
    Paroubek, Patrick
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2018, 19 (2-3) : 113 - 126
  • [7] Scientific Landscape of Publications in Natural Language Processing in the ASEAN Region on COVID-19: A Bibliometric Approach
    Roxas, Rachel Edita
    Tobias, Rogelio Ruzcko
    Minglana, Johanna
    2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021 : 379 - 384
  • [8] LANGUAGE OF SCIENTIFIC PUBLICATIONS
    PASQUALINI, CD
    MEDICINA-BUENOS AIRES, 1977, 37 (5-6) : 594 - 595
  • [9] Connectionist natural language processing: The state of the art
    Christiansen, MH
    Chater, N
    COGNITIVE SCIENCE, 1999, 23 (04) : 417 - 437
  • [10] The use of tentative language in scientific publications
    LeCouteur, Richard A.
    JOURNAL OF VETERINARY INTERNAL MEDICINE, 2024