The State of Profanity Obfuscation in Natural Language Processing Scientific Publications

Authors
Nozza, Debora [1]
Hovy, Dirk [1]
Affiliations
[1] Bocconi Univ, Milan, Italy
Funding
European Research Council
Keywords
TABOO
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Work on hate speech has made it inevitable that scientific publications consider rude and harmful examples. This raises various problems, such as whether or not to obscure profanities. While science must accurately disclose what it does, the unwarranted spread of hate speech can harm readers and increases its frequency on the internet. Obfuscating profanities maintains a publication's professional appearance, but it makes the content harder to evaluate, especially for non-native speakers. Surveying 150 ACL papers, we discovered that obfuscation is usually used for English but not for other languages, and even then quite unevenly. We discuss the problems with obfuscation and suggest a multilingual community resource called PROF, with a Python module, to standardize profanity obfuscation processes. We believe PROF can help scientific publication policies make hate speech work accessible and comparable, irrespective of language.
Pages: 3897-3909
Number of pages: 13