Towards Responsible Natural Language Annotation for the Varieties of Arabic

被引:0
|
作者
Bergman, A. Stevie [1 ]
Diab, Mona T. [2 ]
机构
[1] Meta, Responsible AI, New York, NY 10003 USA
[2] Meta, Responsible AI, Seattle, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When building NLP models, there is a tendency to aim for broader coverage, often overlooking cultural and (socio)linguistic nuance. In this position paper, we make the case for care and attention to such nuances, particularly in dataset annotation, as well as the inclusion of cultural and linguistic expertise in the process. We present a playbook for responsible dataset creation for polyglossic, multidialectal languages. This work is informed by a study on Arabic annotation of social media content.
引用
收藏
页码:364 / 371
页数:8
相关论文
共 50 条
  • [1] Towards Controlled Natural Language for Semantic Annotation
    Davis, Brian
    Dantuluri, Pradeep
    Handschuh, Siegfried
    Cunningham, Hamish
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2010, 6 (04) : 64 - 91
  • [2] Arab Gloss Annotation System for Arabic Sign Language
    Aouiti, Nadia
    Jemni, Mohamed
    Semreen, Sameer
    2015 5TH INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND ACCESSIBILITY (ICTA), 2015,
  • [3] Towards a Linguistic Annotation of Arabic Legal Texts: A Multilingual Electronic Dictionary for Arabic
    ElFqih, Khadija Ait
    Di Buono, Maria Pia
    Monti, Johanna
    FORMALIZING NATURAL LANGUAGES: APPLICATIONS TO NATURAL LANGUAGE PROCESSING AND DIGITAL HUMANITIES, NOOJ 2023, 2024, 1816 : 51 - 63
  • [4] Towards a Cascade of Morpho-syntactic Tools for Arabic Natural Language Processing
    Mesfar, Slim
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2010, 6008 : 150 - 162
  • [5] Towards automatic body language annotation
    Chippendale, Paul
    PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION - PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE, 2006, : 487 - 492
  • [6] Towards a historical dictionary for Arabic language
    Laatar, Rim
    Aloulou, Chafik
    Belguith, Lamia Hadrich
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 29 - 41
  • [7] Controlled Natural Language for Semantic Annotation
    Davis, Brian
    Varma, Pradeep
    Handschuh, Siegfried
    Dragan, Laura
    Cunningham, Hamish
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, 2009, 5554 : 816 - +
  • [8] Towards a historical dictionary for Arabic language
    Rim Laatar
    Chafik Aloulou
    Lamia Hadrich Belguith
    International Journal of Speech Technology, 2022, 25 : 29 - 41
  • [9] Annotation of Spatial Relations in Natural language
    Shen, Qijun
    Zhang, Xueying
    Jiang, Wenming
    2009 INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND INFORMATION APPLICATION TECHNOLOGY, VOL III, PROCEEDINGS,, 2009, : 418 - 421
  • [10] Toward the Creation of an Arab Gloss for Arabic Sign Language Annotation
    Jemni, Mohamed
    Semreen, Sameer
    Othman, Achraf
    Tmar, Zouhour
    Aouiti, Nadia
    2013 FOURTH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY AND ACCESSIBILITY (ICTA), 2013,