Multi-label Categorization of Accounts of Sexism using a Neural Framework

被引:0
|
作者
Parikh, Pulkit [1 ]
Abburi, Harika [1 ]
Badjatiya, Pinkesh [1 ]
Krishnan, Radhika [1 ]
Chhaya, Niyati [1 ,2 ]
Gupta, Manish [1 ]
Varma, Vasudeva [1 ]
机构
[1] IIIT Hyderabad, Hyderabad, India
[2] Adobe Res, Bangalore, Karnataka, India
关键词
AGREEMENT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sexism, an injustice that subjects women and girls to enormous suffering, manifests in blatant as well as subtle ways. In the wake of growing documentation of experiences of sexism on the web, the automatic categorization of accounts of sexism has the potential to assist social scientists and policy makers in utilizing such data to study and counter sexism better. The existing work on sexism classification, which is different from sexism detection, has certain limitations in terms of the categories of sexism used and/or whether they can co-occur. To the best of our knowledge, this is the first work on the multi-label classification of sexism of any kind(s), and we contribute the largest dataset for sexism categorization. We develop a neural solution for this multi-label classification that can combine sentence representations obtained using models such as BERT with distributional and linguistic word embeddings using a flexible, hierarchical architecture involving recurrent components and optional convolutional ones. Further, we leverage unlabeled accounts of sexism to infuse domain-specific elements into our framework. The best proposed method outperforms several deep learning as well as traditional machine learning baselines by an appreciable margin.
引用
收藏
页码:1642 / 1652
页数:11
相关论文
共 50 条
  • [1] A 6-tuple Framework for the Current Multi-label Text Categorization
    Fang, Ying
    Huang, He-yan
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (ICCSAI 2014), 2015, : 38 - 42
  • [2] Minimizing Supervision in Multi-label Categorization
    Rajat
    Varshney, Munender
    Singh, Pravendra
    Namboodiri, Vinay P.
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 93 - 102
  • [3] Multi-Label Object Categorization Using Histograms of Global Relations
    Mustafa, Wail
    Xiong, Hanchen
    Kraft, Dirk
    Szedmak, Sandor
    Piater, Justus
    Kruger, Norbert
    [J]. 2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, : 309 - 317
  • [4] A Framework for Multi-Label Learning Using Label Ranking and Correlation
    Shaukat, Malik Irfan
    Usman, Muhammad
    [J]. ADVANCES IN DIGITAL TECHNOLOGIES, 2015, 275 : 296 - 303
  • [5] A Multi-label Scene Categorization Model Based on Deep Convolutional Neural Network
    Zhao, Gaofeng
    Luo, Wang
    Cui, Yang
    Fan, Qiang
    Peng, Qiwei
    Kong, Zhen
    Zhu, Liang
    Zhang, Tai
    [J]. COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 128 - 135
  • [6] Multi-label Aspect Categorization with Convolutional Neural Networks and Extreme Gradient Boosting
    Azhar, Annisa Nurul
    Khodra, Masayu Leylia
    Sutiono, Arie Pratama
    [J]. PROCEEDING OF 2019 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI), 2019, : 35 - 40
  • [7] Ensemble Application of Convolutional and Recurrent Neural Networks for Multi-label Text Categorization
    Chen, Guibin
    Ye, Deheng
    Xing, Zhenchang
    Chen, Jieshan
    Cambria, Erik
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 2377 - 2383
  • [8] Selection strategies for multi-label text categorization
    Montejo-Raez, Arturo
    Urena-Lopez, Luis Alfonso
    [J]. ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4139 : 585 - 592
  • [9] Boosting multi-label hierarchical text categorization
    Esuli, Andrea
    Fagni, Tiziano
    Sebastiani, Fabrizio
    [J]. INFORMATION RETRIEVAL, 2008, 11 (04): : 287 - 313
  • [10] Boosting multi-label hierarchical text categorization
    Andrea Esuli
    Tiziano Fagni
    Fabrizio Sebastiani
    [J]. Information Retrieval, 2008, 11 : 287 - 313