Multilingual and Multimodal Abuse Detection

被引:0
|
作者
Sharon, Rini [1 ]
Shah, Heet [1 ]
Mukherjee, Debdoot [1 ]
Gupta, Vikram [1 ]
机构
[1] ShareChat, New Delhi, India
来源
关键词
abusive speech detection; multimodal abuse detection; multilingual abuse detection;
D O I
10.21437/Interspeech.2022-10629
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The presence of abusive content on social media platforms is undesirable as it severely impedes healthy and safe social media interactions. While automatic abuse detection has been widely explored in textual domain, audio abuse detection still remains unexplored. In this paper, we attempt abuse detection in conversational audio from a multimodal perspective in a multilingual social media setting. Our key hypothesis is that along with the modelling of audio, incorporating discriminative information from other modalities can be beneficial for this task. Our proposed method, MADA, explicitly focuses on two modalities other than the audio itself, namely, the underlying emotions expressed in the abusive audio and the semantic information encapsulated in the corresponding text. Observations prove that MADA demonstrates gains over audio-only approaches on the ADIMA dataset. We test the proposed approach on 10 different languages and observe consistent gains in the range 0.6%-5.2% by leveraging multiple modalities. We also perform extensive ablation experiments for studying the contributions of every modality and observe the best results while leveraging all the modalities together. Additionally, we perform experiments to empirically confirm that there is a strong correlation between underlying emotions and abusive behaviour. Code is available at https://github.com/ShareChatAI/MADA
引用
收藏
页码:4631 / 4635
页数:5
相关论文
共 50 条
  • [1] ADIMA: ABUSE DETECTION IN MULTILINGUAL AUDIO
    Gupta, Vikram
    Sharon, Rini
    Sawhney, Ramit
    Mukherjee, Debdoot
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6172 - 6176
  • [2] Blue Sky: Multilingual, Multimodal Domain Independent Deception Detection
    Boumber, Dainis
    Verma, Rakesh M.
    Qachfar, Fatima Zahra
    [J]. PROCEEDINGS OF THE 2024 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2024, : 396 - 399
  • [3] Multilingual Cyber Abuse Detection using Advanced Transformer Architecture
    Malte, Aditya
    Ratadiya, Pratik
    [J]. PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 784 - 789
  • [4] Multilingual and Multimodal Interactions
    Wagner, Johannes
    [J]. APPLIED LINGUISTICS, 2018, 39 (01) : 99 - 107
  • [5] Flood Detection in Social Media Using Multimodal Fusion on Multilingual Dataset
    Jony, Rabiul Islam
    Woodley, Alan
    Perrin, Dimitri
    [J]. 2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 566 - 573
  • [6] Multilingual Image Corpus - Towards a Multimodal and Multilingual Dataset
    Koeva, Svetla
    Stoyanova, Ivelina
    Kralev, Jordan
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1509 - 1518
  • [7] Dialogue in Multilingual and Multimodal Communities
    Milburn, Trudy
    [J]. JOURNAL OF LANGUAGE AND SOCIAL PSYCHOLOGY, 2017, 36 (03) : 380 - 382
  • [8] Media Identities - multimodal and multilingual
    Schwegler, Carolin
    Steen, Pamela
    [J]. LILI-ZEITSCHRIFT FUR LITERATURWISSENSCHAFT UND LINGUISTIK, 2024, : 383 - 391
  • [9] MMTD: A Multilingual and Multimodal Spam Detection Model Combining Text and Document Images
    Zhang, Ziqi
    Deng, Zhaohong
    Zhang, Wei
    Bu, Lingchao
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [10] MuMUR: Multilingual Multimodal Universal Retrieval
    Madasu, Avinash
    Aflalo, Estelle
    Stan, Gabriela Ben Melech
    Rosenman, Shachar
    Tseng, Shao-Yen
    Bertasius, Gedas
    Lal, Vasudev
    [J]. INFORMATION RETRIEVAL JOURNAL, 2023, 26 (1-2):