A survey of hate speech detection in Indian languages

被引:0
|
作者
Arpan Nandi
Kamal Sarkar
Arjun Mallick
Arkadeep De
机构
[1] Jadavpur University,Department of Computer Science and Engineering
关键词
Hate speech detection; Abusive comments; Indian languages; Mixed languages; Code-mixed;
D O I
暂无
中图分类号
学科分类号
摘要
With the enormous increase in accessibility of high-speed internet, the number of social media users is increasing rapidly. Due to a lack of proper regulations and ethics, social media platforms are often contaminated by posts and comments containing abusive language and offensive remarks toward individuals, groups, races, religions, and communities. A single remark often triggers a huge chain of reactions with similar abusiveness, or even more. To prevent such occurrences, there is a need for automated systems that can detect abusive texts and hate speeches and remove them immediately. However, most existing research works are limited only to globally popular languages like English. Since India is a nation of many diverse languages and multiple religions, nowadays abusive posts and remarks in Indian languages (monolingual or code-mixed form) are not infrequent on social media platforms. Although resources such as hate speech lexicon and annotated datasets are limited for Indian languages, most research works on hate speech detection in such languages used traditional machine learning and deep learning methods for this task. However, multilingualism and code-mixing make hate speech detection in Indian languages more challenging. Given these facts, this paper mainly focuses on reviewing the latest impactful research works on hate speech detection in Indian languages. In this paper, we have analyzed and compared the latest research works on hate speech detection in Indian languages in terms of various aspects—datasets used, feature extraction and classification methods applied, and the results achieved.
引用
收藏
相关论文
共 50 条
  • [1] A survey of hate speech detection in Indian languages
    Nandi, Arpan
    Sarkar, Kamal
    Mallick, Arjun
    De, Arkadeep
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [2] Hate Speech is not Free Speech: Explainable Machine Learning for Hate Speech Detection in Code-Mixed Languages
    Yadav, Sargam
    Kaushik, Abhishek
    McDaid, Kevin
    [J]. 2023 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGY AND SOCIETY, ISTAS, 2023,
  • [3] A survey on speech synthesis techniques in Indian languages
    Soumya Priyadarsini Panda
    Ajit Kumar Nayak
    Satyananda Champati Rai
    [J]. Multimedia Systems, 2020, 26 : 453 - 478
  • [4] A survey on speech synthesis techniques in Indian languages
    Panda, Soumya Priyadarsini
    Nayak, Ajit Kumar
    Rai, Satyananda Champati
    [J]. MULTIMEDIA SYSTEMS, 2020, 26 (04) : 453 - 478
  • [5] A Survey on Automatic Detection of Hate Speech in Text
    Fortuna, Paula
    Nunes, Sergio
    [J]. ACM COMPUTING SURVEYS, 2018, 51 (04)
  • [6] Hate speech detection in the Bengali language: a comprehensive survey
    Al Maruf, Abdullah
    Abidin, Ahmad Jainul
    Haque, Md. Mahmudul
    Jiyad, Zakaria Masud
    Golder, Aditi
    Alubady, Raaid
    Aung, Zeyar
    [J]. JOURNAL OF BIG DATA, 2024, 11 (01)
  • [7] ASRoIL: a comprehensive survey for automatic speech recognition of Indian languages
    Amitoj Singh
    Virender Kadyan
    Munish Kumar
    Nancy Bassan
    [J]. Artificial Intelligence Review, 2020, 53 : 3673 - 3704
  • [8] ASRoIL: a comprehensive survey for automatic speech recognition of Indian languages
    Singh, Amitoj
    Kadyan, Virender
    Kumar, Munish
    Bassan, Nancy
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (05) : 3673 - 3704
  • [9] A Survey of Machine Translation and Parts of Speech Tagging for Indian Languages
    Khedkar, Vijayshri
    Shah, Pritesh
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (04): : 245 - 253
  • [10] Automatic Hate Speech Detection on Social Media: A Brief Survey
    Alrehili, Ahlam
    [J]. 2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,