Abusive Text Examination Using Latent Dirichlet Allocation, Self Organizing Maps and K Means Clustering

被引:0
|
作者
Saini, Yash [1 ]
Bachchas, Vishal [1 ]
Kumar, Yogesh [1 ]
Kumar, Sanjay [1 ]
机构
[1] Delhi Technol Univ, Dept Comp Sci & Engn, Delhi, India
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020) | 2020年
关键词
Hate speech; topic modeling; latent dirichlet allocation; and self-organizing maps;
D O I
10.1109/iciccs48265.2020.9121090
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The widespread misuse of social media to disseminate hate speech targeted at a particular individual, community, religion, race, sex or caste has engendered researchers from across the world to formulate strategies and methodologies to counter this menace. Without detection and analysis of hate speech, one cannot imagine the social media to be free of malicious content. This paper proposes a methodology which employs a combination of popular topic modeling technique i.e. Latent Dirichlet Allocation (LDA) and an unsupervised machine learning technique i.e. self-organizing maps (SOM) to analyze hate speech spread over social media. This method is compared to K means clustering used after the application of LDA. Both the techniques used together provide a powerful analysis. The proposed LDA model outputted ten topics for features and had a low perplexity with a higher negative log-likelihood score.
引用
收藏
页码:1233 / 1238
页数:6
相关论文
共 50 条
  • [1] A Chinese Word Clustering Method Using Latent Dirichlet Allocation and K-means
    Qiu Lin
    Xu Jungang
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER SCIENCE AND ENGINEERING (CSE 2013), 2013, 42 : 267 - 270
  • [2] Clustering gene expression data using self-organizing maps and k-means clustering
    Yano, N
    Kotani, A
    SICE 2003 ANNUAL CONFERENCE, VOLS 1-3, 2003, : 3211 - 3215
  • [3] Self-organizing maps as substitutes for k-means clustering
    Baçao, F
    Lobo, V
    Painho, M
    COMPUTATIONAL SCIENCE - ICCS 2005, PT 3, 2005, 3516 : 476 - 483
  • [4] Analysis of gene expression data by using self-organizing maps and K-means clustering
    Sugiyama, A
    Kotani, M
    PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 1342 - 1345
  • [5] COMPARING SELF ORGANIZING MAPS WITH K-MEANS CLUSTERING: AN APPLICATION TO CUSTOMER PROFILING
    Ince, Huseyin
    Imamoglu, Salih Zeki
    Keskin, Halit
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2013, 28 (04): : 723 - 731
  • [6] Comparing self organizing maps with k-means clustering: An application to customer profiling
    Öz-düzenlemel harta alari le k-ortalama kümeleme analznn karilatirilmasi: tüketc proflleme örne
    1600, Gazi Universitesi (28):
  • [7] Automatic Unsupervised Segmentation of Retinal Vessels Using Self-Organizing Maps and K-Means Clustering
    Lupascu, Carmen Alina
    Tegolo, Domenico
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, 2011, 6685 : 263 - 274
  • [8] Feature extraction for document text using Latent Dirichlet Allocation
    Prihatini, P. M.
    Suryawan, I. K.
    Mandia, I. N.
    2ND INTERNATIONAL JOINT CONFERENCE ON SCIENCE AND TECHNOLOGY (IJCST) 2017, 2018, 953
  • [9] A Study on Self-Organizing Maps and K-Means Clustering on a Music Genre Dataset
    Azcarraga, A.
    Flores, F. K.
    THEORY AND PRACTICE OF COMPUTATION, 2018, : 219 - 234
  • [10] Arabic Text Clustering Using Self-Organizing Maps and Grey Wolf Optimization
    Larabi-Marie-Sainte, Souad
    Bin Alamir, Mashael
    Alameer, Abdulmajeed
    APPLIED SCIENCES-BASEL, 2023, 13 (18):