Automatic content moderation on social media

被引:0
|
作者
Karabulut, Dogus [1 ]
Ozcinar, Cagri [1 ]
Anbarjafari, Gholamreza [1 ,2 ,3 ,4 ]
机构
[1] Univ Tartu, Inst Technol, iCV Res Lab, EE-50411 Tartu, Estonia
[2] Yildiz Tech Univ, Istanbul, Turkey
[3] iVCV OU, Tartu, Estonia
[4] PwC Advisory, Helsinki, Finland
关键词
Inappropriate scene recognition; Content obfuscation; Convolutional neural networks; WEB PAGES;
D O I
10.1007/s11042-022-11968-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Millions of users produce and consume billions of content on social media. Therefore, human-reviewed content moderation is not achievable in such volume. Automating content moderation is a scalable solution for social media platforms. In this research work, we propose an automatic content moderation pipeline based on deep neural networks. Our solution consists of two main parts: the first part classifies a given image into granular content classes; and a second part obfuscates the part of a given image that might be inappropriate for the target audience. Our proposed solution is a cost-efficient in terms of human labour and practical for deploying the real-time systems. Our classification network is trained with automatically labelled data using noise-robust techniques. Our automatic obfuscation algorithm uses the information obtained from the classification network and does not require additional annotation or supplementary training. This obfuscation algorithm presents a novel-use case of class-specific activation mappings for censoring regional explicit nudity in images. The classification network achieves a top-1 accuracy of 0.903 and a top-2 accuracy of 0.986. The obfuscation algorithm covers a minimum explicitly nude area of 0.68 on average.
引用
收藏
页码:4439 / 4463
页数:25
相关论文
共 50 条