Towards Unified Deep Learning Model for NSFW Image and Video Captioning

被引:0
|
作者
Ko, Jong-Won [1 ]
Hwang, Dong-Hyun [1 ]
机构
[1] Enumnet Co Ltd, Res & Dev Ctr, Seoul Si, South Korea
关键词
Deep learning; CNN; RNN; NSFW image and video captioning;
D O I
10.1007/978-981-13-1328-8_8
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The deep learning model is an evolution of an artificial intelligence model called the Artificial Neural Network. And the internal layer of an artificial neural network consisting of layers is a multi-stage structure, the latest deep learning model has a larger number of internal layers, which can result in up to billions of nodes. In addition, learning models combining CNN and RNN to comment on pictures or video are currently being studied. The images or videos are input by CNN, summarized, and the results are input into RNN for printing out meaningful sentences, the so-called image and video captioning. This paper proposes unified deep learning model for NSFW image and video captioning. As noted above, traditional studies on image and video captioning have been approached via a combination of CNN and RNN models. In contrast, in this paper, the classification for safety judgement, object detection, and captioning can all be handled through one dataset definition.
引用
收藏
页码:57 / 63
页数:7
相关论文
共 50 条
  • [1] Image and Video Captioning for Apparels Using Deep Learning
    Agarwal, Govind
    Jindal, Kritika
    Chowdhury, Abishi
    Singh, Vishal K.
    Pal, Amrit
    [J]. IEEE ACCESS, 2024, 12 : 113138 - 113150
  • [2] Unified deep learning model for multitask representation and transfer learning: image classification, object detection, and image captioning
    Bayisa, Leta Yobsan
    Wang, Weidong
    Wang, Qingxian
    Ukwuoma, Chiagoziem C.
    Gutema, Hirpesa Kebede
    Endris, Ahmed
    Abu, Turi
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (10) : 4617 - 4637
  • [3] Deep learning based, a new model for video captioning
    Department of Computer Engineering, Faculty of Engineering Gazi University, Ankara, Turkey
    [J]. 1600, Science and Information Organization (11):
  • [4] Deep Learning based, a New Model for Video Captioning
    Ozer, Elif Gusta
    Karapinar, Ilteber Nur
    Busbug, Sena
    Turan, Sumeyye
    Utku, Anil
    Akcayol, M. Ali
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (03) : 514 - 519
  • [5] Deep Learning for Video Captioning: A Review
    Chen, Shaoxiang
    Yao, Ting
    Jiang, Yu-Gang
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6283 - 6290
  • [6] A Deep Structured Model for Video Captioning
    Vinodhini, V.
    Sathiyabhama, B.
    Sankar, S.
    Somula, Ramasubbareddy
    [J]. INTERNATIONAL JOURNAL OF GAMING AND COMPUTER-MEDIATED SIMULATIONS, 2020, 12 (02) : 44 - 56
  • [7] Deep Learning and Video Quality Analysis: Towards A Unified VQA
    Topiwala, P.
    Dai, W.
    Pian, J.
    [J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIII, 2020, 11510
  • [8] VIDEO CAPTIONING BASED ON JOINT IMAGE-AUDIO DEEP LEARNING TECHNIQUES
    Wang, Chien-Yao
    Liaw, Pei-Sin
    Liang, Kai-Wen
    Wang, Jai-Ching
    Chang, Pao-Chi
    [J]. 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE-BERLIN), 2019, : 127 - 131
  • [9] Deep Learning for Military Image Captioning
    Das, Subrata
    Jain, Lalit
    Das, Amp
    [J]. 2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 2165 - 2171
  • [10] Image Captioning using Deep Learning
    Jain, Yukti Sanjay
    Dhopeshwar, Tanisha
    Chadha, Supreet Kaur
    Pagire, Vrushali
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021,