A lightweight convolutional neural network for large-scale Chinese image caption

被引:0
|
作者
Dexin Zhao
Ruixue Yang
Shutao Guo
机构
[1] Tianjin University of Technology,Tianjin Key Laboratory of Intelligence Computing and Novel Software Technology
来源
Optoelectronics Letters | 2021年 / 17卷
关键词
A;
D O I
暂无
中图分类号
学科分类号
摘要
Image caption is a high-level task in the area of image understanding, in which most of the models adopt a convolutional neural network (CNN) to extract image features assigning a recurrent neural network (RNN) to generate sentences. Researchers tend to design complex networks with deeper layers to improve the performance of feature extraction in recent years. Increasing the size of the network could obtain features of high quality, but it is not an efficient way in terms of computational cost. A large number of parameters brought by CNN makes the research difficult to apply in human daily life. In order to reduce the information loss of the convolutional process with less cost, we propose a lightweight convolutional neural network, named as Bifurcate-CNN (B-CNN). Furthermore, recent works are devoted to generating captions in English, in this paper, we develop an image caption model that generates descriptions in Chinese. Compared with Inception-v3, the depth of our model is shallower with fewer parameters, and the computational cost is lower. Evaluated on the AI CHALLENGER dataset, we prove that our model can enhance the performance, improving BLEU-4 from 46.1 to 49.9 and CIDEr from 142.5 to 156.6 respectively.
引用
收藏
页码:361 / 366
页数:5
相关论文
共 50 条
  • [41] CfRNet: A Lightweight Convolutional Neural Network Classification Model for Rock Image
    Tao, Liuyi
    Li, Xiaochuan
    Li, Jiaqi
    Wang, Jinyi
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 1013 - 1018
  • [42] Sentiment Classification with Convolutional Neural Networks: an Experimental Study on a Large-scale Chinese Conversation Corpus
    Zhang, Lei
    Chen, Chengcai
    PROCEEDINGS OF 2016 12TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2016, : 165 - 169
  • [43] A lightweight face detector by integrating the convolutional neural network with the image pyramid
    Luo, Jiapeng
    Liu, Jiaying
    Lin, Jun
    Wang, Zhongfeng
    PATTERN RECOGNITION LETTERS, 2020, 133 (133) : 180 - 187
  • [44] Satellite cloud image segmentation based on lightweight convolutional neural network
    Li, Xi
    Chen, Shilan
    Wu, Jin
    Li, Jun
    Wang, Ting
    Tang, Junquan
    Hu, Tongyi
    Wu, Wenzhu
    PLOS ONE, 2023, 18 (02):
  • [45] GRAPH NEURAL NETWORK FOR LARGE-SCALE NETWORK LOCALIZATION
    Yan, Wenzhong
    Jin, Di
    Lin, Zhidi
    Yin, Feng
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5250 - 5254
  • [46] See, caption, cluster: Large-scale image analysis using captioning and topic modeling
    Kang, Kyeongpil
    Jin, Kyohoon
    Jang, Soojin
    Choo, Jaegul
    Kim, Youngbin
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [47] Large-scale neural network for sentence processing
    Cooke, A
    Grossman, M
    DeVita, C
    Gonzalez-Atavales, J
    Moore, P
    Chen, W
    Gee, J
    Detre, J
    BRAIN AND LANGUAGE, 2006, 96 (01) : 14 - 36
  • [48] Network of Experts for Large-Scale Image Categorization
    Ahmed, Karim
    Baig, Mohammad Haris
    Torresani, Lorenzo
    COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 516 - 532
  • [49] Very Large-Scale Integration for Premature Ventricular Contraction Detection Using a Convolutional Neural Network
    Chen, Yuan-Ho
    Hua, Hsin-Tung
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (05)
  • [50] THE LARGE-SCALE WILDFIRE SPREAD PREDICTION USING A MULTI-KERNEL CONVOLUTIONAL NEURAL NETWORK
    Marjani, M.
    Mesgari, M. S.
    ISPRS GEOSPATIAL CONFERENCE 2022, JOINT 6TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING, SMPR/4TH GEOSPATIAL INFORMATION RESEARCH, GIRESEARCH CONFERENCES, VOL. 10-4, 2023, : 483 - 488