A lightweight convolutional neural network for large-scale Chinese image caption

被引:0
|
作者
Dexin Zhao
Ruixue Yang
Shutao Guo
机构
[1] Tianjin University of Technology,Tianjin Key Laboratory of Intelligence Computing and Novel Software Technology
来源
Optoelectronics Letters | 2021年 / 17卷
关键词
A;
D O I
暂无
中图分类号
学科分类号
摘要
Image caption is a high-level task in the area of image understanding, in which most of the models adopt a convolutional neural network (CNN) to extract image features assigning a recurrent neural network (RNN) to generate sentences. Researchers tend to design complex networks with deeper layers to improve the performance of feature extraction in recent years. Increasing the size of the network could obtain features of high quality, but it is not an efficient way in terms of computational cost. A large number of parameters brought by CNN makes the research difficult to apply in human daily life. In order to reduce the information loss of the convolutional process with less cost, we propose a lightweight convolutional neural network, named as Bifurcate-CNN (B-CNN). Furthermore, recent works are devoted to generating captions in English, in this paper, we develop an image caption model that generates descriptions in Chinese. Compared with Inception-v3, the depth of our model is shallower with fewer parameters, and the computational cost is lower. Evaluated on the AI CHALLENGER dataset, we prove that our model can enhance the performance, improving BLEU-4 from 46.1 to 49.9 and CIDEr from 142.5 to 156.6 respectively.
引用
收藏
页码:361 / 366
页数:5
相关论文
共 50 条
  • [1] A lightweight convolutional neural network for large-scale Chinese image caption
    赵德新
    杨瑞雪
    郭淑涛
    OptoelectronicsLetters, 2021, 17 (06) : 361 - 366
  • [2] A lightweight convolutional neural network for large-scale Chinese image caption
    Zhao, Dexin
    Yang, Ruixue
    Guo, Shutao
    OPTOELECTRONICS LETTERS, 2021, 17 (06) : 361 - 366
  • [3] Efficient Inference of Large-Scale and Lightweight Convolutional Neural Networks on FPGA
    Wu, Xiao
    Ma, Yufei
    Wang, Zhongfeng
    2020 IEEE 33RD INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (SOCC), 2020, : 168 - 173
  • [4] Large-Scale Hierarchical Medical Image Retrieval Based on a Multilevel Convolutional Neural Network
    Lo, Chung-Ming
    Hsieh, Cheng-Yeh
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [5] UNSUPERVISED CONVOLUTIONAL NEURAL NETWORKS FOR LARGE-SCALE IMAGE CLUSTERING
    Hsu, Chih-Chung
    Lin, Chia-Wen
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 390 - 394
  • [6] Optimization of deep convolutional neural network for large scale image retrieval
    Bai, Cong
    Huang, Ling
    Pan, Xiang
    Zheng, Jianwei
    Chen, Shengyong
    NEUROCOMPUTING, 2018, 303 : 60 - 67
  • [7] Error-Driven Incremental Learning in Deep Convolutional Neural Network for Large-Scale Image Classification
    Xiao, Tianjun
    Zhang, Jiaxing
    Yang, Kuiyuan
    Peng, Yuxin
    Zhang, Zheng
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 177 - 186
  • [8] Training Convolutional Neural Network for Sketch Recognition on Large-Scale Dataset
    Zhou, Wen
    Jia, Jinyuan
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (01) : 82 - 89
  • [9] Large scale automatic image annotation based on convolutional neural network
    Wang, Ronggui
    Xie, Yunfei
    Yang, Juan
    Xue, Lixia
    Hu, Min
    Zhang, Qingyang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 49 : 213 - 224
  • [10] DGCNN: A convolutional neural network over large-scale labeled graphs
    Anh Viet Phan
    Minh Le Nguyen
    Yen Lam Hoang Nguyen
    Lam Thu Bui
    NEURAL NETWORKS, 2018, 108 : 533 - 543