A lightweight convolutional neural network for large-scale Chinese image caption

被引:0
|
作者
Dexin Zhao
Ruixue Yang
Shutao Guo
机构
[1] Tianjin University of Technology,Tianjin Key Laboratory of Intelligence Computing and Novel Software Technology
来源
Optoelectronics Letters | 2021年 / 17卷
关键词
A;
D O I
暂无
中图分类号
学科分类号
摘要
Image caption is a high-level task in the area of image understanding, in which most of the models adopt a convolutional neural network (CNN) to extract image features assigning a recurrent neural network (RNN) to generate sentences. Researchers tend to design complex networks with deeper layers to improve the performance of feature extraction in recent years. Increasing the size of the network could obtain features of high quality, but it is not an efficient way in terms of computational cost. A large number of parameters brought by CNN makes the research difficult to apply in human daily life. In order to reduce the information loss of the convolutional process with less cost, we propose a lightweight convolutional neural network, named as Bifurcate-CNN (B-CNN). Furthermore, recent works are devoted to generating captions in English, in this paper, we develop an image caption model that generates descriptions in Chinese. Compared with Inception-v3, the depth of our model is shallower with fewer parameters, and the computational cost is lower. Evaluated on the AI CHALLENGER dataset, we prove that our model can enhance the performance, improving BLEU-4 from 46.1 to 49.9 and CIDEr from 142.5 to 156.6 respectively.
引用
收藏
页码:361 / 366
页数:5
相关论文
共 50 条
  • [31] Large-Scale Whale Call Classification Using Deep Convolutional Neural Network Architectures
    Wang, Dezhi
    Zhang, Lilun
    Lu, Zengquan
    Xu, Kele
    2018 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2018,
  • [32] Large-Scale Point Cloud Segmentation by Learnable Dynamic Grouping Convolutional Neural Network
    Yue, Kang
    Jun, Yang
    Computer Engineering and Applications, 60 (10): : 217 - 226
  • [33] Sentiment Analysis of Chinese Paintings Based on Lightweight Convolutional Neural Network
    Bian, Jianying
    Shen, Xiaoying
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [34] FontRNN: Generating Large-scale Chinese Fonts via Recurrent Neural Network
    Tang, Shusen
    Xia, Zeqing
    Lian, Zhouhui
    Tang, Yingmin
    Xiao, Jianguo
    COMPUTER GRAPHICS FORUM, 2019, 38 (07) : 567 - 577
  • [35] LiteCCLKNet: A lightweight criss-cross large kernel convolutional neural network for hyperspectral image classification
    Zhong, Chengcheng
    Gong, Na
    Zhang, Zitong
    Jiang, Yanan
    Zhang, Kai
    IET COMPUTER VISION, 2023, 17 (07) : 763 - 776
  • [36] Learning Traffic as Images: A Deep Convolutional Neural Network for Large-Scale Transportation Network Speed Prediction
    Ma, Xiaolei
    Dai, Zhuang
    He, Zhengbing
    Ma, Jihui
    Wang, Yong
    Wang, Yunpeng
    SENSORS, 2017, 17 (04)
  • [37] Image recognition based on lightweight convolutional neural network: Recent advances
    Liu, Ying
    Xue, Jiahao
    Li, Daxiang
    Zhang, Weidong
    Chiew, Tuan Kiang
    Xu, Zhijie
    IMAGE AND VISION COMPUTING, 2024, 146
  • [38] Lightweight Parallel Octave Convolutional Neural Network for Hyperspectral Image Classification
    Li, Dan
    Wu, Hanjie
    Wang, Yujian
    Li, Xiaojun
    Kong, Fanqiang
    Wang, Qiang
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2023, 89 (04): : 233 - 243
  • [39] Lightweight convolutional neural network for bitemporal SAR image change detection
    Wang, Rongfang
    Ding, Fan
    Jiao, Licheng
    Chen, Jia-Wei
    Liu, Bo
    Ma, Wenping
    Wang, Mi
    JOURNAL OF APPLIED REMOTE SENSING, 2020, 14 (03)
  • [40] Lightweight Attention Convolutional Neural Network for Retinal Vessel Image Segmentation
    Li, Xiang
    Jiang, Yuchen
    Li, Minglei
    Yin, Shen
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (03) : 1958 - 1967