Convolutional Recurrent Neural Networks for Better Image Understanding

被引:0
|
作者
Vallet, Alexis [1 ]
Sakamoto, Hiroyasu [2 ]
机构
[1] Kyushu Univ, Grad Sch Design, Fukuoka 812, Japan
[2] Kyushu Univ, Fac Design, Fukuoka 812, Japan
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Although deep convolutional neural networks have brought basic computer vision tasks to unprecedented accuracy, the best models still struggle to produce higher level image understanding. Indeed, current models for tasks such as visual question answering, often based on recurrent neural networks, have difficulties surpassing baseline methods. We suspect that this is due in part to spatial information in the image not being properly leveraged. We attempt to solve these difficulties by introducing a recurrent unit able to keep and process spatial information throughout the network. On a simple task, we show that our method is significantly more accurate than alternative baselines which discard spatial information. We also demonstrate that higher resolution input performs better than lower resolution input to a surprising degree, even when the input features are less discriminative. Notably, we show that our approach based on higher resolution input is better able to detect details of the images such as the precise number of objects, and the presence of smaller objects, while being less sensitive to biases in the label distribution of the training set.
引用
收藏
页码:675 / 681
页数:7
相关论文
共 50 条
  • [1] Convolutional and Recurrent Neural Networks for Face Image Analysis
    Yuksel, Kivanc
    Skarbek, Wladyslaw
    [J]. FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2019, 44 (03) : 331 - 347
  • [2] Convolutional neural networks in medical image understanding: a survey
    D. R. Sarvamangala
    Raghavendra V. Kulkarni
    [J]. Evolutionary Intelligence, 2022, 15 : 1 - 22
  • [3] Convolutional neural networks in medical image understanding: a survey
    Sarvamangala, D. R.
    Kulkarni, Raghavendra V.
    [J]. EVOLUTIONARY INTELLIGENCE, 2022, 15 (01) : 1 - 22
  • [4] Image Captioning using Convolutional Neural Networks and Recurrent Neural Network
    Calvin, Rachel
    Suresh, Shravya
    [J]. 2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [5] Convolutional Recurrent Neural Networks for Dynamic MR Image Reconstruction
    Qin, Chen
    Schlemper, Jo
    Caballero, Jose
    Price, Anthony N.
    Hajnal, Joseph V.
    Rueckert, Daniel
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (01) : 280 - 290
  • [6] Recurrent Convolutional Neural Networks: A Better Model of Biological Object Recognition
    Spoerer, Courtney J.
    McClure, Patrick
    Kriegeskorte, Nikolaus
    [J]. FRONTIERS IN PSYCHOLOGY, 2017, 8
  • [7] Convolutional Nonlinear Differential Recurrent Neural Networks for Crowd Scene Understanding
    Zhuang, Naifan
    The Duc Kieu
    Ye, Jun
    Hua, Kien A.
    [J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2018, 12 (04) : 481 - 500
  • [8] Convolutional Recurrent Neural Networks: Learning Spatial Dependencies for Image Representation
    Zuo, Zhen
    Shuai, Bing
    Wang, Gang
    Liu, Xiao
    Wang, Xingxing
    Wang, Bing
    Chen, Yushi
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [9] DIRECTION FINDING USING CONVOLUTIONAL NEURAL NETWORKS and CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Uckun, Fehmi Ayberk
    Ozer, Hakan
    Nurbas, Ekin
    Onat, Emrah
    [J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [10] Convolutional Neural Networks with Recurrent Neural Filters
    Yang, Yi
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 912 - 917