Applications of graph convolutional networks in computer vision

被引:24
|
作者
Cao, Pingping [1 ]
Zhu, Zeqi [1 ]
Wang, Ziyuan [1 ]
Zhu, Yanping [2 ]
Niu, Qiang [1 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221006, Jiangsu, Peoples R China
[2] Missouri Univ Sci & Technol, Dept Civil Architectural & Environm Engn, 500 W 16th St, Rolla, MO 65409 USA
来源
NEURAL COMPUTING & APPLICATIONS | 2022年 / 34卷 / 16期
基金
中国国家自然科学基金;
关键词
Graph convolution network; Non-Euclidean space; Relational modeling; Computer vision;
D O I
10.1007/s00521-022-07368-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph Convolutional Network (GCN) which models the potential relationship between non-Euclidean spatial data has attracted researchers' attention in deep learning in recent years. It has been widely used in different computer vision tasks by modeling the latent space, topology, semantics, and other information in Euclidean spatial data and has achieved significant success. To better understand the work principles and future GCN applications in the computer vision field, this study reviewed the basic principles of GCN, summarized the difficulties and solutions using GCN in different visual tasks, and introduced in detail the methods for constructing graphs from the Euclidean spatial data in different visual tasks. At the same time, the review divided the application of GCN in basic visual tasks into image recognition, object detection, semantic segmentation, instance segmentation and object tracking. The role and performance of GCN in basic visual tasks were summarized and compared in detail for different tasks. This review emphasizes that the application of GCN in computer vision faces three challenges: computational complexity, the paradigm of constructing graphs from the Euclidean spatial data, and the interpretability of the model. Finally, this review proposes two future trends of GCN in the vision field, namely model lightweight and fusing GCN with other models to improve the performance of the visual model and meet the higher requirements of vision tasks.
引用
收藏
页码:13387 / 13405
页数:19
相关论文
共 50 条
  • [1] Applications of graph convolutional networks in computer vision
    Pingping Cao
    Zeqi Zhu
    Ziyuan Wang
    Yanping Zhu
    Qiang Niu
    [J]. Neural Computing and Applications, 2022, 34 : 13387 - 13405
  • [2] Convolutional Networks and Applications in Vision
    LeCun, Yann
    Kavukcuoglu, Koray
    Farabet, Clement
    [J]. 2010 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 253 - 256
  • [3] Graph convolutional network-based image matting algorithm for computer vision applications
    Dong, Li
    Liang, Zheng
    Wang, Yue
    [J]. IET IMAGE PROCESSING, 2022, 16 (10) : 2817 - 2825
  • [4] Graph convolutional networks in language and vision: A survey
    Ren, Haotian
    Lu, Wei
    Xiao, Yun
    Chang, Xiaojun
    Wang, Xuanhong
    Dong, Zhiqiang
    Fang, Dingyi
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 251
  • [5] Convolutional Neural Networks Implementations for Computer Vision
    Michalski, Pawel
    Ruszczak, Bogdan
    Tomaszewski, Michal
    [J]. BIOMEDICAL ENGINEERING AND NEUROSCIENCE, 2018, 720 : 98 - 110
  • [6] A review of convolutional neural networks in computer vision
    Xia Zhao
    Limin Wang
    Yufei Zhang
    Xuming Han
    Muhammet Deveci
    Milan Parmar
    [J]. Artificial Intelligence Review, 57
  • [7] A review of convolutional neural networks in computer vision
    Zhao, Xia
    Wang, Limin
    Zhang, Yufei
    Han, Xuming
    Deveci, Muhammet
    Parmar, Milan
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (04)
  • [8] Obstacle Recognition using Computer Vision and Convolutional Neural Networks for Powered Prosthetic Leg Applications
    Novo-Torres, Luis
    Ramirez-Paredes, Juan-Pablo
    Villarreal, Dario J.
    [J]. 2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 3360 - 3363
  • [9] Efficient Planar Graph Cuts with Applications in Computer Vision
    Schmidt, Frank R.
    Toeppe, Eno
    Cremers, Daniel
    [J]. CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 351 - 356
  • [10] A comprehensive review of graph convolutional networks: approaches and applications
    Xu, Xinzheng
    Zhao, Xiaoyang
    Wei, Meng
    Li, Zhongnian
    [J]. ELECTRONIC RESEARCH ARCHIVE, 2023, 31 (07): : 4185 - 4215