Applications of graph convolutional networks in computer vision

Cited by: 24

Authors:

Cao, Pingping ^{[1]}

Zhu, Zeqi ^{[1]}

Wang, Ziyuan ^{[1]}

Zhu, Yanping ^{[2]}

Niu, Qiang ^{[1]}

Affiliations:

[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221006, Jiangsu, Peoples R China

[2] Missouri Univ Sci & Technol, Dept Civil Architectural & Environm Engn, 500 W 16th St, Rolla, MO 65409 USA

Source:

NEURAL COMPUTING & APPLICATIONS | 2022 / Vol. 34 / Issue 16

Funding:

National Natural Science Foundation of China;

Keywords:

Graph convolution network; Non-Euclidean space; Relational modeling; Computer vision;

DOI:

10.1007/s00521-022-07368-1

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Graph convolutional networks (GCNs), which model the latent relationships among non-Euclidean spatial data, have attracted considerable attention in deep learning in recent years. GCNs have been widely and successfully applied to computer vision tasks by modeling the latent space, topology, semantics, and other information in Euclidean spatial data. To clarify the working principles of GCNs and their future applications in computer vision, this study reviews the basic principles of GCNs, summarizes the difficulties of applying GCNs to different visual tasks together with their solutions, and describes in detail the methods for constructing graphs from Euclidean spatial data in those tasks. The review divides the applications of GCNs in basic visual tasks into image recognition, object detection, semantic segmentation, instance segmentation, and object tracking, and summarizes and compares the role and performance of GCNs for each task. It emphasizes that applying GCNs in computer vision faces three challenges: computational complexity, the paradigm for constructing graphs from Euclidean spatial data, and model interpretability. Finally, it proposes two future trends for GCNs in vision: lightweight models, and fusing GCNs with other models to improve the performance of visual models and meet the higher requirements of vision tasks.

Pages: 13387-13405

Page count: 19
