Causal reasoning in typical computer vision tasks

被引:0
|
作者
ZHANG KeXuan [1 ]
SUN QiYu [1 ]
ZHAO ChaoQiang [2 ,3 ]
TANG Yang [1 ]
机构
[1] Key Laboratory of Advanced Control and Optimization for Chemical Process, Ministry of Education, East China University of Science and Technology
[2] National Key Laboratory of Air based Information Perception and Fusion
[3] Luoyang Institute of Electro Optical Equipment of Avic
基金
中国国家自然科学基金; 中央高校基本科研业务费专项资金资助;
关键词
D O I
暂无
中图分类号
TP391.41 []; TP18 [人工智能理论];
学科分类号
080203 ; 081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has revolutionized the field of artificial intelligence. Based on the statistical correlations uncovered by deep learning-based methods, computer vision tasks, such as autonomous driving and robotics, are growing rapidly. Despite being the basis of deep learning, such correlation strongly depends on the distribution of the original data and is susceptible to uncontrolled factors. Without the guidance of prior knowledge, statistical correlations alone cannot correctly reflect the essential causal relations and may even introduce spurious correlations. As a result, researchers are now trying to enhance deep learningbased methods with causal theory. Causal theory can model the intrinsic causal structure unaffected by data bias and effectively avoids spurious correlations. This paper aims to comprehensively review the existing causal methods in typical vision and visionlanguage tasks such as semantic segmentation, object detection, and image captioning. The advantages of causality and the approaches for building causal paradigms will be summarized. Future roadmaps are also proposed, including facilitating the development of causal theory and its application in other complex scenarios and systems.
引用
收藏
页码:105 / 120
页数:16
相关论文
共 50 条
  • [1] Causal reasoning in typical computer vision tasks
    Zhang, Kexuan
    Sun, Qiyu
    Zhao, Chaoqiang
    Tang, Yang
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (01) : 105 - 120
  • [2] Causal reasoning in typical computer vision tasks
    KeXuan Zhang
    QiYu Sun
    ChaoQiang Zhao
    Yang Tang
    Science China Technological Sciences, 2024, 67 : 105 - 120
  • [3] GEOMETRIC REASONING FOR COMPUTER VISION
    ORR, MJL
    FISHER, RB
    IMAGE AND VISION COMPUTING, 1987, 5 (03) : 233 - 238
  • [4] Causal Attention for Vision-Language Tasks
    Yang, Xu
    Zhang, Hanwang
    Qi, Guojun
    Cai, Jianfei
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9842 - 9852
  • [5] Semantic Bottleneck for Computer Vision Tasks
    Bucher, Maxime
    Herbin, Stephane
    Jurie, Frederic
    COMPUTER VISION - ACCV 2018, PT II, 2019, 11362 : 695 - 712
  • [6] MULTILEVEL VISION BASED SPATIAL REASONING FOR ROBOTIC TASKS
    MAGEE, M
    BECKER, J
    MATHIS, D
    SKLAIR, C
    WOLFE, W
    PROCEEDINGS - 1989 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOL 1-3, 1989, : 503 - 508
  • [8] Application of Graph Structures in Computer Vision Tasks
    Andriyanov, Nikita
    MATHEMATICS, 2022, 10 (21)
  • [9] Computer Vision Onboard UAVs for Civilian Tasks
    Pascual Campoy
    Juan F. Correa
    Ivan Mondragón
    Carol Martínez
    Miguel Olivares
    Luis Mejías
    Jorge Artieda
    Journal of Intelligent and Robotic Systems, 2009, 54 : 105 - 135
  • [10] Learning to Resize Images for Computer Vision Tasks
    Talebi, Hossein
    Milanfar, Peyman
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 487 - 496