Causal reasoning in typical computer vision tasks

被引:0
|
作者
ZHANG KeXuan [1 ]
SUN QiYu [1 ]
ZHAO ChaoQiang [2 ,3 ]
TANG Yang [1 ]
机构
[1] Key Laboratory of Advanced Control and Optimization for Chemical Process, Ministry of Education, East China University of Science and Technology
[2] National Key Laboratory of Air based Information Perception and Fusion
[3] Luoyang Institute of Electro Optical Equipment of Avic
基金
中央高校基本科研业务费专项资金资助; 中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP391.41 []; TP18 [人工智能理论];
学科分类号
080203 ; 081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has revolutionized the field of artificial intelligence. Based on the statistical correlations uncovered by deep learning-based methods, computer vision tasks, such as autonomous driving and robotics, are growing rapidly. Despite being the basis of deep learning, such correlation strongly depends on the distribution of the original data and is susceptible to uncontrolled factors. Without the guidance of prior knowledge, statistical correlations alone cannot correctly reflect the essential causal relations and may even introduce spurious correlations. As a result, researchers are now trying to enhance deep learningbased methods with causal theory. Causal theory can model the intrinsic causal structure unaffected by data bias and effectively avoids spurious correlations. This paper aims to comprehensively review the existing causal methods in typical vision and visionlanguage tasks such as semantic segmentation, object detection, and image captioning. The advantages of causality and the approaches for building causal paradigms will be summarized. Future roadmaps are also proposed, including facilitating the development of causal theory and its application in other complex scenarios and systems.
引用
收藏
页码:105 / 120
页数:16
相关论文
共 50 条
  • [31] Harvesting weakly-tagged images for computer vision tasks
    Shen, Yi
    Yang, Chunlei
    Gao, Yuli
    Fan, Jianping
    IMAGING AND PRINTING IN A WEB 2.0 WORLD; AND MULTIMEDIA CONTENT ACCESS: ALGORITHMS AND SYSTEMS IV, 2010, 7540
  • [32] A survey of public datasets for computer vision tasks in precision agriculture
    Lu, Yuzhen
    Young, Sierra
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 178
  • [33] Computer Vision Tasks for Ambient Intelligence in Children's Health
    Germanese, Danila
    Colantonio, Sara
    Del Coco, Marco
    Carcagni, Pierluigi
    Leo, Marco
    INFORMATION, 2023, 14 (10)
  • [34] NOVA: Rendering Virtual Worlds with Humans for Computer Vision Tasks
    Kerim, Abdulrahman
    Aslan, Cem
    Celikcan, Ufuk
    Erdem, Erkut
    Erdem, Aykut
    COMPUTER GRAPHICS FORUM, 2021, 40 (06) : 258 - 272
  • [35] Neural Architecture Search for Dense Prediction Tasks in Computer Vision
    Mohan, Rohit
    Elsken, Thomas
    Zela, Arber
    Metzen, Jan Hendrik
    Staffler, Benedikt
    Brox, Thomas
    Valada, Abhinav
    Hutter, Frank
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (07) : 1784 - 1807
  • [36] How Close Are Other Computer Vision Tasks to Deepfake Detection?
    Nguyen, Huy H.
    Yamagishi, Junichi
    Echizen, Isao
    2023 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS, IJCB, 2023,
  • [37] Hyneter:Hybrid Network Transformer for Multiple Computer Vision Tasks
    Chen, Dong
    Miao, Duoqian
    Zhao, Xuerong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (06) : 8773 - 8785
  • [38] Compact and lightweight panoramic annular lens for computer vision tasks
    Gao, Shaohua
    Sun, Lei
    Jiang, Qi
    Shi, Hao
    Wang, Jia
    Wang, Kaiwei
    Bai, Jian
    OPTICS EXPRESS, 2022, 30 (17) : 29940 - 29956
  • [39] In-and-Out: a data augmentation technique for computer vision tasks
    Li, Chenghao
    Zhang, Jing
    Hu, Li
    Zhao, Hao
    Zhu, Huilong
    Shan, Maomao
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (01)
  • [40] Geological reservoir characterization tasks based on computer vision techniques
    Bomfim, Leticia da Silva
    Soares, Marcus Vinicius Theodoro
    Vidal, Alexandre Campane
    Pedrini, Helio
    MARINE AND PETROLEUM GEOLOGY, 2025, 173