InvFlow: Involution and multi-scale interaction for unsupervised learning of optical flow

被引:0
|
作者
Xiang, Xuezhi [1 ,2 ]
Abdein, Rokia [1 ,2 ]
Lv, Ning [1 ,2 ]
El Saddik, Abdulmotaleb [3 ]
机构
[1] Harbin Engn Univ, Sch Informat & Commun Engn, Harbin 150001, Peoples R China
[2] Minist Ind & Informat Technol, Key Lab Adv Marine Commun & Informat Technol, Harbin 150001, Peoples R China
[3] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON K1N 6N5, Canada
基金
黑龙江省自然科学基金; 中国国家自然科学基金;
关键词
Unsupervised optical flow estimation; Involution; Feature interaction; Self-attention; Deformable convolution;
D O I
10.1016/j.patcog.2023.109918
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The convolution neural network is still the main tool for extracting the image features and the motion features for most of the optical flow models. The convolution neural networks cannot model the long-range dependencies, and more details are lost in deeper layers. All the deficiencies in the extracted features affect the estimated flow. Therefore, in this work, we concentrated on optimizing the convolution neural network in both the encoder and decoder parts to improve the image and motion features. To enhance the image features, we utilize the involution to provide rich features and model the long-range dependencies. In addition, we propose a Multi-Scale-Interaction module which utilizes the self-attention to make an interaction between the feature scales to avoid detail loss. Additionally, we propose a Motion-Features-Optimization block that utilizes the deformable convolution to enhance the motion features. Our model achieves the state-of-the-art performance on Sintel and KITTI 2015 benchmarks.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] MRDFlow: Unsupervised Optical Flow Estimation Network With Multi-Scale Recurrent Decoder
    Zhao, Rui
    Xiong, Ruiqin
    Ding, Ziluo
    Fan, Xiaopeng
    Zhang, Jian
    Huang, Tiejun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4639 - 4652
  • [2] Advancing unsupervised anomaly detection with normalizing flow and multi-scale ensemble learning
    Campos-Romero, Miguel
    Carranza-Garcia, Manuel
    Riquelme, Jose C.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
  • [3] A multi-scale unsupervised learning for deformable image registration
    Shao, Shuwei
    Pei, Zhongcai
    Chen, Weihai
    Zhu, Wentao
    Wu, Xingming
    Zhang, Baochang
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (01) : 157 - 166
  • [4] A multi-scale unsupervised learning for deformable image registration
    Shuwei Shao
    Zhongcai Pei
    Weihai Chen
    Wentao Zhu
    Xingming Wu
    Baochang Zhang
    International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 157 - 166
  • [5] Deep Optical Flow Estimation Via Multi-Scale Correspondence Structure Learning
    Zhao, Shanshan
    Li, Xi
    Bourahla, Omar El Farouk
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3490 - 3496
  • [6] Unsupervised Learning of Depth Estimation and Camera Pose With Multi-Scale GANs
    Xu, Yufan
    Wang, Yan
    Huang, Rui
    Lei, Zeyu
    Yang, Junyao
    Li, Zijian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 17039 - 17047
  • [7] Unsupervised dehazing of multi-scale residuals based on weighted contrastive learning
    Jianing Wang
    Yongsheng zhang
    Zuoyang Liu
    Signal, Image and Video Processing, 2025, 19 (6)
  • [8] Multi-scale interaction of particulate flow and the artery wall
    Halliday, I.
    Atherton, M.
    Care, C. M.
    Collins, M. W.
    Evans, D.
    Evans, P. C.
    Hose, D. R.
    Khir, A. W.
    Koenig, C. S.
    Krams, R.
    Lawford, P. V.
    Lishchuk, S. V.
    Pontrelli, G.
    Ridger, V.
    Spencer, T. J.
    Ventikos, Y.
    Walker, D. C.
    Watton, P. N.
    MEDICAL ENGINEERING & PHYSICS, 2011, 33 (07) : 840 - 848
  • [9] Gradient Consistency Based Multi-Scale Optical Flow
    Gray, James L.
    Naman, Aous T.
    Taubman, David S.
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [10] Unsupervised Learning of Multi-Frame Optical Flow with Occlusions
    Janai, Joel
    Guney, Fatma
    Ranjan, Anurag
    Black, Michael
    Geiger, Andreas
    COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 713 - 731