SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model

被引:0
|
作者
Zhou, Shili [1 ]
He, Ruian [1 ]
Tan, Weimin [1 ]
Yan, Bo [1 ]
机构
[1] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Sch Comp Sci, Shanghai, Peoples R China
基金
上海市自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optical Flow Estimation aims to find the 2D dense motion field between two frames. Due to the limitation of model structures and training datasets, existing methods often rely too much on local clues and ignore the integrity of objects, resulting in fragmented motion estimation. Through theoretical analysis, we find the pre-trained large vision models are helpful in optical flow estimation, and we notice that the recently famous Segment Anything Model (SAM) demonstrates a strong ability to segment complete objects, which is suitable for solving the fragmentation problem. We thus propose a solution to embed the frozen SAM image encoder into FlowFormer to enhance object perception. To address the challenge of in-depth utilizing SAM in non-segmentation tasks like optical flow estimation, we propose an Optical Flow Task-Specific Adaption scheme, including a Context Fusion Module to fuse the SAM encoder with the optical flow context encoder, and a Context Adaption Module to adapt the SAM features for optical flow task with Learned Task-Specific Embedding. Our proposed SAMFlow model reaches 0.86/2.10 clean/final EPE and 3.55/12.32 EPE/F1-all on Sintel and KITTI-15 training set, surpassing Flowformer by 8.5%/9.9% and 13.2%/16.3%. Furthermore, our model achieves state-of-the-art performance on the Sintel and KITTI-15 benchmarks, ranking #1 among all two-frame methods on Sintel clean pass.
引用
收藏
页码:7695 / 7703
页数:9
相关论文
共 50 条
  • [1] Detect Any Shadow: Segment Anything for Video Shadow Detection
    Wang, Yonghui
    Zhou, Wengang
    Mao, Yunyao
    Li, Houqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3782 - 3794
  • [2] MeSAM: Multiscale Enhanced Segment Anything Model for Optical Remote Sensing Images
    Zhou, Xichuan
    Liang, Fu
    Chen, Lihui
    Liu, Haijun
    Song, Qianqian
    Vivone, Gemine
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [3] Segment Anything Model Can Not Segment Anything: Assessing AI Foundation Model's Generalizability in Permafrost Mapping
    Li, Wenwen
    Hsu, Chia-Yu
    Wang, Sizhe
    Yang, Yezhou
    Lee, Hyunho
    Liljedahl, Anna
    Witharana, Chandi
    Yang, Yili
    Rogers, Brendan M.
    Arundel, Samantha T.
    Jones, Matthew B.
    McHenry, Kenton
    Solis, Patricia
    REMOTE SENSING, 2024, 16 (05)
  • [4] Segment anything model for medical images?
    Huang, Yuhao
    Yang, Xin
    Liu, Lian
    Zhou, Han
    Chang, Ao
    Zhou, Xinrui
    Chen, Rusi
    Yu, Junxuan
    Chen, Jiongquan
    Chen, Chaoyu
    Liu, Sijing
    Chi, Haozhe
    Hu, Xindi
    Yue, Kejuan
    Li, Lei
    Grau, Vicente
    Fan, Deng-Ping
    Dong, Fajin
    Ni, Dong
    MEDICAL IMAGE ANALYSIS, 2024, 92
  • [5] Matte anything: Interactive natural image matting with segment anything model
    Yao, Jingfeng
    Wang, Xinggang
    Ye, Lang
    Liu, Wenyu
    IMAGE AND VISION COMPUTING, 2024, 147
  • [6] BubSAM: Bubble segmentation and shape reconstruction based on Segment Anything Model of bubbly flow
    Xu, Haohan
    Feng, Xin
    Pu, Yuqi
    Wang, Xiaoyue
    Huang, Dingwang
    Zhang, Weipeng
    Duan, Xiaoxia
    Chen, Jie
    Yang, Chao
    AICHE JOURNAL, 2024,
  • [7] Detect Any Deepfakes: Segment Anything Meets Face Forgery Detection and Localization
    Lai, Yingxin
    Luo, Zhiming
    Yu, Zitong
    BIOMETRIC RECOGNITION, CCBR 2023, 2023, 14463 : 180 - 190
  • [8] Explain Any Concept: Segment Anything Meets Concept-Based Explanation
    Sun, Ao
    Ma, Pingchuan
    Yuan, Yuanyuan
    Wang, Shuai
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [9] Segment Any Medical Model Extended
    Liu, Yihao
    Zhang, Jiaming
    Diaz-Pinto, Andres
    Li, Haowei
    Martin-Gomez, Alejandro
    Kheradmand, Amir
    Armand, Mehran
    MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
  • [10] Make Segment Anything Model Perfect on Shadow Detection
    Chen, Xiao-Diao
    Wu, Wen
    Yang, Wenya
    Qin, Hongshuai
    Wu, Xiantao
    Mao, Xiaoyang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 13