SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model

被引:0
|
作者
Zhou, Shili [1 ]
He, Ruian [1 ]
Tan, Weimin [1 ]
Yan, Bo [1 ]
机构
[1] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Sch Comp Sci, Shanghai, Peoples R China
基金
上海市自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optical Flow Estimation aims to find the 2D dense motion field between two frames. Due to the limitation of model structures and training datasets, existing methods often rely too much on local clues and ignore the integrity of objects, resulting in fragmented motion estimation. Through theoretical analysis, we find the pre-trained large vision models are helpful in optical flow estimation, and we notice that the recently famous Segment Anything Model (SAM) demonstrates a strong ability to segment complete objects, which is suitable for solving the fragmentation problem. We thus propose a solution to embed the frozen SAM image encoder into FlowFormer to enhance object perception. To address the challenge of in-depth utilizing SAM in non-segmentation tasks like optical flow estimation, we propose an Optical Flow Task-Specific Adaption scheme, including a Context Fusion Module to fuse the SAM encoder with the optical flow context encoder, and a Context Adaption Module to adapt the SAM features for optical flow task with Learned Task-Specific Embedding. Our proposed SAMFlow model reaches 0.86/2.10 clean/final EPE and 3.55/12.32 EPE/F1-all on Sintel and KITTI-15 training set, surpassing Flowformer by 8.5%/9.9% and 13.2%/16.3%. Furthermore, our model achieves state-of-the-art performance on the Sintel and KITTI-15 benchmarks, ranking #1 among all two-frame methods on Sintel clean pass.
引用
收藏
页码:7695 / 7703
页数:9
相关论文
共 50 条
  • [31] PESAM: Privacy-Enhanced Segment Anything Model for Medical Image Segmentation
    Cai, Jiuyun
    Niu, Ke
    Pan, Yijie
    Tai, Wenjuan
    Han, Jiacheng
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT II, ICIC 2024, 2024, 14863 : 94 - 105
  • [32] COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detection
    Xiaoqin ZHANG
    Zhenni YU
    Li ZHAO
    DengPing FAN
    Guobao XIAO
    Science China(Information Sciences), 2025, 68 (01) : 189 - 203
  • [34] Labeling Construction, Renovation, and Demolition Waste through Segment Anything Model (SAM)
    Panizza, Rafaela Orenga
    Allam, Amr S.
    Kasliwal, Aparimit
    Nik-Bakht, Mazdak
    CONSTRUCTION RESEARCH CONGRESS 2024: ADVANCED TECHNOLOGIES, AUTOMATION, AND COMPUTER APPLICATIONS IN CONSTRUCTION, 2024, : 279 - 288
  • [35] SMALNet: Segment Anything Model Aided Lightweight Network for Infrared Image Segmentation
    Ding, Kun
    Xiang, Shiming
    Pan, Chunhong
    INFRARED PHYSICS & TECHNOLOGY, 2024, 142
  • [36] LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model
    Cao, Yuxin
    Li, Jinghao
    Xiao, Xi
    Wang, Derui
    Xue, Minhui
    Ge, Hao
    Liu, Wei
    Hu, Guangwu
    PROCEEDINGS 45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS, SPW 2024, 2024, : 48 - 56
  • [37] Intraoperative Stereovision Cortical Surface Segmentation Using Fast Segment Anything Model
    Li, Chengpei
    Fan, Xiaoyao
    Duke, Ryan
    Chen, Kristen
    Evans, Linton T.
    Paulsen, Keith
    IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, MEDICAL IMAGING 2024, 2024, 12928
  • [38] A Novel Universal Image Forensics Localization Model Based on Image Noise and Segment Anything Model
    Su, Yang
    Tan, Shunquan
    Huang, Jiwu
    PROCEEDINGS OF THE 2024 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH&MMSEC 2024, 2024, : 149 - 158
  • [39] Adapting Segment Anything Model for Change Detection in VHR Remote Sensing Images
    Ding, Lei
    Zhu, Kun
    Peng, Daifeng
    Tang, Hao
    Yang, Kuiwu
    Bruzzone, Lorenzo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 11