SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model

Cited by: 0

Authors:

Zhou, Shili [1]

He, Ruian [1]

Tan, Weimin [1]

Yan, Bo [1]

Affiliations:

[1] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Sch Comp Sci, Shanghai, Peoples R China

Source:

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7 | 2024

Funding:

Natural Science Foundation of Shanghai;

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Optical flow estimation aims to find the dense 2D motion field between two frames. Because of limitations in model structures and training datasets, existing methods often rely too heavily on local cues and ignore the integrity of objects, which results in fragmented motion estimates. Through theoretical analysis, we find that pre-trained large vision models are helpful for optical flow estimation, and we observe that the recently introduced Segment Anything Model (SAM) demonstrates a strong ability to segment complete objects, which makes it well suited to the fragmentation problem. We therefore propose embedding the frozen SAM image encoder into FlowFormer to enhance object perception. To address the challenge of making in-depth use of SAM in non-segmentation tasks such as optical flow estimation, we propose an Optical Flow Task-Specific Adaption scheme, which includes a Context Fusion Module that fuses the SAM encoder with the optical flow context encoder, and a Context Adaption Module that adapts the SAM features to the optical flow task with a Learned Task-Specific Embedding. Our proposed SAMFlow model reaches 0.86/2.10 clean/final EPE on the Sintel training set and 3.55/12.32 EPE/F1-all on the KITTI-15 training set, surpassing FlowFormer by 8.5%/9.9% and 13.2%/16.3%, respectively. Furthermore, our model achieves state-of-the-art performance on the Sintel and KITTI-15 benchmarks, ranking #1 among all two-frame methods on the Sintel clean pass.
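
The abstract reports results as EPE and F1-all without defining them. As a rough illustration only, the sketch below computes both on a tiny hand-made flow field. It assumes the commonly used definitions: EPE as the mean Euclidean distance between predicted and ground-truth flow vectors, and F1-all as the percentage of pixels whose error exceeds both 3 pixels and 5% of the ground-truth magnitude. The function names, the thresholds as parameters, and the toy data are illustrative assumptions and are not taken from the paper; the official benchmark tooling also restricts the count to valid pixels, which is omitted here.

```python
import math

def endpoint_errors(pred, gt):
    """Per-pixel Euclidean distance between predicted and ground-truth flow vectors."""
    return [math.hypot(pu - gu, pv - gv) for (pu, pv), (gu, gv) in zip(pred, gt)]

def epe(pred, gt):
    """Average endpoint error over all pixels."""
    errs = endpoint_errors(pred, gt)
    return sum(errs) / len(errs)

def f1_all(pred, gt, abs_thresh=3.0, rel_thresh=0.05):
    """Percentage of outlier pixels (assumed criterion: error above both thresholds)."""
    errs = endpoint_errors(pred, gt)
    outliers = 0
    for err, (gu, gv) in zip(errs, gt):
        gt_magnitude = math.hypot(gu, gv)
        if err > abs_thresh and err > rel_thresh * gt_magnitude:
            outliers += 1
    return 100.0 * outliers / len(errs)

# Toy data: four pixels, each flow vector given as (u, v) in pixels.
gt = [(10.0, 0.0), (0.0, 0.0), (3.0, 4.0), (100.0, 0.0)]
pred = [(10.0, 0.0), (3.0, 4.0), (3.0, 4.0), (96.0, 0.0)]

print(epe(pred, gt))     # per-pixel errors are 0, 5, 0, 4
print(f1_all(pred, gt))  # only the second pixel passes both thresholds
```

In this toy case the last pixel has an error of 4 pixels but is not counted as an outlier, because 4 is below 5% of its ground-truth magnitude of 100. That relative threshold is why F1-all can move differently from EPE on scenes with large motion.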

Pages: 7695-7703

Page count: 9
