Video object segmentation based on motion-aware ROI prediction and adaptive reference updating

被引:7
|
作者
Fu, Lihua [1 ]
Zhao, Yu [1 ,2 ]
Sun, Xiaowei [1 ]
Huang, Jialiang [1 ]
Wang, Dan [1 ]
Ding, Yu [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
关键词
Video object segmentation; Region of interest prediction; Adaptive reference updating; Siamese network;
D O I
10.1016/j.eswa.2020.114153
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video object segmentation (VOS) is a research hotspot in the field of computer vision. Traditional video object segmentation methods based on deep learning have some problems such as difficulty in adapting to the change of object appearance and low segmentation speed. In this manuscript, we propose a robust VOS method based on motion-aware region of interest (ROI) prediction and adaptive reference updating. Firstly, based on the historical movement trajectory of target region to perceive motion trend dynamically, we predict the motion-aware ROI of target object in the current frame and use it as the input of segmentation network. Then, in order to adapt to the appearance changes of target in the video, the adaptive updating strategy of reference is given to dynamically update the reference frame during the segmentation process. Finally, VOS Siamese network is designed for fast segmentation. Experiments on three public benchmark datasets, DAVIS-2016 and DAVIS-2017, show that the proposed method performs better than the state-of-the-art approaches.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Enlarged Motion-Aware and Frequency-Aware Network for Compressed Video Artifact Reduction
    Liu, Wang
    Gao, Wei
    Li, Ge
    Ma, Siwei
    Zhao, Tiesong
    Yuan, Hui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 10339 - 10352
  • [32] MVFI-Net: Motion-Aware Video Frame Interpolation Network
    Lin, Xuhu
    Zhao, Lili
    Liu, Xi
    Chen, Jianwen
    COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 340 - 356
  • [33] Motion-Aware Dynamic Graph Neural Network for Video Compressive Sensing
    Lu, Ruiying
    Cheng, Ziheng
    Chen, Bo
    Yuan, Xin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 7850 - 7866
  • [34] Delving into Motion-Aware Matching for Monocular 3D Object Tracking
    Huang, Kuan-Chih
    Yang, Ming-Hsuan
    Tsai, Yi-Hsuan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6886 - 6895
  • [35] Saliency-Aware Video Object Segmentation
    Wang, Wenguan
    Shen, Jianbing
    Yang, Ruigang
    Porikli, Fatih
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (01) : 20 - 33
  • [36] Automatic Video Object Segmentation Based on Visual and Motion Saliency
    Peng, Qinmu
    Cheung, Yiu-Ming
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (12) : 3083 - 3094
  • [37] Video object segmentation based on global motion estimation/compensation
    Hsu, CT
    Tsai, YS
    Hsieh, MH
    Chien, YN
    ICCE: 2001 INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, DIGEST OF TECHNICAL PAPERS, 2001, : 168 - 169
  • [38] MoonShot: Towards Controllable Video Generation and Editing with Motion-Aware Multimodal Conditions
    Zhang, David Junhao
    Li, Dongxu
    Le, Hung
    Shou, Mike Zheng
    Xiong, Caiming
    Sahoo, Doyen
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025,
  • [39] MUNet: Motion uncertainty-aware semi-supervised video object segmentation
    Sun, Jiadai
    Mao, Yuxin
    Dai, Yuchao
    Zhong, Yiran
    Wang, Jianyuan
    PATTERN RECOGNITION, 2023, 138
  • [40] Flow Adaptive Video Object Segmentation
    Lin, Fanqing
    Chou, Yao
    Martinez, Tony
    IMAGE AND VISION COMPUTING, 2020, 94