Pseudo-labeling with keyword refining for few-supervised video captioning

被引:0
|
作者
Li, Ping [1 ]
Wang, Tao [1 ]
Zhao, Xinkui [2 ]
Xu, Xianghua [1 ]
Song, Mingli [3 ]
机构
[1] School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
[2] School of Software Technology, Zhejiang University, Ningbo, China
[3] College of Computer Science, Zhejiang University, Hangzhou, China
基金
中国国家自然科学基金;
关键词
Semantics;
D O I
10.1016/j.patcog.2024.111176
中图分类号
学科分类号
摘要
Video captioning generate a sentence that describes the video content. Existing methods always require a number of captions (e.g., 10 or 20) per video to train the model, which is quite costly. In this work, we explore the possibility of using only one or very few ground-truth sentences, and introduce a new task named few-supervised video captioning. Specifically, we propose a few-supervised video captioning framework that consists of lexically constrained pseudo-labeling module and keyword-refined captioning module. Unlike the random sampling in natural language processing that may cause invalid modifications (i.e., edit words), the former module guides the model to edit words using some actions (e.g., copy, replace, insert, and delete) by a pretrained token-level classifier, and then fine-tunes candidate sentences by a pretrained language model. Meanwhile, the former employs the repetition penalized sampling to encourage the model to yield concise pseudo-labeled sentences with less repetition, and selects the most relevant sentences upon a pretrained video-text model. Moreover, to keep semantic consistency between pseudo-labeled sentences and video content, we develop the transformer-based keyword refiner with the video-keyword gated fusion strategy to emphasize more on relevant words. Extensive experiments on several benchmarks demonstrate the advantages of the proposed approach in both few-supervised and fully-supervised scenarios. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [41] One-Shot Learning with Pseudo-Labeling for Cattle Video Segmentation in Smart Livestock Farming
    Qiao, Yongliang
    Xue, Tengfei
    Kong, He
    Clark, Cameron
    Lomax, Sabrina
    Rafique, Khalid
    Sukkarieh, Salah
    [J]. ANIMALS, 2022, 12 (05):
  • [42] Class-Distribution-Aware Pseudo-Labeling for Semi-Supervised Multi-Label Learning
    Xie, Ming-Kun
    Xiao, Jia-Hao
    Liu, Hao-Zhe
    Niu, Gang
    Sugiyama, Masashi
    Huang, Sheng-Jun
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [43] SPLAL: Similarity-based pseudo-labeling with alignment loss for semi-supervised medical image classification
    Mahmood, Md Junaid
    Raj, Pranaw
    Agarwal, Divyansh
    Kumari, Suruchi
    Singh, Pravendra
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [44] A semi-supervised medical image classification method based on combined pseudo-labeling and distance metric consistency
    Boya Ke
    Huijuan Lu
    Cunqian You
    Wenjie Zhu
    Li Xie
    Yudong Yao
    [J]. Multimedia Tools and Applications, 2024, 83 : 33313 - 33331
  • [45] Semi-supervised medical image classification with adaptive threshold pseudo-labeling and unreliable sample contrastive loss
    Peng, Zhen
    Tian, Shengwei
    Yu, Long
    Zhang, Dezhi
    Wu, Weidong
    Zhou, Shaofeng
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [46] Semi-Supervised Learning for Low-light Image Restoration through Quality Assisted Pseudo-Labeling
    Malik, Sameer
    Soundararajan, Rajiv
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4094 - 4103
  • [47] A semi-supervised medical image classification method based on combined pseudo-labeling and distance metric consistency
    Ke, Boya
    Lu, Huijuan
    You, Cunqian
    Zhu, Wenjie
    Xie, Li
    Yao, Yudong
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 33313 - 33331
  • [48] Class-Aware Pseudo-Labeling for Non-Random Missing Labels in Semi-Supervised Learning
    Gui, Qian
    Wu, Xinting
    Niu, Baoning
    [J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2023, 17 (04) : 531 - 543
  • [49] AdaptMatch: Adaptive Consistency Regularization for Semi-supervised Learning with Top-k Pseudo-labeling and Contrastive Learning
    Yang, Nan
    Huang, Fan
    Yuan, Dong
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT I, 2024, 14471 : 227 - 238
  • [50] DCRP: Class-Aware Feature Diffusion Constraint and Reliable Pseudo-Labeling for Imbalanced Semi-Supervised Learning
    Guo, Xiaoyu
    Wei, Xiang
    Zhang, Shunli
    Lu, Wei
    Xing, Weiwei
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7146 - 7159