Progressive self-supervised learning: A pre-training method for crowd counting

Cited: 0
Authors
Gu, Yao [1 ]
Zheng, Zhe [1 ]
Wu, Yingna [1 ]
Xie, Guangping [1 ]
Ni, Na [1 ]
Affiliations
[1] ShanghaiTech Univ, Ctr Adapt Syst Engn, Shanghai 201210, Peoples R China
Keywords
Crowd counting; Self-supervised learning; Dataset construction; NETWORK;
DOI
10.1016/j.patrec.2024.12.007
CLC number
TP18 [Theory of Artificial Intelligence];
Discipline codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Crowd counting technologies possess substantial social significance, and deep learning methods are increasingly seen as potent tools for advancing this field. Traditionally, many approaches have sought to enhance model performance by transferring knowledge from ImageNet, utilizing its classification weights to initialize models. However, these pre-training weights are suboptimal for crowd counting, a dense prediction task that differs significantly from image classification. To address these limitations, we introduce a progressive self-supervised learning approach designed to generate more suitable pre-training weights from a large collection of density-related images. We gathered 173k images using custom-designed prompts and implemented a two-stage learning process to refine the feature representations of image patches with similar densities. In the first stage, mutual information between overlapping patches within the same image is maximized. Subsequently, a combination of global and local losses is applied to enhance feature similarity, with the local loss assessing patches from different images of comparable densities. Our pre-training approach demonstrated substantial improvements, reducing the Mean Absolute Error (MAE) by 7.5%, 17.6%, and 28.7% on the ShanghaiTech Part A, ShanghaiTech Part B, and UCF_QNRF datasets, respectively. Furthermore, when these pre-training weights were used to initialize existing models, such as CSRNet for density map regression and DM-Count for point supervision, a significant enhancement in performance was observed.
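The abstract only sketches the two-stage objective (maximizing mutual information between overlapping patches, then adding a local loss over cross-image patches of comparable density). As an illustration only, not the paper's actual implementation, an InfoNCE-style patch-similarity loss combining the two terms might look like the following; the function names, the weighting parameter `w_local`, and the batch pairing scheme are all assumptions:

```python
import numpy as np

def info_nce(anchor, positive, temperature=0.1):
    """InfoNCE-style loss: pulls each anchor patch embedding toward its
    paired positive patch while contrasting against the other patches
    in the batch. Inputs are (B, D) arrays of patch features."""
    a = anchor / np.linalg.norm(anchor, axis=1, keepdims=True)
    p = positive / np.linalg.norm(positive, axis=1, keepdims=True)
    logits = a @ p.T / temperature                   # (B, B) cosine similarities
    logits -= logits.max(axis=1, keepdims=True)      # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))              # positives lie on the diagonal

def progressive_loss(feat, feat_overlap, feat_similar_density, w_local=0.5):
    """Hypothetical stage-2 objective: a global term over overlapping patches
    from the same image plus a local term over patches drawn from different
    images of comparable crowd density."""
    global_loss = info_nce(feat, feat_overlap)
    local_loss = info_nce(feat, feat_similar_density)
    return global_loss + w_local * local_loss
```

Under this reading, stage 1 would train with the global term alone, and stage 2 would add the density-matched local term.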
Pages: 148-154
Page count: 7
Related papers
(50 in total)
  • [21] Self-supervised pre-training on industrial time-series
    Biggio, Luca
    Kastanis, Iason
    2021 8TH SWISS CONFERENCE ON DATA SCIENCE, SDS, 2021, : 56 - 57
  • [22] DiT: Self-supervised Pre-training for Document Image Transformer
    Li, Junlong
    Xu, Yiheng
    Lv, Tengchao
    Cui, Lei
    Zhang, Cha
    Wei, Furu
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3530 - 3539
  • [23] Masked Feature Prediction for Self-Supervised Visual Pre-Training
    Wei, Chen
    Fan, Haoqi
    Xie, Saining
    Wu, Chao-Yuan
    Yuille, Alan
    Feichtenhofer, Christoph
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14648 - 14658
  • [24] CDS: Cross-Domain Self-supervised Pre-training
    Kim, Donghyun
    Saito, Kuniaki
    Oh, Tae-Hyun
    Plummer, Bryan A.
    Sclaroff, Stan
    Saenko, Kate
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9103 - 9112
  • [25] FALL DETECTION USING SELF-SUPERVISED PRE-TRAINING MODEL
    Yhdego, Haben
    Audette, Michel
    Paolini, Christopher
    PROCEEDINGS OF THE 2022 ANNUAL MODELING AND SIMULATION CONFERENCE (ANNSIM'22), 2022, : 361 - 371
  • [26] MEASURING THE IMPACT OF DOMAIN FACTORS IN SELF-SUPERVISED PRE-TRAINING
    Sanabria, Ramon
    Hsu, Wei-Ning
    Baevski, Alexei
    Auli, Michael
    2023 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW, 2023,
  • [27] Correlational Image Modeling for Self-Supervised Visual Pre-Training
    Li, Wei
    Xie, Jiahao
    Loy, Chen Change
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15105 - 15115
  • [28] Contrastive Self-Supervised Pre-Training for Video Quality Assessment
    Chen, Pengfei
    Li, Leida
    Wu, Jinjian
    Dong, Weisheng
    Shi, Guangming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 458 - 471
  • [29] AN ADAPTER BASED PRE-TRAINING FOR EFFICIENT AND SCALABLE SELF-SUPERVISED SPEECH REPRESENTATION LEARNING
    Kessler, Samuel
    Thomas, Bethan
    Karout, Salah
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3179 - 3183
  • [30] Self-supervised graph neural network with pre-training generative learning for recommendation systems
    Min, Xin
    Li, Wei
    Yang, Jinzhao
    Xie, Weidong
    Zhao, Dazhe
    SCIENTIFIC REPORTS, 2022, 12 (01)