Progressive self-supervised learning: A pre-training method for crowd counting

Cited by: 0
Authors
Gu, Yao [1 ]
Zheng, Zhe [1 ]
Wu, Yingna [1 ]
Xie, Guangping [1 ]
Ni, Na [1 ]
Affiliations
[1] ShanghaiTech Univ, Ctr Adapt Syst Engn, Shanghai 201210, Peoples R China
Keywords
Crowd counting; Self-supervised learning; Dataset construction; Network
DOI
10.1016/j.patrec.2024.12.007
CLC number
TP18 [Theory of Artificial Intelligence]
Discipline codes
081104; 0812; 0835; 1405
Abstract
Crowd counting technologies possess substantial social significance, and deep learning methods are increasingly seen as potent tools for advancing this field. Traditionally, many approaches have sought to enhance model performance by transferring knowledge from ImageNet, using its classification weights to initialize models. However, these pre-training weights are suboptimal for crowd counting, a dense-prediction task that differs significantly from image classification. To address these limitations, we introduce a progressive self-supervised learning approach designed to generate more suitable pre-training weights from a large collection of density-related images. We gathered 173k images using custom-designed prompts and implemented a two-stage learning process to refine the feature representations of image patches with similar densities. In the first stage, mutual information between overlapping patches within the same image is maximized. In the second stage, a combination of global and local losses is applied to enhance feature similarity, with the local loss comparing patches from different images of comparable densities. Our pre-training approach demonstrated substantial improvements, reducing the Mean Absolute Error (MAE) by 7.5%, 17.6%, and 28.7% on the ShanghaiTech Part A, ShanghaiTech Part B, and UCF_QNRF datasets, respectively. Furthermore, when these pre-training weights were used to initialize existing models, such as CSRNet for density map regression and DM-Count for point supervision, a significant performance improvement was observed.
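As a rough illustration of the first-stage objective described above (maximizing mutual information between overlapping patches of the same image), the sketch below uses an InfoNCE-style contrastive loss over patch embeddings. The function name, batch layout, embedding size, and temperature are assumptions made for illustration only; the record does not specify the paper's exact loss formulation or backbone.

```python
import torch
import torch.nn.functional as F

def info_nce_loss(anchor_feats, positive_feats, temperature=0.07):
    """InfoNCE-style loss (assumed formulation): pulls each anchor patch
    embedding toward the embedding of its overlapping (positive) patch and
    pushes it away from all other patches in the batch.
    Both inputs have shape (B, D)."""
    anchor = F.normalize(anchor_feats, dim=1)
    positive = F.normalize(positive_feats, dim=1)
    logits = anchor @ positive.t() / temperature          # (B, B) similarity matrix
    targets = torch.arange(anchor.size(0), device=anchor.device)
    return F.cross_entropy(logits, targets)

# Usage sketch: embeddings of two overlapping crops from the same image,
# produced by a shared encoder with a projection head (hypothetical setup).
B, D = 32, 128
z_a = torch.randn(B, D)   # embeddings of crop A
z_b = torch.randn(B, D)   # embeddings of crop B, overlapping with A
loss = info_nce_loss(z_a, z_b)
```

The second stage described in the abstract would, in the same spirit, add a local term that treats patches of comparable density drawn from different images as additional positives; that extension is not sketched here because the record gives no details of its weighting.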
Pages: 148 - 154
Number of pages: 7