Event Camera Data Pre-training

被引：0

作者：

Yang, Yan ^{[1
]}

Pan, Liyuan ^{[2
,3
]}

Liu, Liu ^{[4
]}

机构：

[1] Australian Natl Univ, BDSI, Canberra, ACT, Australia

[2] BITSZ, Beijing, Peoples R China

[3] BIT, Sch CSAT, Beijing, Peoples R China

[4] Huawei, Cyberverse Dept, Shenzhen, Peoples R China

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/ICCV51070.2023.00982

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a pre-trained neural network for handling event camera data. Our model is a self-supervised learning framework, and uses paired event camera data and natural RGB images for training. Our method contains three modules connected in a sequence: i) a family of event data augmentations, generating meaningful event images for self-supervised training; ii) a conditional masking strategy to sample informative event patches from event images, encouraging our model to capture the spatial layout of a scene and accelerating training; iii) a contrastive learning approach, enforcing the similarity of embeddings between matching event images, and between paired event and RGB images. An embedding projection loss is proposed to avoid the model collapse when enforcing the event image embedding similarities. A probability distribution alignment loss is proposed to encourage the event image to be consistent with its paired RGB image in the feature space. Transfer learning performance on downstream tasks shows the superiority of our method over state-of-the-art methods. For example, we achieve top-1 accuracy at 64.83% on the N-ImageNet dataset. Our code is available at https://github.com/Yan98/Event-Camera-Data-Pre-training.

引用

下载

页码：10665 / 10675

页数：11

共 50 条

[41] Speech Pre-training with Acoustic Piece
Ren, Shuo
Liu, Shujie
Wu, Yu
Zhou, Long
Wei, Furu
INTERSPEECH 2022, 2022, : 2648 - 2652
[42] Unsupervised Pre-Training for Detection Transformers
Dai, Zhigang
Cai, Bolun
Lin, Yugeng
Chen, Junying
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 12772 - 12782
[43] Structural Pre-training for Dialogue Comprehension
Zhang, Zhuosheng
Zhao, Hai
59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 5134 - 5145
[44] Simulated SAR for ATR pre-training
Willis, Christopher J.
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS III, 2021, 11870
[45] Robot Learning with Sensorimotor Pre-training
Radosavovic, Ilija
Shi, Baifeng
Fu, Letian
Goldberg, Ken
Darrell, Trevor
Malik, Jitendra
CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
[46] Rethinking pre-training on medical imaging
Wen, Yang
Chen, Leiting
Deng, Yu
Zhou, Chuan
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 78
[47] Pre-training Methods in Information Retrieval
Fan, Yixing
Xie, Xiaohui
Cai, Yinqiong
Chen, Jia
Ma, Xinyu
Li, Xiangsheng
Zhang, Ruqing
Guo, Jiafeng
FOUNDATIONS AND TRENDS IN INFORMATION RETRIEVAL, 2022, 16 (03): : 178 - 317
[48] Quality Diversity for Visual Pre-Training
Chavhan, Ruchika
Gouk, Henry
Li, Da
Hospedales, Timothy
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5361 - 5371
[49] Ontology Pre-training for Poison Prediction
Glauer, Martin
Neuhaus, Fabian
Mossakowski, Till
Hastings, Janna
ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2023, 2023, 14236 : 31 - 45
[50] Realistic Channel Models Pre-training
Huangfu, Yourui
Wang, Jian
Xu, Chen
Li, Rong
Ge, Yiqun
Wang, Xianbin
Zhang, Huazi
Wang, Jun
2019 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2019,

← 1 2 3 4 5 →