POLARDB: FORMULA-DRIVEN DATASET FOR PRE-TRAINING TRAJECTORY ENCODERS

被引:0
|
作者
Miyamoto, Sota [1 ]
Yagi, Takuma [3 ]
Makimoto, Yuto [1 ]
Ukai, Mahiro [1 ]
Ushiku, Yoshitaka [2 ]
Hashimoto, Atsushi [2 ]
Inoue, Nakamasa [1 ]
机构
[1] Tokyo Inst Technol, Tokyo, Japan
[2] OMRON SINIC X Corp, Bunkyo, Japan
[3] Natl Inst Adv Ind Sci & Technol, Ibaraki, Japan
来源
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024 | 2024年
关键词
Formula-driven supervised learning; Polar equations; Fine-grained action recognition; Cutting method recognition;
D O I
10.1109/ICASSP48485.2024.10448448
中图分类号
学科分类号
摘要
Formula-driven supervised learning (FDSL) is a growing research topic for finding simple mathematical formulas that generate synthetic data and labels for pre-training neural networks. The main advantage of FDSL is that there is no risk of generating data with ethical implications such as gender bias and racial bias because it does not rely on real data as discussed in previous studies using fractals and polygons for pre-training image encoders. While FDSL has been proposed for pre-training image encoders, it has not been considered for temporal trajectory data. In this paper, we introduce PolarDB, the first formula-driven dataset for pre-training trajectory encoders with an application to fine-grained cutting-method recognition using hand trajectories. More specifically, we generate 270k trajectories for 432 categories on the basis of polar equations and use them to pre-train a Transformer-based trajectory encoder in an FDSL manner. In the experiments, we show that pre-training on PolarDB improves the accuracy of fine-grained cutting-method recognition on cooking videos of EPIC-KITCHEN and Ego4D datasets, where the pre-trained trajectory encoder is used as a plug-in module for a video recognition network.
引用
收藏
页码:5465 / 5469
页数:5
相关论文
共 47 条
  • [21] Trajectory-based fish event classification through pre-training with diffusion models
    Canovi, Noemi
    Ellis, Benjamin A.
    Sordalen, Tonje K.
    Allken, Vaneeda
    Halvorsen, Kim T.
    Malde, Ketil
    Beyan, Cigdem
    ECOLOGICAL INFORMATICS, 2024, 82
  • [22] Boosting UAV Tracking With Voxel-Based Trajectory-Aware Pre-Training
    Li, Sihang
    Fu, Changhong
    Lu, Kunhan
    Zuo, Haobo
    Li, Yiming
    Feng, Chen
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (02) : 1133 - 1140
  • [23] Pre-training dataset generation for classifying beams of vehicle headlights from nighttime camera image
    Oyabu, Tatsuya
    Sultana, Rebeka
    Ohashi, Gosuke
    FIFTEENTH INTERNATIONAL CONFERENCE ON QUALITY CONTROL BY ARTIFICIAL VISION, 2021, 11794
  • [24] Pre-Training Protein Encoder via Siamese Sequence-Structure Diffusion Trajectory Prediction
    Zhang, Zuobai
    Xu, Minghao
    Lozano, Aurelie
    Chenthamarakshan, Vijil
    Das, Payel
    Tang, Jian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [25] Behavior-Pred: A semantic-enhanced trajectory pre-training framework for motion forecasting
    Shi, Jianxin
    Chen, Jinhao
    Wang, Yuandong
    Feng, Tao
    Yang, Zhen
    Wo, Tianyu
    Information Fusion, 2025, 120
  • [26] Beyond the Sequence: Statistics-Driven Pre-training for Stabilizing Sequential Recommendation Model
    Wang, Sirui
    Li, Peiguang
    Xian, Yunsen
    Zhang, Hongzhi
    PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 723 - 729
  • [27] Multi-Faceted Knowledge-Driven Pre-Training for Product Representation Learning
    Zhang, Denghui
    Liu, Yanchi
    Yuan, Zixuan
    Fu, Yanjie
    Chen, Haifeng
    Xiong, Hui
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (07) : 7239 - 7250
  • [28] Mutual information-driven self-supervised point cloud pre-training
    Xu, Weichen
    Fu, Tianhao
    Cao, Jian
    Zhao, Xinyu
    Xu, Xinxin
    Cao, Xixin
    Zhang, Xing
    KNOWLEDGE-BASED SYSTEMS, 2025, 307
  • [29] Self-supervised Transformer-Based Pre-training Method with General Plant Infection Dataset
    Wang, Zhengle
    Wang, Ruifeng
    Wang, Minjuan
    Lai, Tianyun
    Zhang, Man
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT II, 2025, 15032 : 189 - 202
  • [30] POS-BERT: Point cloud one-stage BERT pre-training[Formula presented]
    Fu, Kexue
    Gao, Peng
    Liu, Shaolei
    Qu, Linhao
    Gao, Longxiang
    Wang, Manning
    Expert Systems with Applications, 2024, 240