Fractal geometry-based automatic generation of large-scale image database for pre-training in 3D object recognition

被引:0
|
作者
Yamada R.
Okayasu K.
Nakamura A.
Kataoka H.
机构
关键词
3D object recognition; Deep learning; Fractal; Pre-training; Supervised learning;
D O I
10.2493/jjspe.87.374
中图分类号
学科分类号
摘要
We propose the generation method of large-scale image database for pre-training in 3D object recognition. The method is inspired from the principles of nature law. We adopt fractal geometry to represent the principles and build the Fractal Data Base random search (FractalDBrs). In contrast to traditional image database such as ImageNet, Iterated Function System (IFS) automatically generates large amount of image data to build the proposed FractalDBrs in short time without menial labors such as collecting and annotating images. In the experiments, we utilized the FractalDBrs and traditional databases; ImageNet, CIFAR100, Caltech256, or Places365, for pre-training in 3D object recognition with ModelNet40. The model pre-trained with FractalDBrs achieved the highest discrimination accuracy of 97.12% against the second highest accuracy of 96.43% with ImageNet. For reference, the model trained from scratch achieved 91.53% discrimination accuracy. We have verified the effectiveness of the proposed fractal geometry-based image database for pre-training in 3D object recognition. © 2021 Japan Society for Precision Engineering. All rights reserved.
引用
收藏
页码:374 / 379
页数:5
相关论文
共 50 条
  • [1] Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception
    Chen, Haoming
    Zhang, Zhizhong
    Qu, Yanyun
    Zhang, Ruixin
    Tan, Xin
    Xie, Yuan
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 19925 - 19935
  • [2] A Large-Scale 3D Object Recognition dataset
    Solund, Thomas
    Buch, Anders Glent
    Kruger, Norbert
    Aanaes, Henrik
    PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 73 - 82
  • [3] BigBIRD: A Large-Scale 3D Database of Object Instances
    Singh, Arjun
    Sha, James
    Narayan, Karthik S.
    Achim, Tudor
    Abbeel, Pieter
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 509 - 516
  • [4] BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
    Cai, Likun
    Zhang, Zhi
    Zhu, Yi
    Zhang, Li
    Li, Mu
    Xue, Xiangyang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4776 - 4786
  • [5] ObjectNet3D: A Large Scale Database for 3D Object Recognition
    Xiang, Yu
    Kim, Wonhui
    Chen, Wei
    Ji, Jingwei
    Choy, Christopher
    Su, Hao
    Mottaghi, Roozbeh
    Guibas, Leonidas
    Savarese, Silvio
    COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 160 - 176
  • [6] Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition
    Masumura, Ryo
    Makishima, Naoki
    Ihori, Mana
    Takashima, Akihiko
    Tanaka, Tomohiro
    Orihashi, Shota
    INTERSPEECH 2020, 2020, : 2822 - 2826
  • [7] DIALOGPT : Large-Scale Generative Pre-training for Conversational Response Generation
    Zhang, Yizhe
    Sun, Siqi
    Galley, Michel
    Chen, Yen-Chun
    Brockett, Chris
    Gao, Xiang
    Gao, Jianfeng
    Liu, Jingjing
    Dolan, Bill
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): SYSTEM DEMONSTRATIONS, 2020, : 270 - 278
  • [8] Large-scale weakly-supervised pre-training for video action recognition
    Ghadiyaram, Deepti
    Du Tran
    Mahajan, Dhruv
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12038 - 12047
  • [9] SelfPAB: large-scale pre-training on accelerometer data for human activity recognition
    Logacjov, Aleksej
    Herland, Sverre
    Ustad, Astrid
    Bach, Kerstin
    APPLIED INTELLIGENCE, 2024, 54 (06) : 4545 - 4563
  • [10] SelfPAB: large-scale pre-training on accelerometer data for human activity recognition
    Aleksej Logacjov
    Sverre Herland
    Astrid Ustad
    Kerstin Bach
    Applied Intelligence, 2024, 54 : 4545 - 4563