Fractal geometry-based automatic generation of large-scale image database for pre-training in 3D object recognition

被引:0
|
作者
Yamada R.
Okayasu K.
Nakamura A.
Kataoka H.
机构
关键词
3D object recognition; Deep learning; Fractal; Pre-training; Supervised learning;
D O I
10.2493/jjspe.87.374
中图分类号
学科分类号
摘要
We propose the generation method of large-scale image database for pre-training in 3D object recognition. The method is inspired from the principles of nature law. We adopt fractal geometry to represent the principles and build the Fractal Data Base random search (FractalDBrs). In contrast to traditional image database such as ImageNet, Iterated Function System (IFS) automatically generates large amount of image data to build the proposed FractalDBrs in short time without menial labors such as collecting and annotating images. In the experiments, we utilized the FractalDBrs and traditional databases; ImageNet, CIFAR100, Caltech256, or Places365, for pre-training in 3D object recognition with ModelNet40. The model pre-trained with FractalDBrs achieved the highest discrimination accuracy of 97.12% against the second highest accuracy of 96.43% with ImageNet. For reference, the model trained from scratch achieved 91.53% discrimination accuracy. We have verified the effectiveness of the proposed fractal geometry-based image database for pre-training in 3D object recognition. © 2021 Japan Society for Precision Engineering. All rights reserved.
引用
收藏
页码:374 / 379
页数:5
相关论文
共 50 条
  • [41] A Large-Scale Multi-Pose 3D-RGB Object Database
    Sachara, Fabian
    Handmann, Finn
    Cremer, Nico
    Kopinski, Thomas
    Gepperth, Alexander
    Handmann, Uwe
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1326 - 1332
  • [42] Learning from Millions of 3D Scans for Large-scale 3D Face Recognition
    Gilani, Syed Zulqarnain
    Mian, Ajmal
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1896 - 1905
  • [43] Large-scale tests of a keyed, appearance-based 3-D object recognition system
    Nelson, RC
    Selinger, A
    VISION RESEARCH, 1998, 38 (15-16) : 2469 - 2488
  • [44] Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis
    Tang, Yucheng
    Yang, Dong
    Li, Wenqi
    Roth, Holger R.
    Landman, Bennett
    Xu, Daguang
    Nath, Vishwesh
    Hatamizadeh, Ali
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20698 - 20708
  • [45] RandomRooms: Unsupervised Pre-training from Synthetic Shapes and Randomized Layouts for 3D Object Detection
    Rao, Yongming
    Liu, Benlin
    Wei, Yi
    Lu, Jiwen
    Hsieh, Cho-Jui
    Zhou, Jie
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3263 - 3272
  • [46] 3D face recognition by applying filter based on geometry image
    Cai, Liang
    Da, Feipeng
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2012, 42 (05): : 859 - 863
  • [47] OASIS: A Large-Scale Dataset for Single Image 3D in the Wild
    Chen, Weifeng
    Qian, Shengyi
    Fan, David
    Kojima, Noriyuki
    Hamilton, Max
    Deng, Jia
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 676 - 685
  • [48] Geometry-Informed Neural Operator for Large-Scale 3D PDEs
    Li, Zongyi
    Kovachki, Nikola Borislavov
    Choy, Chris
    Li, Boyi
    Kossaifi, Jean
    Otta, Shourya Prakash
    Nabian, Mohammad Amin
    Stadler, Maximilian
    Hundt, Christian
    Azizzadenesheli, Kamyar
    Anandkumar, Anima
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [49] 3D Stochastic Geometry Model for Large-Scale Molecular Communication Systems
    Deng, Yansha
    Noel, Adam
    Guo, Weisi
    Nallanathan, Arumugam
    Elkashlan, Maged
    2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,
  • [50] Probabilistic 3D Object Recognition Based on Multiple Interpretations Generation
    Lu, Zhaojin
    Lee, Sukhan
    Kim, Hyunwoo
    COMPUTER VISION - ACCV 2010, PT IV, 2011, 6495 : 333 - +