Fractal geometry-based automatic generation of large-scale image database for pre-training in 3D object recognition

Authors
Yamada R.
Okayasu K.
Nakamura A.
Kataoka H.
Keywords
3D object recognition; Deep learning; Fractal; Pre-training; Supervised learning
DOI
10.2493/jjspe.87.374
Abstract
We propose a method for generating a large-scale image database for pre-training in 3D object recognition. The method is inspired by the laws of nature: we adopt fractal geometry to represent those laws and build the Fractal Data Base random search (FractalDBrs). In contrast to conventional image databases such as ImageNet, an Iterated Function System (IFS) automatically generates a large amount of image data, so the proposed FractalDBrs can be built in a short time without laborious manual work such as collecting and annotating images. In the experiments, we used FractalDBrs and four conventional databases (ImageNet, CIFAR100, Caltech256, and Places365) for pre-training in 3D object recognition on ModelNet40. The model pre-trained with FractalDBrs achieved the highest classification accuracy, 97.12%, compared with the second highest, 96.43%, obtained with ImageNet. For reference, the model trained from scratch achieved 91.53%. These results verify the effectiveness of the proposed fractal geometry-based image database for pre-training in 3D object recognition. © 2021 Japan Society for Precision Engineering. All rights reserved.
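The abstract does not give implementation details, so the following is only a minimal sketch of how an IFS can render a fractal image from randomly sampled parameters. It is not the authors' code. The function names (`sample_ifs`, `render_ifs`), the number of maps, the contraction bound of 0.8, and the image size are all illustrative assumptions.

```python
import numpy as np

def sample_ifs(rng, n_maps=4, max_norm=0.8):
    """Draw random affine maps w(x) = A x + b (hypothetical sampling scheme).

    Each matrix is rescaled so its spectral norm is at most max_norm,
    which makes every map a contraction and keeps the orbit bounded.
    """
    maps = []
    for _ in range(n_maps):
        A = rng.uniform(-1.0, 1.0, size=(2, 2))
        norm = np.linalg.norm(A, 2)
        if norm > max_norm:
            A = A * (max_norm / norm)
        b = rng.uniform(-1.0, 1.0, size=2)
        maps.append((A, b))
    return maps

def render_ifs(maps, rng, n_points=20000, size=64, burn_in=20):
    """Render the IFS attractor as a binary image using the chaos game."""
    choices = rng.integers(0, len(maps), size=n_points + burn_in)
    pts = np.empty((n_points, 2))
    x = np.zeros(2)
    for i, k in enumerate(choices):
        A, b = maps[k]
        x = A @ x + b              # apply one randomly chosen affine map
        if i >= burn_in:           # skip the first few points off the attractor
            pts[i - burn_in] = x
    # Normalize the point cloud to pixel coordinates.
    lo, hi = pts.min(axis=0), pts.max(axis=0)
    span = np.where(hi - lo > 0, hi - lo, 1.0)
    idx = ((pts - lo) / span * (size - 1)).astype(int)
    img = np.zeros((size, size), dtype=np.uint8)
    img[idx[:, 1], idx[:, 0]] = 1
    return img

rng = np.random.default_rng(0)
maps = sample_ifs(rng)
img = render_ifs(maps, rng)
print(img.shape, int(img.sum()))
```

Because images are produced from parameters rather than photographs, a label can be attached at generation time (for example, one class per sampled parameter set, which is an assumption here rather than something the abstract states), so no manual annotation step is needed.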
Pages: 374-379
Number of pages: 5