Viewpoint planning optimization for structure from motion-based 3D reconstruction of industrial products with sim-to-real proximal policy optimization

被引:0
|
作者
Wang, Yuchen [1 ,2 ]
Xiao, Ruxin [1 ,2 ]
Wang, Xinheng [1 ]
Zhang, Junqing [2 ]
机构
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Renai Rd 111 th,Suzhou Ind Pk, Suzhou 215028, Peoples R China
[2] Univ Liverpool, Dept Elect Engn & Elect, Brownlow Hill, Liverpool L69 7ZX, England
关键词
3D reconstruction optimization; Active structure from motion; Deep reinforcement learning; Proximal policy optimization; Sim-to-real training; STRUCTURE-FROM-MOTION;
D O I
10.1016/j.eswa.2025.126674
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Viewpoint planning determines the accuracy, processing speed, and lightweight of structure from motion. Despite the importance of viewpoint planning optimization to industrial digital services, existing methods show evident shortages in balancing between the reconstruction accuracy and the viewpoint number. Hence, this paper defines a new next-best-view problem for structure from motion, which aims to improve the accuracy, reduce the viewpoint number, and strike a balance between the two, simultaneously. Besides, to resolve the problem, this paper presents a novel viewpoint planning optimization method based on Proximal Policy Optimization. This method incorporates double models, action mask, and sim-to-real training to improve the training efficiency. Additionally, this method applies transfer-learning and fine-tuning to improve the versatility of the optimized viewpoint plan. A case study and experiments with multiple house models illustrate the method. In the experiment, the optimized viewpoint plan achieved 12.42%, 14.87%, 16.39%, 15.58%, and 32.35% reduction in Chamfer Distance, Earth Mover's Distance, the viewpoint number, the file size, and reconstruction processing time compared to the na & iuml;ve baseline, respectively. Also, compared to existing methods, the proposed method showed advantages from different perspectives, particularly in the balance between the reconstruction accuracy and the viewpoint number.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Reconstruction of 3D human motion in real-time using particle swarm optimization with GPU-accelerated fitness function
    Bogdan Kwolek
    Boguslaw Rymut
    Journal of Real-Time Image Processing, 2020, 17 : 821 - 838
  • [22] 3D reconstruction system based on incremental structure from motion using a camera with varying parameters
    Soulaiman El Hazzat
    Mostafa Merras
    Nabil El Akkad
    Abderrahim Saaidi
    Khalid Satori
    The Visual Computer, 2018, 34 : 1443 - 1460
  • [23] Research on Object Panoramic 3D Point Cloud Reconstruction System Based on Structure From Motion
    Zhang, Xuejing
    Liu, Jingyan
    Zhang, Bo
    Sun, Lei
    Zhou, Yuhong
    Li, Yuchao
    Zhang, Jun
    Zhang, Hao
    Fan, Xiaofei
    IEEE ACCESS, 2022, 10 : 110064 - 110075
  • [24] A Scaled Monocular 3D Reconstruction Based on Structure from Motion and Multi-View Stereo
    Zhan, Zhiwen
    Yang, Fan
    Jiang, Jixin
    Du, Jialin
    Li, Fanxing
    Sun, Si
    Wei, Yan
    ELECTRONICS, 2024, 13 (19)
  • [25] 3D reconstruction system based on incremental structure from motion using a camera with varying parameters
    El Hazzat, Soulaiman
    Merras, Mostafa
    El Akkad, Nabil
    Saaidi, Abderrahim
    Satori, Khalid
    VISUAL COMPUTER, 2018, 34 (10): : 1443 - 1460
  • [26] Automated Lofting-Based Reconstruction of CAD Models from 3D Topology Optimization Results
    Amroune, Abdennour
    Cuilliere, Jean-Christophe
    Francois, Vincent
    COMPUTER-AIDED DESIGN, 2022, 145
  • [27] Stochastic Optimization Based 3D Dense Reconstruction from Multiple Views with High Accuracy and Completeness
    Chen, Wen-Chao
    Chen, Zen
    Sung, Ping-Yi
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2015, 31 (01) : 131 - 146
  • [28] Comments on "Plane-Based Optimization for 3D Object Reconstruction from Single Line Drawings"
    Varley, Peter A. C.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (09) : 1723 - 1725
  • [29] Trajectory Optimization for Physics-Based Reconstruction of 3d Human Pose from Monocular Video
    Gartner, Erik
    Andriluka, Mykhaylo
    Xu, Hongyi
    Sminchisescu, Cristian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13096 - 13105
  • [30] 3D real-time path planning based on cognitive behavior optimization algorithm for UAV with TLP model
    Cai, Yawei
    Zhao, Hui
    Li, Mudong
    Huang, Hanqiao
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (02): : S5089 - S5098