Viewpoint planning optimization for structure from motion-based 3D reconstruction of industrial products with sim-to-real proximal policy optimization

Cited by: 0
Authors
Wang, Yuchen [1, 2]
Xiao, Ruxin [1, 2]
Wang, Xinheng [1 ]
Zhang, Junqing [2 ]
Affiliations
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Renai Rd 111th, Suzhou Ind Pk, Suzhou 215028, Peoples R China
[2] Univ Liverpool, Dept Elect Engn & Elect, Brownlow Hill, Liverpool L69 7ZX, England
Keywords
3D reconstruction optimization; Active structure from motion; Deep reinforcement learning; Proximal policy optimization; Sim-to-real training; STRUCTURE-FROM-MOTION;
DOI
10.1016/j.eswa.2025.126674
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Viewpoint planning determines the accuracy, processing speed, and output compactness of structure from motion. Despite the importance of viewpoint planning optimization to industrial digital services, existing methods show evident shortcomings in balancing reconstruction accuracy against the number of viewpoints. Hence, this paper defines a new next-best-view problem for structure from motion, which aims simultaneously to improve accuracy, reduce the number of viewpoints, and strike a balance between the two. To solve this problem, the paper presents a novel viewpoint planning optimization method based on Proximal Policy Optimization. The method incorporates double models, an action mask, and sim-to-real training to improve training efficiency, and applies transfer learning and fine-tuning to improve the versatility of the optimized viewpoint plan. A case study and experiments with multiple house models illustrate the method. In the experiments, the optimized viewpoint plan achieved reductions of 12.42%, 14.87%, 16.39%, 15.58%, and 32.35% in Chamfer Distance, Earth Mover's Distance, the number of viewpoints, file size, and reconstruction processing time, respectively, compared with the naïve baseline. Compared with existing methods, the proposed method also showed advantages from several perspectives, particularly in the balance between reconstruction accuracy and the number of viewpoints.
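For readers unfamiliar with the first accuracy metric named in the abstract, the following is a minimal sketch of one common form of Chamfer Distance between two point clouds, together with the percentage-reduction arithmetic used to report results. The abstract does not state which variant the paper uses, so the convention here (mean squared nearest-neighbour distance, summed over both directions) is an assumption, and the names `chamfer_distance` and `relative_reduction` are hypothetical helpers introduced only for illustration.

```python
from typing import Sequence

Point = Sequence[float]


def _squared_distance(p: Point, q: Point) -> float:
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((pi - qi) ** 2 for pi, qi in zip(p, q))


def chamfer_distance(a: Sequence[Point], b: Sequence[Point]) -> float:
    """Symmetric Chamfer Distance between point sets a and b.

    Assumed convention (not taken from the paper): for each direction,
    average the squared distance from every point to its nearest
    neighbour in the other set, then add the two averages.
    """
    if not a or not b:
        raise ValueError("both point sets must be non-empty")
    a_to_b = sum(min(_squared_distance(p, q) for q in b) for p in a) / len(a)
    b_to_a = sum(min(_squared_distance(q, p) for p in a) for q in b) / len(b)
    return a_to_b + b_to_a


def relative_reduction(baseline: float, optimized: float) -> float:
    """Percentage by which `optimized` is lower than `baseline`."""
    return 100.0 * (baseline - optimized) / baseline


if __name__ == "__main__":
    # Toy data, invented for this example only.
    reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    reconstructed = [(0.0, 0.0, 0.0), (1.0, 0.0, 1.0)]
    print(chamfer_distance(reference, reconstructed))
    print(relative_reduction(1.0, 0.8758))
```

This brute-force version compares every pair of points, which is adequate for small clouds; dense reconstructions would typically use a spatial index such as a k-d tree for the nearest-neighbour search.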
Pages: 21
Related papers
50 items in total
  • [31] 3D real-time path planning based on cognitive behavior optimization algorithm for UAV with TLP model
    Yawei Cai
    Hui Zhao
    Mudong Li
    Hanqiao Huang
    Cluster Computing, 2019, 22 : 5089 - 5098
  • [32] 3D plant root system reconstruction based on fusion of deep structure-from-motion and IMU
    Yawen Lu
    Yuxing Wang
    Zhanjie Chen
    Awais Khan
    Carl Salvaggio
    Guoyu Lu
    Multimedia Tools and Applications, 2021, 80 : 17315 - 17331
  • [33] 3D plant root system reconstruction based on fusion of deep structure-from-motion and IMU
    Lu, Yawen
    Wang, Yuxing
    Chen, Zhanjie
    Khan, Awais
    Salvaggio, Carl
    Lu, Guoyu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (11) : 17315 - 17331
  • [34] A line scan camera-based structure from motion for high-resolution 3D reconstruction
    Zhang, Pengchang
    Arre, Toque Jay
    Ide-Ektessabi, Ari
    JOURNAL OF CULTURAL HERITAGE, 2015, 16 (05) : 656 - 663
  • [35] Responses to the Comments on "Plane-Based Optimization for 3D Object Reconstruction from Single Line Drawings"
    Liu, Jianzhuang
    Cao, Liangliang
    Li, Zhenguo
    Tang, Xiaoou
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (09) : 1726 - 1728
  • [36] Automated 3D Reconstruction Using Optimized View-Planning Algorithms for Iterative Development of Structure-from-Motion Models
    Arce, Samuel
    Vernon, Cory A.
    Hammond, Joshua
    Newell, Valerie
    Janson, Joseph
    Franke, Kevin W.
    Hedengren, John D.
    REMOTE SENSING, 2020, 12 (13)
  • [37] Generalized Stereo Matching Method Based on Iterative Optimization of Hierarchical Graph Structure Consistency Cost for Urban 3D Reconstruction
    Yang, Shuting
    Chen, Hao
    Chen, Wen
    REMOTE SENSING, 2023, 15 (09)
  • [38] 3D Pipe Network Reconstruction Based on Structure from Motion with Incremental Conic Shape Detection and Cylindrical Constraint
    Kagami, Sho
    Taira, Hajime
    Miyashita, Naoyuki
    Torii, Akihiko
    Okutomi, Masatoshi
    2020 IEEE 29TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2020, : 1345 - 1352
  • [39] Comparison of 3D Reconstruction between Neural Radiance Fields and Structure-from-Motion-Based Photogrammetry from 360° Videos
    Gupta, Mohit
    Borrmann, Andre
    Czerniawski, Thomas
    COMPUTING IN CIVIL ENGINEERING 2023-DATA, SENSING, AND ANALYTICS, 2024, : 429 - 436
  • [40] 3D reconstruction and segmentation system for pavement potholes based on improved structure-from-motion (SFM) and deep learning
    Wang, Niannian
    Dong, Jiaxiu
    Fang, Hongyuan
    Li, Bin
    Zhai, Kejie
    Ma, Duo
    Shen, Yibo
    Hu, Haobang
    CONSTRUCTION AND BUILDING MATERIALS, 2023, 398