Stackless KD-Tree traversal for high performance GPU ray tracing

被引:113
|
作者
Popov, Stefan [1 ]
Guenther, Johannes
Seidel, Hans-Peter
Slusallek, Philipp
机构
[1] Univ Saarland, D-6600 Saarbrucken, Germany
[2] MPI Informat, Saarbrucken, Germany
关键词
D O I
10.1111/j.1467-8659.2007.01064.x
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Significant advances have been achieved for realtime ray tracing recently, but realtime performance for complex scenes still requires large computational resources not yet available from the CPUs in standard PCs. Incidentally, most of these PCs also contain modern GPUs that do offer much larger raw compute power However limitations in the programming and memory model have so far kept the performance of GPU ray tracers well below that of their CPU counterparts. In this paper we present a novel packet ray traversal implementation that completely eliminates the need for maintaining a stack during kd-tree traversal and that reduces the number of traversal steps per ray. While CPUs benefit moderately from the stackless approach, it improves GPU performance significantly. We achieve a peak performance of over 16 million rays per second,for reasonably complex scenes, including complex shading and secondary rays. Several examples show that with this new technique GPUs can actually outperform equivalent CPU based ray tracers.
引用
收藏
页码:415 / 424
页数:10
相关论文
共 50 条
  • [41] Massive parallelization of approximate nearest neighbor search on KD-tree for high-dimensional image descriptor matching
    Hu, Linjia
    Nooshabadi, Saeid
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 44 : 106 - 115
  • [42] High performance ultrasound simulation using Monte-Carlo simulation: a GPU ray-tracing implementation
    Perez Cambet, Tomas
    Vitale, Santiago
    Larrabide, Ignacio
    16TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, 2020, 11583
  • [43] Performance Comparison of Rasterization-based Graphics Pipeline and Ray Tracing on GPU Shaders
    Chang, Chun-Fa
    Chen, Kuan-Wei
    Chuang, Chin-Chien
    2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2015, : 120 - 123
  • [44] POSTER: High Performance GPU Concurrent B plus tree
    Zhang, Weihua
    Zhao, Chuanlei
    Peng, Lu
    Lin, Yuzhe
    Zhang, Fengzhe
    Jiang, Jinhu
    PPOPP'22: PROCEEDINGS OF THE 27TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2022, : 443 - 444
  • [45] Engineering a High-Performance GPU B-Tree
    Awad, Muhammad A.
    Ashkiani, Saman
    Johnson, Rob
    Farach-Colton, Martin
    Owens, John D.
    PROCEEDINGS OF THE 24TH SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING (PPOPP '19), 2019, : 145 - 157
  • [46] High-dimensional image descriptor matching using highly parallel KD-tree construction and approximate nearest neighbor search
    Hu, Linjia
    Nooshabadi, Saeid
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 132 : 127 - 140
  • [47] RTfact: Generic Concepts for Flexible and High Performance Ray Tracing
    Georgiev, Iliyan
    Slusallek, Philipp
    RT08: IEEE/EG SYMPOSIUM ON INTERACTIVE RAY TRACING 2008, PROCEEDINGS, 2008, : 115 - +
  • [48] Efficient MIMD Architectures for High-Performance Ray Tracing
    Kopta, D.
    Spjut, J.
    Brunvand, E.
    Davis, A.
    2010 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, 2010, : 9 - 16
  • [49] Tahoe: Tree Structure-Aware High Performance Inference Engine for Decision Tree Ensemble on GPU
    Xie, Zhen
    Dong, Wenqian
    Liu, Jiawen
    Liu, Hang
    Li, Dong
    PROCEEDINGS OF THE SIXTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS (EUROSYS '21), 2021, : 426 - 440
  • [50] GPU-Based Ray Tracing Algorithm for High-Speed Propagation Prediction in Typical Indoor Environments
    Guo, Lixin
    Guan, Xiaowei
    Liu, Zhongyu
    HIGH-PERFORMANCE COMPUTING IN REMOTE SENSING V, 2015, 9646