Multi-level texture caching for 3D graphics hardware

Cited by: 13
Authors
Cox, M [1]
Bhandari, N [1]
Shantz, M [1]
Affiliation
[1] NASA, Ames Res Ctr, MRJ, Moffett Field, CA 94035 USA
DOI
10.1109/ISCA.1998.694765
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Traditional graphics hardware architectures implement what we call the push architecture for texture mapping. Local memory is dedicated to the accelerator for fast local retrieval of texture during rasterization, and the application is responsible for managing this memory. The push architecture has a bandwidth advantage, but disadvantages of limited texture capacity, escalation of accelerator memory requirements (and therefore cost), and poor memory utilization. The push architecture also requires the programmer to solve the bin-packing problem of managing accelerator memory each frame. More recently graphics hardware on PC-class machines has moved to an implementation of what we call the pull architecture. Texture is stored in system memory and downloaded by the accelerator as needed. The pull architecture has advantages of texture capacity, stems the escalation of accelerator memory requirements, and has good memory utilization. It also frees the programmer from accelerator texture memory management. However, the pull architecture suffers escalating requirements for bandwidth from main memory to the accelerator. In this paper we propose multi-level texture caching to provide the accelerator with the bandwidth advantages of the push architecture combined with the capacity advantages of the pull architecture. We have studied the feasibility of 2-level caching and found the following: (1) significant re-use of texture between frames; (2) L2 caching requires significantly less memory than the push architecture; (3) L2 caching requires significantly less bandwidth from host memory than the pull architecture; (4) L2 caching enables implementation of smaller L1 caches that would otherwise bandwidth-limit accelerators on the workloads in this paper.
Results suggest that an L2 cache achieves the original advantage of the pull architecture (stemming the growth of local texture memory) while at the same time stemming the current explosion in demand for texture bandwidth between host memory and the accelerator.
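The following is an illustrative sketch only, not the simulator or cache design from the paper. It models the idea in the abstract with a toy trace: a small L1 cache backed by a larger accelerator-local L2 cache, counting how many texture blocks must be fetched from host memory. The LRU replacement policy, the block granularity, the cache sizes, and the names `LRUCache` and `host_fetches` are all assumptions made for the example.

```python
from collections import OrderedDict


class LRUCache:
    """Fixed-capacity LRU set of texture block IDs (hypothetical helper)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.blocks = OrderedDict()

    def access(self, block_id):
        """Return True on a hit; on a miss, insert the block and evict the LRU one."""
        if block_id in self.blocks:
            self.blocks.move_to_end(block_id)
            return True
        self.blocks[block_id] = None
        if len(self.blocks) > self.capacity:
            self.blocks.popitem(last=False)
        return False


def host_fetches(trace, l1_blocks, l2_blocks=0):
    """Count blocks fetched from host memory for a trace of block IDs.

    With l2_blocks == 0 this models a pull-style accelerator with only an L1
    cache; with l2_blocks > 0 an L1 miss is first looked up in the L2 cache.
    """
    l1 = LRUCache(l1_blocks)
    l2 = LRUCache(l2_blocks) if l2_blocks > 0 else None
    fetched = 0
    for block_id in trace:
        if l1.access(block_id):
            continue  # served by L1
        if l2 is not None and l2.access(block_id):
            continue  # served by L2, no host traffic
        fetched += 1  # miss in every cache level: fetch from host memory
    return fetched


# Toy trace (assumed): three frames that each touch the same 8 blocks in order.
frame = list(range(8))
trace = frame * 3

pull_only = host_fetches(trace, l1_blocks=4)
two_level = host_fetches(trace, l1_blocks=4, l2_blocks=16)
print(pull_only, two_level)
```

In this toy trace the 4-block L1 cache thrashes on every frame, so the L1-only configuration refetches every block from the host, while the L2 cache captures the inter-frame re-use after the first frame. Real workloads, block sizes, and replacement policies would give different numbers.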
Pages: 86-97
Page count: 12