3D Workload Subsetting for GPU architecture Pathfinding

被引:2
|
作者
George, Vinod Mohan [1 ]
机构
[1] Intel, Bangalore 560017, Karnataka, India
关键词
D O I
10.1109/IISWC.2015.24
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Growth of high-end 3D gaming, expansion of gaming to new devices like tablets and phones, and evolution of multiple Graphics APIs like Direct3D 10+, and OpenGL 3.0+ have led to an explosion in the number of workloads that need to be evaluated for GPU architecture path-finding. To decide on the optimal architecture configuration, the workloads need to be simulated on a wide range of architecture designs which incurs huge cost, both in terms of time and resources. In order to reduce the simulation cost of path-finding, extracting workload subsets from 3D workloads is essential. This paper presents a methodology to find representative workload subsets from 3D workloads by combining clustering and phase detection. In the first part, this paper presents a methodology to group draw-calls based on performance similarity by clustering on their microarchitecture independent characteristics. Across 717 frames encompassing 828K draw-calls, the clustering solution obtained an average performance prediction error per frame of 1.0% at an average clustering efficiency of 65.8%. The clustering quality is additionally evaluated by calculating cluster outliers, which are clusters with intra cluster prediction error greater than 20%. The clustering quality, measured using cluster outliers, is an indication of the performance similarity of the individual clusters. Across the spectrum of frames, we found that on an average only 3.0% of the clusters are outliers which indicates a high clustering quality. In order to detect repetitive behavior in 3D workloads, we propose characterization of frame intervals using shader vectors and then using shader vector equality to extract the repeating patterns. We show that phases exist in each game in the Bioshock series enabling extraction of small representative subsets from the workloads. Performance improvement of the workload subsets, which are less than one percent of parent workload, with GPU frequency scaling has high correlation (correlation coefficient=99.7%+)to the performance improvement of its parent workload.
引用
收藏
页码:130 / 139
页数:10
相关论文
共 50 条
  • [41] 3D emotional agent architecture
    Ramos, FF
    Razo, L
    Martinez, AV
    Zúñiga, F
    Piza, HI
    INNOVATIVE INTERNET COMMUNITY SYSTEMS, 2006, 3908 : 181 - 194
  • [42] An architecture for digital 3D broadcasting
    Harman, P
    STEREOSCOPIC DISPLAYS AND VIRTUAL REALITY SYSTEMS VI, 1999, 3639 : 254 - 259
  • [43] Arc3D: A 3D obfuscation architecture
    Gomathisankaran, M
    Tyagi, A
    HIGH PERFORMANCE EMBEDDED ARCHITECTURES AND COMPILERS, PROCEEDINGS, 2005, 3793 : 184 - 199
  • [44] 3D Digitization in Architecture Curriculum
    Mateus, Luis
    Ferreira, Victor
    ECAADE 2013: COMPUTATION AND PERFORMANCE, VOL 2, 2013, : 445 - 452
  • [45] Mapping the 3D genome architecture
    Tavallaee, Ghazaleh
    Orouji, Elias
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2025, 27 : 89 - 101
  • [46] Architecture support for 3D obfuscation
    Gomathisankaran, M
    Tyagi, A
    IEEE TRANSACTIONS ON COMPUTERS, 2006, 55 (05) : 497 - 507
  • [47] Replication of the 3D architecture of tissues
    Perez-Castillejos, R.
    MATERIALS TODAY, 2010, 13 (1-2) : 32 - 41
  • [48] Building imagination in 3D: The design by children with architecture and 3D printing
    Qurraie, Bahar Sultan
    Ozeren, Omer
    Na-Allah, Maman Sani Guero
    ENTERTAINMENT COMPUTING, 2025, 52
  • [49] A parallel 3D DCT architecture for the compression of integral 3D images
    Aggoun, A
    Jalloh, I
    ICECS 2001: 8TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS I-III, CONFERENCE PROCEEDINGS, 2001, : 229 - 232
  • [50] Dynamic 3D graphics workload characterization and the architectural implications
    Mitra, T
    Chiueh, TC
    32ND ANNUAL INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, (MICRO-32), PROCEEDINGS, 1999, : 62 - 71