3D Workload Subsetting for GPU architecture Pathfinding

被引:2
|
作者
George, Vinod Mohan [1 ]
机构
[1] Intel, Bangalore 560017, Karnataka, India
关键词
D O I
10.1109/IISWC.2015.24
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Growth of high-end 3D gaming, expansion of gaming to new devices like tablets and phones, and evolution of multiple Graphics APIs like Direct3D 10+, and OpenGL 3.0+ have led to an explosion in the number of workloads that need to be evaluated for GPU architecture path-finding. To decide on the optimal architecture configuration, the workloads need to be simulated on a wide range of architecture designs which incurs huge cost, both in terms of time and resources. In order to reduce the simulation cost of path-finding, extracting workload subsets from 3D workloads is essential. This paper presents a methodology to find representative workload subsets from 3D workloads by combining clustering and phase detection. In the first part, this paper presents a methodology to group draw-calls based on performance similarity by clustering on their microarchitecture independent characteristics. Across 717 frames encompassing 828K draw-calls, the clustering solution obtained an average performance prediction error per frame of 1.0% at an average clustering efficiency of 65.8%. The clustering quality is additionally evaluated by calculating cluster outliers, which are clusters with intra cluster prediction error greater than 20%. The clustering quality, measured using cluster outliers, is an indication of the performance similarity of the individual clusters. Across the spectrum of frames, we found that on an average only 3.0% of the clusters are outliers which indicates a high clustering quality. In order to detect repetitive behavior in 3D workloads, we propose characterization of frame intervals using shader vectors and then using shader vector equality to extract the repeating patterns. We show that phases exist in each game in the Bioshock series enabling extraction of small representative subsets from the workloads. Performance improvement of the workload subsets, which are less than one percent of parent workload, with GPU frequency scaling has high correlation (correlation coefficient=99.7%+)to the performance improvement of its parent workload.
引用
收藏
页码:130 / 139
页数:10
相关论文
共 50 条
  • [21] Pathfinding Methodology for Optimal Design and Integration of 2.5D/3D Interconnects
    Yazdani, Farhang
    Park, John
    2014 IEEE 64TH ELECTRONIC COMPONENTS AND TECHNOLOGY CONFERENCE (ECTC), 2014, : 1667 - 1672
  • [22] A 3D DCT architecture for compression of integral 3D images
    Jalloh, I
    Aggoun, A
    McCormick, M
    2000 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS: DESIGN AND IMPLEMENTATION, 2000, : 238 - 244
  • [23] Architecture of 3D Memory Cell Array on 3D IC
    Lee, Sang-Yun
    Park, Junil
    2012 4TH IEEE INTERNATIONAL MEMORY WORKSHOP (IMW), 2012,
  • [24] gHull: A GPU Algorithm for 3D Convex Hull
    Gao, Mingcen
    Thanh-Tung Cao
    Nanjappa, Ashwin
    Tan, Tiow-Seng
    Huang, Zhiyong
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2013, 40 (01):
  • [25] 基于GPU的3D距离变换
    田绪红
    司徒志远
    陈茂资
    韩国强
    计算机应用研究, 2008, (09) : 2847 - 2849
  • [26] Comics Stylizations of 3D Scenes Using GPU
    Suarez, Jordane
    Belhadj, Fares
    Boyer, Vincent
    ADVANCES IN VISUAL COMPUTING, PT III, 2010, 6455 : 524 - 533
  • [27] Fast GPU 3D diffeomorphic image registration
    Brunn, Malte
    Himthani, Naveen
    Biros, George
    Mehl, Miriam
    Mang, Andreas
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 149 : 149 - 162
  • [28] Simplification of Moving 3D Scene Data on GPU
    Chenchu, Rajesh
    Michiels, Nick
    Rogmans, Sammy
    Bekaert, Philippe
    SIGMAP: PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON E-BUSINESS AND TELECOMMUNICATIONS - VOL. 5, 2016, : 95 - 98
  • [29] A GPU 3D Segmentation Framework for Medical Imaging
    Galluzzo, Francesca
    De Marchi, Luca
    Testoni, Nicola
    Masetti, Guido
    APPLICATIONS IN ELECTRONICS PERVADING INDUSTRY, ENVIRONMENT AND SOCIETY, APPLEPIES 2014, 2016, 351 : 107 - 114
  • [30] GFT: GPU Fast Triangulation of 3D Points
    Sanchez, Jairo R.
    Alvarez, Hugo
    Borro, Diego
    COMPUTER VISION AND GRAPHICS, PT II, 2010, 6375 : 235 - +