共 50 条
- [1] Optimizing GPU Cache Policies for MI Workloads PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2019), 2019, : 243 - 248
- [3] Characterizing Large Dataset GPU Compute Workloads Targeting Systems with Die-Stacked Memory 2015 IEEE 22ND INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2015, : 204 - 213
- [4] A practical performance model for compute and memory bound GPU kernels 23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 651 - 658
- [5] Cache performance of video computation workloads THIRD INTERNATIONAL WORKSHOP ON DIGITAL AND COMPUTATIONAL VIDEO, PROCEEDINGS, 2002, : 169 - 175
- [6] Optimizing Deep Learning Workloads on ARM GPU with TVM 1ST ACM REQUEST WORKSHOP/TOURNAMENT ON REPRODUCIBLE SOFTWARE/HARDWARE CO-DESIGN OF PARETO-EFFICIENT DEEP LEARNING, 2018,
- [7] (Mis)Understanding the NUMA Memory System Performance of Multithreaded Workloads 2013 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2013), 2013, : 11 - 22
- [8] Exploring Shared Memory and Cache to Improve GPU Performance and Energy Efficiency PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2015), 2015, : 397 - 400
- [9] Optimizing Private Memory Performance By Dynamically Deactivating Cache Coherence 2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 1112 - 1117
- [10] Optimizing Amazon SageMaker Workloads with Predictive Compute Type Selection Strategies ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT II, 2024, 2091 : 129 - 141