共 50 条
- [1] Optimized Broadcast for Deep Learning Workloads on Dense-GPU InfiniBand Clusters: MPI or NCCL? [J]. EUROMPI 2018: PROCEEDINGS OF THE 25TH EUROPEAN MPI USERS' GROUP MEETING, 2018,
- [2] Understanding of GPU Architectural Vulnerability for Deep Learning Workloads [J]. 2019 IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFT), 2019,
- [3] Optimizing Deep Learning Workloads on ARM GPU with TVM [J]. 1ST ACM REQUEST WORKSHOP/TOURNAMENT ON REPRODUCIBLE SOFTWARE/HARDWARE CO-DESIGN OF PARETO-EFFICIENT DEEP LEARNING, 2018,
- [4] Evaluating On-Node GPU Interconnects for Deep Learning Workloads [J]. HIGH PERFORMANCE COMPUTING SYSTEMS: PERFORMANCE MODELING, BENCHMARKING, AND SIMULATION (PMBS 2017), 2018, 10724 : 3 - 21
- [5] Reliability of Large Scale GPU Clusters for Deep Learning Workloads [J]. WEB CONFERENCE 2021: COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2021), 2021, : 179 - 181
- [6] Poster Abstract: Deep Learning Workloads Scheduling with Reinforcement Learning on GPU Clusters [J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM 2019 WKSHPS), 2019, : 1023 - 1024
- [7] Characterization and Prediction of Deep Learning Workloads in Large -Scale GPU Datacenters [J]. SC21: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2021,
- [8] Predicting GPU Failures With High Precision Under Deep Learning Workloads [J]. PROCEEDINGS OF THE 16TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, SYSTOR 2023, 2023, : 124 - 135
- [9] Accelerating Container-based Deep Learning Hyperparameter Optimization Workloads [J]. PROCEEDINGS OF THE 6TH WORKSHOP ON DATA MANAGEMENT FOR END-TO-END MACHINE LEARNING, DEEM 2022, 2022,
- [10] DistDL: A Distributed Deep Learning Service Schema with GPU Accelerating [J]. WEB TECHNOLOGIES AND APPLICATIONS (APWEB 2015), 2015, 9313 : 793 - 804