16 items in total
- [1] GPU-enabled Function-as-a-Service for Machine Learning Inference. 2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS, 2023: 918-928
- [2] Network Measurements with Function-as-a-Service for Distributed Low-latency Edge Applications. 2ND WORKSHOP ON FLEXIBLE RESOURCE AND APPLICATION MANAGEMENT ON THE EDGE, FRAME 2022, 2022: 25-28
- [5] Deep-Cross-Attention Recommendation Model for Knowledge Sharing Micro Learning Service. ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2020), PT II, 2020, 12164: 168-173
- [7] CoFB: latency-constrained co-scheduling of flows and batches for deep learning inference service on the CPU–GPU system. The Journal of Supercomputing, 2023, 79: 14172-14199
- [8] Latency-Sensitive Service Function Chains Intelligent Migration in Satellite Communication Driven by Deep Reinforcement Learning. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2024, 35 (11)
- [9] CoFB: latency-constrained co-scheduling of flows and batches for deep learning inference service on the CPU-GPU system. JOURNAL OF SUPERCOMPUTING, 2023, 79 (13): 14172-14199
- [10] Ekko: A Large-Scale Deep Learning Recommender System with Low-Latency Model Update. PROCEEDINGS OF THE 16TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, OSDI 2022, 2022: 821-839