共 50 条
- [11] Exploring Synergies between Causal Models and Large Language Models for Enhanced Understanding and Inference 2024 2ND ASIA CONFERENCE ON COMPUTER VISION, IMAGE PROCESSING AND PATTERN RECOGNITION, CVIPPR 2024, 2024,
- [12] ServerlessLLM: Low-Latency Serverless Inference for Large Language Models PROCEEDINGS OF THE 18TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, OSDI 2024, 2024, : 135 - 153
- [15] Adaptive In-Context Learning with Large Language Models for Bundle PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 966 - 976
- [18] EchoSwift An Inference Benchmarking and Configuration Discovery Tool for Large Language Models (LLMs) COMPANION OF THE 15TH ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING, ICPE COMPANION 2024, 2024, : 158 - 162
- [20] Tabi: An Efficient Multi-Level Inference System for Large Language Models PROCEEDINGS OF THE EIGHTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS, EUROSYS 2023, 2023, : 233 - 248