共 50 条
- [31] PreAdapter: Sparse Adaptive Parameter-efficient Transfer Learning for Language Models 2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 218 - 225
- [32] Accelerating Matrix-Vector Multiplications of Large Language Models via Efficient Encoding 2024 IEEE 17th International Conference on Solid-State and Integrated Circuit Technology, ICSICT 2024, 2024,
- [33] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Parallelism PROCEEDINGS OF THE 2024 USENIX ANNUAL TECHNICAL CONFERENCE, ATC 2024, 2024, : 545 - 561
- [34] Investigation of Layer-Wise Speech Representations in Self-Supervised Learning Models: A Cross-Lingual Study in Detecting Depression INTERSPEECH 2024, 2024, : 3020 - 3024
- [35] SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2134 - 2146
- [36] Layer-Wise Learning Rate Optimization for Task-Dependent Fine-Tuning of Pre-Trained Models: An Evolutionary Approach ACM Transactions on Evolutionary Learning and Optimization, 2024, 4 (04):
- [37] ACCUARTE PREDICTION OF PROCESS-INDUCED DEFORMATIONS IN COMPOSITES USING LAYER-WISE MODELS AND THEORY-GUIDED PROBABILISTIC MACHINE LEARNING PROCEEDINGS OF ASME 2024 AEROSPACE STRUCTURES, STRUCTURAL DYNAMICS, AND MATERIALS CONFERENCE, SSDM2024, 2024,
- [38] Large Language Models Are Zero-Shot Fuzzers: Fuzzing Deep-Learning Libraries via Large Language Models PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 423 - 435
- [40] Dynamic Susceptibility and Structural Heterogeneity of Large Reverse Micellar Water: An Examination of the Core-Shell Model via Probing the Layer-wise Features JOURNAL OF PHYSICAL CHEMISTRY B, 2020, 124 (14): : 2848 - 2863