共 50 条
- [1] Foundations and Applications in Large-scale AI Models: Pre-training, Fine-tuning, and Prompt-based LearningPROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5853 - 5854Cheng, Derek论文数: 0 引用数: 0 h-index: 0机构: Google Res, Mountain View, CA 94043 USA Google Res, Mountain View, CA 94043 USAPatel, Dhaval论文数: 0 引用数: 0 h-index: 0机构: IBM Res, Austin, TX USA Google Res, Mountain View, CA 94043 USAPang, Linsey论文数: 0 引用数: 0 h-index: 0机构: Salesforce, San Francisco, CA USA Google Res, Mountain View, CA 94043 USAMehta, Sameep论文数: 0 引用数: 0 h-index: 0机构: IBM Res, Delhi, India Google Res, Mountain View, CA 94043 USAXie, Kexin论文数: 0 引用数: 0 h-index: 0机构: Salesforce, San Francisco, CA USA Google Res, Mountain View, CA 94043 USAChi, Ed H.论文数: 0 引用数: 0 h-index: 0机构: Google Res, Mountain View, CA 94043 USA Google Res, Mountain View, CA 94043 USALiu, Wei论文数: 0 引用数: 0 h-index: 0机构: Univ Technol Sydney, Sydney, NSW, Australia Google Res, Mountain View, CA 94043 USAChawla, Nitesh论文数: 0 引用数: 0 h-index: 0机构: Univ Notre Dame, Notre Dame, IN USA Google Res, Mountain View, CA 94043 USABailey, James论文数: 0 引用数: 0 h-index: 0机构: Univ Melbourne, Melbourne, Vic, Australia Google Res, Mountain View, CA 94043 USA
- [2] SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language ModelsUNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2134 - 2146Thangarasa, Vithursan论文数: 0 引用数: 0 h-index: 0机构: Cerebras Syst Inc, Sunnyvale, CA 94085 USA Cerebras Syst Inc, Sunnyvale, CA 94085 USAGupta, Abhay论文数: 0 引用数: 0 h-index: 0机构: Cerebras Syst Inc, Sunnyvale, CA 94085 USA Cerebras Syst Inc, Sunnyvale, CA 94085 USAMarshall, William论文数: 0 引用数: 0 h-index: 0机构: Cerebras Syst Inc, Sunnyvale, CA 94085 USA Cerebras Syst Inc, Sunnyvale, CA 94085 USALi, Tianda论文数: 0 引用数: 0 h-index: 0机构: Cerebras Syst Inc, Sunnyvale, CA 94085 USA Cerebras Syst Inc, Sunnyvale, CA 94085 USALeong, Kevin论文数: 0 引用数: 0 h-index: 0机构: Cerebras Syst Inc, Sunnyvale, CA 94085 USA Cerebras Syst Inc, Sunnyvale, CA 94085 USADeCoste, Dennis论文数: 0 引用数: 0 h-index: 0机构: Cerebras Syst Inc, Sunnyvale, CA 94085 USA Cerebras Syst Inc, Sunnyvale, CA 94085 USALie, Sean论文数: 0 引用数: 0 h-index: 0机构: Cerebras Syst Inc, Sunnyvale, CA 94085 USA Cerebras Syst Inc, Sunnyvale, CA 94085 USASaxena, Shreyas论文数: 0 引用数: 0 h-index: 0机构: Cerebras Syst Inc, Sunnyvale, CA 94085 USA Cerebras Syst Inc, Sunnyvale, CA 94085 USA
- [3] Improved Fine-Tuning by Better Leveraging Pre-Training DataADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,Liu, Ziquan论文数: 0 引用数: 0 h-index: 0机构: City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R ChinaXu, Yi论文数: 0 引用数: 0 h-index: 0机构: Dalian Univ Technol, Sch Artificial Intelligence, Dalian, Peoples R China City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R ChinaXu, Yuanhong论文数: 0 引用数: 0 h-index: 0机构: Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R ChinaQian, Qi论文数: 0 引用数: 0 h-index: 0机构: Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R ChinaLi, Hao论文数: 0 引用数: 0 h-index: 0机构: Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R ChinaJi, Xiangyang论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Automat, Beijing, Peoples R China City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R ChinaChan, Antoni B.论文数: 0 引用数: 0 h-index: 0机构: City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R ChinaJin, Rong论文数: 0 引用数: 0 h-index: 0机构: Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
- [4] On the Connection between Pre-training Data Diversity and Fine-tuning RobustnessADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,Ramanujan, Vivek论文数: 0 引用数: 0 h-index: 0机构: Univ Washington, Seattle, WA 98195 USA Univ Washington, Seattle, WA 98195 USANguyen, Thao论文数: 0 引用数: 0 h-index: 0机构: Univ Washington, Seattle, WA 98195 USA Univ Washington, Seattle, WA 98195 USAOh, Sewoong论文数: 0 引用数: 0 h-index: 0机构: Univ Washington, Seattle, WA 98195 USA Univ Washington, Seattle, WA 98195 USASchmidt, Ludwig论文数: 0 引用数: 0 h-index: 0机构: Univ Washington, Seattle, WA 98195 USA Allen Inst AI, Seattle, WA USA Univ Washington, Seattle, WA 98195 USAFarhadi, Ali论文数: 0 引用数: 0 h-index: 0机构: Univ Washington, Seattle, WA 98195 USA Allen Inst AI, Seattle, WA USA Univ Washington, Seattle, WA 98195 USA
- [5] From pre-training to fine-tuning: An in-depth analysis of Large Language Models in the biomedical domainARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 157Bonfigli, Agnese论文数: 0 引用数: 0 h-index: 0机构: Univ Campus Biomed Roma, Dept Engn, Res Unit Intelligent Technol Hlth & Wellbeing, Via Alvaro Portillo 21, I-00128 Rome, Italy CNR, Inst Computat Linguist Antonio Zampolli, ItaliaNLP Lab, Via Giuseppe Moruzzi 1, I-56124 Pisa, Italy Univ Campus Biomed Roma, Dept Engn, Res Unit Intelligent Technol Hlth & Wellbeing, Via Alvaro Portillo 21, I-00128 Rome, ItalyBacco, Luca论文数: 0 引用数: 0 h-index: 0机构: CNR, Inst Computat Linguist Antonio Zampolli, ItaliaNLP Lab, Via Giuseppe Moruzzi 1, I-56124 Pisa, Italy Univ Campus Biomed Roma, Dept Engn, Res Unit Comp Syst & Bioinformat, Via Alvaro Portillo 21, I-00128 Rome, Italy Univ Campus Biomed Roma, Dept Engn, Res Unit Intelligent Technol Hlth & Wellbeing, Via Alvaro Portillo 21, I-00128 Rome, ItalyMerone, Mario论文数: 0 引用数: 0 h-index: 0机构: Univ Campus Biomed Roma, Dept Engn, Res Unit Intelligent Technol Hlth & Wellbeing, Via Alvaro Portillo 21, I-00128 Rome, Italy Univ Campus Biomed Roma, Dept Engn, Res Unit Intelligent Technol Hlth & Wellbeing, Via Alvaro Portillo 21, I-00128 Rome, ItalyDell'Orletta, Felice论文数: 0 引用数: 0 h-index: 0机构: CNR, Inst Computat Linguist Antonio Zampolli, ItaliaNLP Lab, Via Giuseppe Moruzzi 1, I-56124 Pisa, Italy Univ Campus Biomed Roma, Dept Engn, Res Unit Intelligent Technol Hlth & Wellbeing, Via Alvaro Portillo 21, I-00128 Rome, Italy
- [6] Pre-training Fine-tuning data Enhancement method based on active learning2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1447 - 1454Cao, Deqi论文数: 0 引用数: 0 h-index: 0机构: Natl Univ Def Technol, Key Lab Informat Syst Engn, Changsha, Peoples R China Natl Univ Def Technol, Key Lab Informat Syst Engn, Changsha, Peoples R ChinaDing, Zhaoyun论文数: 0 引用数: 0 h-index: 0机构: Natl Univ Def Technol, Key Lab Informat Syst Engn, Changsha, Peoples R China Natl Univ Def Technol, Key Lab Informat Syst Engn, Changsha, Peoples R ChinaWang, Fei论文数: 0 引用数: 0 h-index: 0机构: Natl Univ Def Technol, Key Lab Informat Syst Engn, Changsha, Peoples R China Natl Univ Def Technol, Key Lab Informat Syst Engn, Changsha, Peoples R ChinaMa, Haoyang论文数: 0 引用数: 0 h-index: 0机构: Natl Univ Def Technol, Key Lab Informat Syst Engn, Changsha, Peoples R China Natl Univ Def Technol, Key Lab Informat Syst Engn, Changsha, Peoples R China
- [7] SAR-HUB: Pre-Training, Fine-Tuning, and ExplainingREMOTE SENSING, 2023, 15 (23)Yang, Haodong论文数: 0 引用数: 0 h-index: 0机构: Northwestern Polytech Univ, Sch Automation, BRain & Artificial INtelligence Lab, BRAIN LAB, Xian 710072, Peoples R China Northwestern Polytech Univ, Sch Automation, BRain & Artificial INtelligence Lab, BRAIN LAB, Xian 710072, Peoples R ChinaKang, Xinyue论文数: 0 引用数: 0 h-index: 0机构: Northwestern Polytech Univ, Sch Civil Aviat, Xian 710072, Peoples R China Northwestern Polytech Univ, Sch Automation, BRain & Artificial INtelligence Lab, BRAIN LAB, Xian 710072, Peoples R ChinaLiu, Long论文数: 0 引用数: 0 h-index: 0机构: Northwestern Polytech Univ, Sch Automation, BRain & Artificial INtelligence Lab, BRAIN LAB, Xian 710072, Peoples R China Northwestern Polytech Univ, Sch Automation, BRain & Artificial INtelligence Lab, BRAIN LAB, Xian 710072, Peoples R ChinaLiu, Yujiang论文数: 0 引用数: 0 h-index: 0机构: Northwestern Polytech Univ, Sch Automation, BRain & Artificial INtelligence Lab, BRAIN LAB, Xian 710072, Peoples R China Northwestern Polytech Univ, Sch Automation, BRain & Artificial INtelligence Lab, BRAIN LAB, Xian 710072, Peoples R ChinaHuang, Zhongling论文数: 0 引用数: 0 h-index: 0机构: Northwestern Polytech Univ, Sch Automation, BRain & Artificial INtelligence Lab, BRAIN LAB, Xian 710072, Peoples R China Northwestern Polytech Univ, Sch Automation, BRain & Artificial INtelligence Lab, BRAIN LAB, Xian 710072, Peoples R China
- [8] AlignDet: Aligning Pre-training and Fine-tuning in Object Detection2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6843 - 6853Li, Ming论文数: 0 引用数: 0 h-index: 0机构: ByteDance Inc, Beijing, Peoples R China ByteDance Inc, Beijing, Peoples R ChinaWu, Jie论文数: 0 引用数: 0 h-index: 0机构: ByteDance Inc, Beijing, Peoples R China ByteDance Inc, Beijing, Peoples R ChinaWang, Xionghui论文数: 0 引用数: 0 h-index: 0机构: ByteDance Inc, Beijing, Peoples R China ByteDance Inc, Beijing, Peoples R ChinaChen, Chen论文数: 0 引用数: 0 h-index: 0机构: Univ Cent Florida, Ctr Res Comp Vis, Orlando, FL 32816 USA ByteDance Inc, Beijing, Peoples R ChinaQin, Jie论文数: 0 引用数: 0 h-index: 0机构: ByteDance Inc, Beijing, Peoples R China ByteDance Inc, Beijing, Peoples R ChinaXiao, Xuefeng论文数: 0 引用数: 0 h-index: 0机构: ByteDance Inc, Beijing, Peoples R China ByteDance Inc, Beijing, Peoples R ChinaWang, Rui论文数: 0 引用数: 0 h-index: 0机构: ByteDance Inc, Beijing, Peoples R China ByteDance Inc, Beijing, Peoples R ChinaZheng, Min论文数: 0 引用数: 0 h-index: 0机构: ByteDance Inc, Beijing, Peoples R China ByteDance Inc, Beijing, Peoples R ChinaPan, Xin论文数: 0 引用数: 0 h-index: 0机构: ByteDance Inc, Beijing, Peoples R China ByteDance Inc, Beijing, Peoples R China
- [9] Parameter-efficient fine-tuning of large-scale pre-trained language modelsNATURE MACHINE INTELLIGENCE, 2023, 5 (03) : 220 - +Ding, Ning论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Beijing Acad Artificial Intelligence, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaQin, Yujia论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Beijing Acad Artificial Intelligence, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaYang, Guang论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaWei, Fuchao论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaYang, Zonghan论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaSu, Yusheng论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Beijing Acad Artificial Intelligence, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaHu, Shengding论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Beijing Acad Artificial Intelligence, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaChen, Yulin论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaChan, Chi-Min论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaChen, Weize论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Beijing Acad Artificial Intelligence, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaYi, Jing论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Beijing Acad Artificial Intelligence, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaZhao, Weilin论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Beijing Acad Artificial Intelligence, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaWang, Xiaozhi论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaLiu, Zhiyuan论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Beijing Acad Artificial Intelligence, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaZheng, Hai-Tao论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaChen, Jianfei论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaLiu, Yang论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaTang, Jie论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Beijing Acad Artificial Intelligence, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaLi, Juanzi论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R ChinaSun, Maosong论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China Beijing Acad Artificial Intelligence, Beijing, Peoples R China Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
- [10] Parameter-efficient fine-tuning of large-scale pre-trained language modelsNature Machine Intelligence, 2023, 5 : 220 - 235Ning Ding论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyYujia Qin论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyGuang Yang论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyFuchao Wei论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyZonghan Yang论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyYusheng Su论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyShengding Hu论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyYulin Chen论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyChi-Min Chan论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyWeize Chen论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyJing Yi论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyWeilin Zhao论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyXiaozhi Wang论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyZhiyuan Liu论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyHai-Tao Zheng论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyJianfei Chen论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyYang Liu论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyJie Tang论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyJuanzi Li论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and TechnologyMaosong Sun论文数: 0 引用数: 0 h-index: 0机构: Tsinghua University,Department of Computer Science and Technology