MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks

Authors
Zhang, Lei [1, 3]
Zhang, Yuge [1]
Ren, Kan [2]
Li, Dongsheng [1]
Yang, Yuqing [1]
Affiliations
[1] Microsoft Res, Redmond, WA USA
[2] ShanghaiTech Univ, Shanghai, Peoples R China
[3] Microsoft Res Asia, Beijing, Peoples R China
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The field of machine learning (ML) has gained widespread adoption, leading to significant demand for adapting ML to specific scenarios, which remains expensive and non-trivial. The predominant approaches to automating the solution of ML tasks (e.g., AutoML) are often time-consuming and hard for human developers to understand. In contrast, although human engineers have a remarkable ability to understand tasks and reason about solutions, their experience and knowledge are often sparse and difficult for quantitative approaches to utilize. In this paper, we aim to bridge the gap between machine intelligence and human knowledge by introducing a novel framework, MLCopilot, which leverages state-of-the-art large language models (LLMs) to develop ML solutions for novel tasks. We showcase the possibility of extending the capability of LLMs to comprehend structured inputs and perform thorough reasoning for solving novel ML tasks. We find that, after some dedicated design, the LLM can (i) observe from the existing experiences of ML tasks and (ii) reason effectively to deliver promising results for new tasks. The generated solution can be used directly to achieve a high level of competitiveness.
Pages: 2931-2959
Page count: 29