MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks

被引:0
|
作者
Zhang, Lei [1 ,3 ]
Zhang, Yuge [1 ]
Ren, Kan [2 ]
Li, Dongsheng [1 ]
Yang, Yuqing [1 ]
机构
[1] Microsoft Res, Redmond, WA USA
[2] ShanghaiTech Univ, Shanghai, Peoples R China
[3] Microsoft Res Asia, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The field of machine learning (ML) has gained widespread adoption, leading to significant demand for adapting ML to specific scenarios, which is yet expensive and non-trivial. The predominant approaches towards the automation of solving ML tasks (e.g., AutoML) are often time-consuming and hard to understand for human developers. In contrast, though human engineers have the incredible ability to understand tasks and reason about solutions, their experience and knowledge are often sparse and difficult to utilize by quantitative approaches. In this paper, we aim to bridge the gap between machine intelligence and human knowledge by introducing a novel framework MLCopilot(1), which leverages the state-of-the-art large language models to develop ML solutions for novel tasks. We showcase the possibility of extending the capability of LLMs to comprehend structured inputs and perform thorough reasoning for solving novel ML tasks. And we find that, after some dedicated design, the LLM can (i) observe from the existing experiences of ML tasks and (ii) reason effectively to deliver promising results for new tasks. The solution generated can be used directly to achieve high levels of competitiveness.
引用
收藏
页码:2931 / 2959
页数:29
相关论文
共 50 条
  • [1] Unleashing the Power of Large Language Models for Legal Applications
    Zhang, Dell
    Petrova, Alina
    Trautmann, Dietrich
    Schilder, Frank
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 5257 - 5258
  • [2] SynAsk: unleashing the power of large language models in organic synthesis
    Zhang, Chonghuan
    Lin, Qianghua
    Zhu, Biwei
    Yang, Haopeng
    Lian, Xiao
    Deng, Hao
    Zheng, Jiajun
    Liao, Kuangbiao
    CHEMICAL SCIENCE, 2024, : 43 - 56
  • [3] Unleashing the power of large language models specific for haemophilia research
    Castaldoni, Rodrigo
    Ferreira-Martins, Andre Juan
    Nogueira, Tatiane
    Rios, Ricardo
    Lopes, Tiago Jose da Silva
    HAEMOPHILIA, 2024, 30 : 5 - 5
  • [4] ChatGPT GameJam: Unleashing the power of Large Language Models for Game Jams
    Grow, April M.
    Khosmood, Foaad
    ACM International Conference Proceeding Series, 2023, : 51 - 54
  • [5] Exploring and Unleashing the Power of Large Language Models in Automated Code Translation
    Yang, Zhen
    Liu, Fang
    Yu, Zhongxing
    Keung, Jacky Wai
    Li, Jia
    Liu, Shuo
    Hong, Yifan
    Ma, Xiaoxue
    Jin, Zhi
    Li, Ge
    arXiv,
  • [6] Evaluating the capabilities of large language models using machine learning tasks at inference-time
    Grm, Klemen
    Elektrotehniski Vestnik/Electrotechnical Review, 2023, 90 (05): : 247 - 253
  • [7] Evaluating the capabilities of large language models using machine learning tasks at inference-time
    Grm, Klemen
    ELEKTROTEHNISKI VESTNIK, 2023, 90 (05): : 247 - 253
  • [8] Large language models are less effective at clinical prediction tasks than locally trained machine learning models
    Brown, Katherine E.
    Yan, Chao
    Li, Zhuohang
    Zhang, Xinmeng
    Collins, Benjamin X.
    Chen, You
    Clayton, Ellen Wright
    Kantarcioglu, Murat
    Vorobeychik, Yevgeniy
    Malin, Bradley A.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2025,
  • [9] ChatGPT Game Jam: Unleashing the Power of Large Language Models for Game Jams
    Grow, April
    Khosmood, Foaad
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON GAME JAMS, HACKATHONS AND GAME CREATION EVENTS, ICGJ 2023, 2023, : 51 - 54
  • [10] Multimodal large language models for inclusive collaboration learning tasks
    Lewis, Armanda
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2022, : 202 - 210