AutoML to Date and Beyond: Challenges and Opportunities

被引:129
|
作者
Karmaker , Shubhra Kanti [1 ]
Hassan, Md Mahadi [1 ]
Smith, Micah J. [2 ]
Xu, Lei [2 ]
Zhai, Chengxiang [3 ]
Veeramachaneni, Kalyan [2 ]
机构
[1] Auburn Univ, Samuel Ginn Coll Engn, 3106 Shelby Ctr,345 W Magnolia Ave, Auburn, AL 36849 USA
[2] MIT, LIDS, MIT Stata Ctr, 32 Vassar St,Room 32-D712, Cambridge, MA 02139 USA
[3] Univ Illinois, Thomas M Siebel Ctr Comp Sci, 201 North Goodwin Ave MC 258, Urbana, IL 61801 USA
关键词
Automated machine learning; interactive data science; democratization of artificial intelligence; predictive analytics; FEATURE-SELECTION;
D O I
10.1145/3470918
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As big data becomes ubiquitous across domains, and more and more stakeholders aspire to make the most of their data, demand for machine learning tools has spurred researchers to explore the possibilities of automated machine learning (AutoML). AutoML tools aim to make machine learning accessible for non-machine learning experts (domain experts), to improve the efficiency of machine learning, and to accelerate machine learning research. But although automation and efficiency are among AutoML's main selling points, the process still requires human involvement at a number of vital steps, including understanding the attributes of domain-specific data, defining prediction problems, creating a suitable training dataset, and selecting a promising machine learning technique. These steps often require a prolonged back-and-forth that makes this process inefficient for domain experts and data scientists alike and keeps so-called AutoML systems from being truly automatic. In this review article, we introduce a new classification system for AutoML systems, using a seven-tiered schematic to distinguish these systems based on their level of autonomy. We begin by describing what an end-to-end machine learning pipeline actually looks like, and which subtasks of the machine learning pipeline have been automated so far. We highlight those subtasks that are still done manually-generally by a data scientist-and explain how this limits domain experts' access to machine learning. Next, we introduce our novel level-based taxonomy for AutoML systems and define each level according to the scope of automation support provided. Finally, we lay out a roadmap for the future, pinpointing the research required to further automate the end-to-end machine learning pipeline and discussing important challenges that stand in the way of this ambitious goal.
引用
收藏
页数:36
相关论文
共 50 条
  • [1] THE OPPORTUNITIES AND CHALLENGES OF DEMOCRACY: WEIMAR AND BEYOND
    Mueller, Tim B.
    BULLETIN OF THE GERMAN HISTORICAL INSTITUTE, 2019, (65): : 111 - 129
  • [2] Blockchain: challenges and opportunities beyond bitcoin
    Psaila, Giuseppe
    Garcia-Bringas, Pablo
    DYNA, 2017, 92 (05): : 517 - 521
  • [3] Challenges of Accurate and Efficient AutoML
    Dey, Swarnava
    Ghose, Avik
    Das, Soumik
    2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 1834 - 1839
  • [4] Partnerships within and beyond universities: Opportunities and challenges
    Silka, L
    PUBLIC HEALTH REPORTS, 2004, 119 (01) : 73 - 78
  • [5] Opportunities and challenges for telehealth within, and beyond, a pandemic
    Blandford, Ann
    Wesson, Janet
    Amalberti, Rene
    AlHazme, Raed
    Allwihan, Ragad
    LANCET GLOBAL HEALTH, 2020, 8 (11): : E1364 - E1365
  • [6] Blockchain Technology in Education: Opportunities, Challenges, and Beyond
    Samala A.D.
    Mhlanga D.
    Bojic L.
    Howard N.-J.
    Coelho D.P.
    International Journal of Interactive Mobile Technologies, 2024, 18 (01): : 20 - 42
  • [7] Biomedical publishing in 2015 and beyond: challenges and opportunities
    Ackland, Stephen
    Shi, Yuan-Kai
    ASIA-PACIFIC JOURNAL OF CLINICAL ONCOLOGY, 2016, 12 (01) : 5 - 6
  • [8] Beyond chatting: The opportunities and challenges of ChatGPT in medicine and radiology
    Ferres, Juan M. Lavista
    Weeks, William B.
    Chu, Linda C.
    Rowe, Steven P.
    Fishman, Elliot K.
    DIAGNOSTIC AND INTERVENTIONAL IMAGING, 2023, 104 (06) : 263 - 264
  • [9] Human Beyond the Machine: Challenges and Opportunities of Microtask Crowdsourcing
    Gadiraju, Ujwal
    Demartini, Gianluca
    Kawase, Ricardo
    Dietze, Stefan
    IEEE INTELLIGENT SYSTEMS, 2015, 30 (04) : 81 - 85
  • [10] Beyond curse or blessing: the opportunities and challenges of aDNA analysis
    Sykes, Naomi
    Spriggs, Matthew
    Evin, Allowen
    WORLD ARCHAEOLOGY, 2019, 51 (04) : 503 - 516