AutoML to Date and Beyond: Challenges and Opportunities

被引:129
|
作者
Karmaker , Shubhra Kanti [1 ]
Hassan, Md Mahadi [1 ]
Smith, Micah J. [2 ]
Xu, Lei [2 ]
Zhai, Chengxiang [3 ]
Veeramachaneni, Kalyan [2 ]
机构
[1] Auburn Univ, Samuel Ginn Coll Engn, 3106 Shelby Ctr,345 W Magnolia Ave, Auburn, AL 36849 USA
[2] MIT, LIDS, MIT Stata Ctr, 32 Vassar St,Room 32-D712, Cambridge, MA 02139 USA
[3] Univ Illinois, Thomas M Siebel Ctr Comp Sci, 201 North Goodwin Ave MC 258, Urbana, IL 61801 USA
关键词
Automated machine learning; interactive data science; democratization of artificial intelligence; predictive analytics; FEATURE-SELECTION;
D O I
10.1145/3470918
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As big data becomes ubiquitous across domains, and more and more stakeholders aspire to make the most of their data, demand for machine learning tools has spurred researchers to explore the possibilities of automated machine learning (AutoML). AutoML tools aim to make machine learning accessible for non-machine learning experts (domain experts), to improve the efficiency of machine learning, and to accelerate machine learning research. But although automation and efficiency are among AutoML's main selling points, the process still requires human involvement at a number of vital steps, including understanding the attributes of domain-specific data, defining prediction problems, creating a suitable training dataset, and selecting a promising machine learning technique. These steps often require a prolonged back-and-forth that makes this process inefficient for domain experts and data scientists alike and keeps so-called AutoML systems from being truly automatic. In this review article, we introduce a new classification system for AutoML systems, using a seven-tiered schematic to distinguish these systems based on their level of autonomy. We begin by describing what an end-to-end machine learning pipeline actually looks like, and which subtasks of the machine learning pipeline have been automated so far. We highlight those subtasks that are still done manually-generally by a data scientist-and explain how this limits domain experts' access to machine learning. Next, we introduce our novel level-based taxonomy for AutoML systems and define each level according to the scope of automation support provided. Finally, we lay out a roadmap for the future, pinpointing the research required to further automate the end-to-end machine learning pipeline and discussing important challenges that stand in the way of this ambitious goal.
引用
收藏
页数:36
相关论文
共 50 条
  • [21] ENERGY AND FOREST INDUSTRY - CHALLENGES AND OPPORTUNITIES FOR THE YEAR 2000 AND BEYOND
    WESTERBERG, EN
    INTERNATIONAL SYMPOSIUM - ENERGY OPTIONS FOR THE YEAR 2000 : CONTEMPORARY CONCEPTS IN TECHNOLOGY AND POLICY, VOLS 1-4: ENERGY SYSTEM ; TECHNOLOGY FUTURES : ENERGY BUILDINGS ; INTERNATIONAL COOPERATION, 1988, : A219 - A231
  • [22] CIVIL-MILITARY COORDINATION: CHALLENGES AND OPPORTUNITIES IN AFGHANISTAN AND BEYOND
    Olson, Lara
    Gregorian, Hrach
    JOURNAL OF MILITARY AND STRATEGIC STUDIES, 2007, 10 (01):
  • [23] CMOS Scaling Beyond 32nm: Challenges and Opportunities
    Kuhn, Kelin J.
    DAC: 2009 46TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2009, : 310 - +
  • [24] Lung cancer susceptibility beyond smoking history: opportunities and challenges
    Hanash, Samir
    TRANSLATIONAL LUNG CANCER RESEARCH, 2022, 11 (07)
  • [25] Beyond Adaptive Sports: Challenges & Opportunities to Improve Accessibility and Analytics
    Khurana, Rushil
    Wang, Ashley
    Carrington, Patrick
    23RD INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, ASSETS 2021, 2021,
  • [26] Lung nodules and beyond: approaches, challenges and opportunities in thoracic CAD
    McNitt-Gray, M
    CARS 2004: COMPUTER ASSISTED RADIOLOGY AND SURGERY, PROCEEDINGS, 2004, 1268 : 896 - 901
  • [27] Animal Proteins: Challenges and Opportunities Beyond "Avoiding Red Meat"
    Zuzelo, Patti Rager
    HOLISTIC NURSING PRACTICE, 2017, 31 (03) : 210 - 212
  • [28] Challenges and Opportunities to Manage Depression During the Menopausal Transition and Beyond
    Soares, Claudio N.
    Frey, Benicio N.
    PSYCHIATRIC CLINICS OF NORTH AMERICA, 2010, 33 (02) : 295 - +
  • [29] Progress, Opportunities and Challenges for Beyond CMOS Information Processing Technologies
    Bourianoff, George
    Nikonov, Dmitri E.
    SILICON COMPATIBLE MATERIALS, PROCESSES, AND TECHNOLOGIES FOR ADVANCED INTEGRATED CIRCUITS AND EMERGING APPLICATIONS, 2011, 35 (02): : 43 - 53
  • [30] Old challenges and new opportunities for the MDGs: now and beyond 2015
    Clarke, Matthew
    Feeny, Simon
    JOURNAL OF THE ASIA PACIFIC ECONOMY, 2011, 16 (04) : 509 - 519