Myriad: a real-world testbed to bridge trajectory optimization and deep learning

被引:0
|
作者
Howe, Nikolaus H. R. [1 ]
Dufort-Labbe, Simon [1 ]
Rajkumar, Nitarshan [2 ]
Bacon, Pierre-Luc [3 ]
机构
[1] Univ Montreal, Mila, Montreal, PQ, Canada
[2] Univ Cambridge, Cambridge, England
[3] Univ Montreal, Mila, Facebook CIFAR AI, IVADO, Montreal, PQ, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present Myriad, a testbed written in JAX which enables machine learning researchers to benchmark imitation learning and reinforcement learning algorithms against trajectory optimization-based methods in real-world environments. Myriad contains 17 optimal control problems presented in continuous time which span medicine, ecology, epidemiology, and engineering. As such, Myriad strives to serve as a stepping stone towards application of modern machine learning techniques for impactful real-world tasks. The repository also provides machine learning practitioners access to trajectory optimization techniques, not only for standalone use, but also for integration within a typical automatic differentiation workflow. Indeed, the combination of classical control theory and deep learning in a fully GPU-compatible package unlocks potential for new algorithms to arise. We present one such novel approach for use in optimal control tasks. Trained in a fully end-to-end fashion, our model leverages an implicit planning module over neural ordinary differential equations, enabling simultaneous learning and planning with unknown environment dynamics. All environments, optimizers and tools are available in the software package at https://github.com/nikihowe/myriad.
引用
下载
收藏
页数:15
相关论文
共 50 条
  • [21] Tackling Real-World Autonomous Driving using Deep Reinforcement Learning
    Maramotti, Paolo
    Capasso, Alessandro Paolo
    Bacchiani, Giulio
    Broggi, Alberto
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1274 - 1281
  • [22] Evaluation of a Deep Learning Model on a Real-World Clinical Glaucoma Dataset
    Thakoor, Kaveri
    Leshno, Ari
    La Bruna, Sol
    Tsamis, Emmanouil
    De Moraes, Gustavo
    Sajda, Paul
    Harizman, Noga
    Liebmann, Jeffrey M.
    Cioffi, George A.
    Hood, Donald C.
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2022, 63 (07)
  • [23] Face Recognition in Real-world Internet Videos Based on Deep Learning
    Li, Zhaoyang
    Tie, Yun
    Qi, Lin
    2019 8TH INTERNATIONAL SYMPOSIUM ON NEXT GENERATION ELECTRONICS (ISNE), 2019,
  • [24] Real-world dexterous object manipulation based deep reinforcement learning
    Yao, Qingfeng
    Wang, Jilong
    Yang, Shuyu
    arXiv, 2021,
  • [25] A Real-Time Deep Learning Approach for Real-World Video Anomaly Detection
    Petrocchi, Stefano
    Giorgi, Giacomo
    Cimino, Mario G. C. A.
    ARES 2021: 16TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, 2021,
  • [26] Brazilian disaster datasets and real-world instances for optimization and machine learning
    Veloso, Rafaela
    Cespedes, Juliana
    Caunhye, Aakil
    Alem, Douglas
    DATA IN BRIEF, 2022, 42
  • [27] A Comprehensive Study of Real-World Bugs in Machine Learning Model Optimization
    Guan, Hao
    Xiao, Ying
    Li, Jiaying
    Liu, Yepang
    Bai, Guangdong
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE, 2023, : 147 - 158
  • [28] Using a virtual lab network testbed to facilitate real-world hands-on learning in a networking course
    Luse, Andy
    Rursch, Julie
    BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2021, 52 (03) : 1244 - 1261
  • [29] Real-World Applications of Multiobjective Optimization
    Stewart, Theodor
    Bandte, Oliver
    Braun, Heinrich
    Chakraborti, Nirupam
    Ehrgott, Matthias
    Goebelt, Mathias
    Jin, Yaochu
    Nakayama, Hirotaka
    Poles, Silvia
    Di Stefano, Danilo
    MULTIOBJECTIVE OPTIMIZATION: INTERACTIVE AND EVOLUTIONARY APPROACHES, 2008, 5252 : 285 - +
  • [30] Real-World Trajectory Sharing with Local Differential Privacy
    Cunningham, Teddy
    Cormode, Graham
    Ferhatosmanoglu, Hakan
    Srivastava, Divesh
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (11): : 2283 - 2295