Diffusion policy: Visuomotor policy learning via action diffusion

被引:8
|
作者
Chi, Cheng [1 ]
Xu, Zhenjia [1 ]
Feng, Siyuan [2 ]
Cousineau, Eric [2 ]
Du, Yilun [3 ]
Burchfiel, Benjamin [2 ]
Tedrake, Russ [2 ,3 ]
Song, Shuran [1 ,4 ]
机构
[1] Columbia Univ, Comp Sci, New York, NY USA
[2] Toyota Res Inst, Palo Alto, CA USA
[3] MIT, EECS, Cambridge, MA USA
[4] Stanford Univ, Elect Engn, Stanford, CA USA
关键词
Imitation learning; visuomotor policy; manipulation;
D O I
10.1177/02783649241273668
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
This paper introduces Diffusion Policy, a new way of generating robot behavior by representing a robot's visuomotor policy as a conditional denoising diffusion process. We benchmark Diffusion Policy across 15 different tasks from 4 different robot manipulation benchmarks and find that it consistently outperforms existing state-of-the-art robot learning methods with an average improvement of 46.9%. Diffusion Policy learns the gradient of the action-distribution score function and iteratively optimizes with respect to this gradient field during inference via a series of stochastic Langevin dynamics steps. We find that the diffusion formulation yields powerful advantages when used for robot policies, including gracefully handling multimodal action distributions, being suitable for high-dimensional action spaces, and exhibiting impressive training stability. To fully unlock the potential of diffusion models for visuomotor policy learning on physical robots, this paper presents a set of key technical contributions including the incorporation of receding horizon control, visual conditioning, and the time-series diffusion transformer. We hope this work will help motivate a new generation of policy learning techniques that are able to leverage the powerful generative modeling capabilities of diffusion models. Code, data, and training details are available (diffusion-policy.cs.columbia.edu).
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Self-Supervised Correspondence in Visuomotor Policy Learning
    Florence, Peter
    Manuelli, Lucas
    Tedrake, Russ
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) : 492 - 499
  • [42] Visuomotor Policy Learning for Task Automation of Surgical Robot
    Huang, Junhui
    Shi, Qingxin
    Xie, Dongsheng
    Ma, Yiming
    Liu, Xiaoming
    Li, Changsheng
    Duan, Xingguang
    IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2024, 6 (04): : 1448 - 1457
  • [43] Learning semantic features for action recognition via diffusion maps
    Liu, Jingen
    Yang, Yang
    Saleemi, Imran
    Shah, Mubarak
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2012, 116 (03) : 361 - 377
  • [44] Review Article: The Diffusion of Policy Diffusion Research in Political Science
    Graham, Erin R.
    Shipan, Charles R.
    Volden, Craig
    BRITISH JOURNAL OF POLITICAL SCIENCE, 2013, 43 : 673 - 701
  • [45] Site Visits, Policy Learning, and the Diffusion of Policy Innovation: Evidence from Public Bicycle Programs in China
    Ma, Liang
    JOURNAL OF CHINESE POLITICAL SCIENCE, 2017, 22 (04) : 581 - 599
  • [46] Site Visits, Policy Learning, and the Diffusion of Policy Innovation: Evidence from Public Bicycle Programs in China
    Liang Ma
    Journal of Chinese Political Science, 2017, 22 : 581 - 599
  • [47] Policy Diffusion and Policy Transfer in Comparative Welfare State Research
    Obinger, Herbert
    Schmitt, Carina
    Starke, Peter
    SOCIAL POLICY & ADMINISTRATION, 2013, 47 (01) : 111 - 129
  • [48] Agenda Setting and State Policy Diffusion: The Effects of Media Attention, State Court Decisions, and Policy Learning on Fetal Killing Policy
    Oakley, M. R.
    SOCIAL SCIENCE QUARTERLY, 2009, 90 (01) : 164 - 178
  • [49] The Seeds of Policy Change: Leveraging Diffusion to Disseminate Policy Innovations
    Boehmke, Frederick J.
    Rury, Abigail Matthews
    Desmarais, Bruce A.
    Harden, Jeffrey J.
    JOURNAL OF HEALTH POLITICS POLICY AND LAW, 2017, 42 (02) : 285 - 307
  • [50] Cannabis policy diffusion in Ontario and New Brunswick: Coercion, learning, and replication
    Train, Andrew
    Snow, Dave
    CANADIAN PUBLIC ADMINISTRATION-ADMINISTRATION PUBLIQUE DU CANADA, 2019, : 549 - 572