Post category - RL

[Paper Translation - RL×Diffusion] Planning with Diffusion for Flexible Behavior Synthesis

Summary:

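One of the Levine group's top-conference papers of 2022, and a pioneering work at the intersection of diffusion models and reinforcement learning.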

posted @ 2023-05-15 18:44 Be(CN₃H₃)₂ Views (4016) Comments (0) Recommendations (2)

From VPG to PPO

Summary:

VPG -> Natural Policy Gradient -> TRPO -> PPO

posted @ 2023-05-02 22:32 Be(CN₃H₃)₂ Views (279) Comments (0) Recommendations (0)