2024 年 11月 5 日随笔档案 - penuel

2024年11月5日

摘要： 1. state:状态，可以是机器人的位置，速度，加速度等 2. action:对于每一个状态，可能的动作 3. state transition:状态转移 3.1 state transition probability: 4. policy：告诉agent在这个状态应该采用哪个action 5. 阅读全文

posted @ 2024-11-05 09:58 penuel 阅读(3) 评论(0) 推荐(0) 编辑

强化学习理论-第0课-汇总

摘要： ![](https://img2024.cnblogs.com/blog/1746850/202411/1746850-20241105093751819-829769841.jpg) ![](https://img2024.cnblogs.com/blog/1746850/202411/1746850-20241105093753475-478576475.jpg) ![](https://im 阅读全文

posted @ 2024-11-05 09:38 penuel 阅读(3) 评论(0) 推荐(0) 编辑

penuel

公告