摘要: make compromise between learnt policy and minimal cost! π hat is using states π theta is using observations 阅读全文
posted @ 2018-05-27 23:01 ecoflex 阅读(185) 评论(0) 推荐(0) 编辑
摘要: MPC means replan every step Every N step, rebuild the dynamic model 阅读全文
posted @ 2018-05-27 18:15 ecoflex 阅读(237) 评论(0) 推荐(0) 编辑