penuel

April 1, 2025

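Summary: When the state space is too large, a function is used to approximate the state values or action values so that they remain tractable. 1. Algorithm for state value estimation 1.1 Objective function Here $d_{\pi}$ is the weight, which can determine which st Read more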

posted @ 2025-04-01 11:44 penuel
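
As a hedged illustration of the objective function mentioned in the summary above: in the usual textbook formulation of value function approximation, the objective is a weighted squared error between the true state value and its approximation. Only the weight $d_{\pi}$ appears in the truncated summary; the symbols $v_{\pi}$, $\hat{v}$, $w$ and $\mathcal{S}$ are assumptions borrowed from that standard formulation, so this may differ from the notation in the full post.

```latex
J(w) = \mathbb{E}\big[(v_{\pi}(S) - \hat{v}(S, w))^2\big]
     = \sum_{s \in \mathcal{S}} d_{\pi}(s)\,\big(v_{\pi}(s) - \hat{v}(s, w)\big)^2
```

Under this reading, $d_{\pi}(s) \ge 0$ and $\sum_{s} d_{\pi}(s) = 1$, so a state with a larger weight contributes more to the error being minimized.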

March 19, 2025

Reinforcement Learning Theory - Lesson 7 - Temporal-Difference Methods

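Summary: 1. TD learning of state values Equation 1 updates the estimate for the visited state $s_t$: its state value at step t+1 is computed from its state value at step t. Equation 2 covers the states that were not visited: their state value at the next step equals the value at the previous step. 1.1 Two concepts: TD target and TD error TD target: TD err Read more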

posted @ 2025-03-19 11:46 penuel
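
The two update rules and the two concepts named in the summary above can be sketched in a few lines of Python. This is a minimal tabular TD(0) sketch under stated assumptions, not the code from the full post: the function name `td0_update`, the dictionary-based value table, and the parameters `alpha` (step size) and `gamma` (discount rate) are all hypothetical. The TD error is taken here as the current estimate minus the TD target; some texts use the opposite sign.

```python
from typing import Dict, Hashable

State = Hashable


def td0_update(values: Dict[State, float], s_t: State, reward: float,
               s_next: State, alpha: float = 0.1, gamma: float = 0.9) -> float:
    """Apply one tabular TD(0) update in place and return the TD error."""
    # TD target: one-step bootstrapped estimate of the value of s_t.
    td_target = reward + gamma * values[s_next]
    # TD error: gap between the current estimate and the TD target
    # (sign convention assumed here: estimate minus target).
    td_error = values[s_t] - td_target
    # Rule 1: the visited state s_t moves toward the TD target.
    values[s_t] = values[s_t] - alpha * td_error
    # Rule 2: every state that was not visited keeps its previous value,
    # which holds here because no other entry of `values` is touched.
    return td_error


if __name__ == "__main__":
    table = {"s1": 0.0, "s2": 1.0, "s3": 5.0}
    error = td0_update(table, "s1", reward=1.0, s_next="s2", alpha=0.5, gamma=0.5)
    print(error, table)  # table["s1"] becomes 0.75; "s2" and "s3" are unchanged
```

In the usage example the TD target is 1.0 + 0.5 * 1.0 = 1.5, the TD error is 0.0 - 1.5 = -1.5, and the visited state moves halfway toward the target.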

January 2, 2025

OCS2::legged_robot::LeggedRobotInterface.cpp

Summary: This file mainly constructs the optimal control problem. 1. setupOptimalConrolProblem void LeggedRobotInterface::setupOptimalConrolProblem(const std::string& taskFile, const std::string& urd Read more

posted @ 2025-01-02 14:13 penuel

OCS2::legged_robot::gait.info

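Summary: Gait file: 1. Gait types list { [0] stance: standing still [1] trot: a fast walk, a quick and stable gait that alternates diagonal leg pairs [2] standing_trot: inserts a stance phase between the alternating steps to increase stability [3] flying_trot: inserts a flight phase between the alternating steps to increase speed [4] pace: same-side gait, left Read more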

posted @ 2025-01-02 10:20 penuel

December 30, 2024

OCS2::MPC startup flow

Summary: 1. Create the MPC_ROS_Interface, using sqpMpc as an example // custom interface LeggedRobotInterface interface(taskFile, urdfFile, referenceFile); // create the synchronized interface auto gaitReceiverPtr = std::ma Read more

posted @ 2024-12-30 10:55 penuel

December 19, 2024

OCS2::ocs2_centroidal_model (centroidal momentum model)

Summary: 1. ModelHelperFunctions.cpp 1.1 updateCentroidalDynamics(): updates the centroidal dynamics template <typename SCALAR_T> void updateCentroidalDynamics(PinocchioInterfaceTpl< Read more

posted @ 2024-12-19 17:22 penuel

December 11, 2024

OCS2::legged_robot::SwingTrajectoryPlanner (swing-leg trajectory planning)

Summary: Computes the vertical velocity constraint for a specified leg at a specific point in time