Post category - Reinforcement Learning
Summary: Improvement: Gt = Gt - mean(Gt). Code: model.py import torch import torch.nn as nn import torch.nn.functional as F class Model(nn.Module): def __init__(self): super
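The improvement named in this summary, Gt = Gt - mean(Gt), can be illustrated with a minimal sketch. The post's own code is cut off in this listing, so the function names `discounted_returns` and `center_returns` and the `gamma` parameter below are assumptions made for illustration, not the author's implementation. The sketch uses plain Python lists rather than torch tensors to stay self-contained.

```python
def discounted_returns(rewards, gamma=0.99):
    """Compute Gt for every step of one episode, working backwards.

    `gamma` is an assumed discount factor; the summary does not state one.
    """
    returns = []
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g          # Gt = r_t + gamma * G_{t+1}
        returns.append(g)
    returns.reverse()              # restore chronological order
    return returns


def center_returns(returns):
    """Apply Gt = Gt - mean(Gt): subtract the episode's mean return."""
    mean = sum(returns) / len(returns)
    return [g - mean for g in returns]


if __name__ == "__main__":
    gts = discounted_returns([1.0, 1.0, 1.0], gamma=0.5)
    print(gts)
    print(center_returns(gts))
```

After centering, steps with a below-average return get a negative weight and steps with an above-average return get a positive one, which is the usual motivation for subtracting a mean baseline in policy gradient methods.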
Summary: Policy gradient, an introductory example. Original link: https://www.cnblogs.com/Twobox/ Reference links: https://datawhalechina.github.io/easy-rl/#/chapter4/chapter4 https://zhuanlan.zhihu.com/p/358700
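As a rough illustration of what a policy gradient update does, here is a minimal sketch for a softmax policy over a handful of actions. It is not the code from the linked post, which is not visible in this listing; the names `softmax`, `grad_log_prob`, `reinforce_step` and the learning rate `lr` are hypothetical. It relies on the standard identity that the gradient of log pi(a) with respect to the logits is one_hot(a) - softmax(logits).

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def grad_log_prob(logits, action):
    """Gradient of log pi(action) w.r.t. the logits: one_hot(action) - probs."""
    probs = softmax(logits)
    return [(1.0 if i == action else 0.0) - p for i, p in enumerate(probs)]


def reinforce_step(logits, action, ret, lr=0.1):
    """One gradient-ascent step, weighting the log-prob gradient by the return."""
    grad = grad_log_prob(logits, action)
    return [z + lr * ret * g for z, g in zip(logits, grad)]


if __name__ == "__main__":
    logits = [0.0, 0.0]
    updated = reinforce_step(logits, action=0, ret=1.0)
    print(softmax(logits), softmax(updated))
```

With a positive return, the probability of the sampled action goes up; with a negative return (for example after subtracting a mean baseline), it goes down.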