Summary:
from pathlib import Path. Get the absolute path of the current working directory: Path.cwd()
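A minimal sketch of the call mentioned in this summary. The variable name `cwd` is only illustrative; `Path.cwd()` returns the current working directory as an absolute `Path` object.

```python
from pathlib import Path

# Path.cwd() returns the process's current working directory
# as an absolute Path object (the pathlib counterpart of os.getcwd()).
cwd = Path.cwd()

print(cwd)                # the directory the script was launched from
print(cwd.is_absolute())  # the returned path is absolute
```

Note that this is the directory the process runs in, which is not necessarily the directory containing the script file.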
Summary:
class ExtendedTimeStep(NamedTuple): # step_type: Any # reward: Any # discount: Any # observation: Any # action: Any def __init__(self, step_type, rewa
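For comparison, here is a sketch of the same class with the annotated fields left uncommented. This is an assumption about the intended shape of the data, not the post's final code: `typing.NamedTuple` generates the constructor from the annotated fields and does not allow the class body to define `__init__`. The sample values passed to the constructor are hypothetical.

```python
from typing import Any, NamedTuple


class ExtendedTimeStep(NamedTuple):
    # The constructor is generated from these annotated fields,
    # so no hand-written __init__ is needed.
    step_type: Any
    reward: Any
    discount: Any
    observation: Any
    action: Any


# Hypothetical values, only to show construction and field access.
ts = ExtendedTimeStep(step_type=0, reward=1.0, discount=0.99,
                      observation=[0.0], action=1)

print(ts.reward)
print(ts._fields)
```

If a custom constructor is really needed, a regular class or a `dataclass` is probably the better fit.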
Summary:
Following the PPO algorithm, testing shows that in obs_batch = self.obs[:-1].view(-1, *self.obs.size()[2:])[indices], self.obs.size()[2:] = (4,84,84) and self.obs[:-1].shape = (128
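The shape arithmetic behind that expression can be sketched as follows. This uses NumPy's `reshape` as a stand-in for PyTorch's `view`, and the rollout sizes (`num_steps`, `num_envs`) and `indices` are hypothetical, chosen small only for illustration; only the trailing `(4, 84, 84)` comes from the summary.

```python
import numpy as np

num_steps, num_envs = 5, 2  # hypothetical rollout sizes
# Rollout storage holds num_steps + 1 observations per environment.
obs = np.zeros((num_steps + 1, num_envs, 4, 84, 84), dtype=np.float32)

# Drop the last time step, then merge the (step, env) axes into one,
# keeping the per-observation shape obs.shape[2:] unchanged.
flat = obs[:-1].reshape(-1, *obs.shape[2:])

indices = [0, 3, 7]  # hypothetical minibatch indices
obs_batch = flat[indices]

print(obs.shape[2:])    # (4, 84, 84)
print(flat.shape)       # (10, 4, 84, 84)
print(obs_batch.shape)  # (3, 4, 84, 84)
```

The first axis of the flattened array has `num_steps * num_envs` entries, so each index in `indices` picks one (step, environment) pair.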