Summary:
1. Key concepts 2. Bellman optimization objective 3. Bellman example, gridworld.py and ValueIteration.py: import numpy as np import sys from gym.envs.toy_text import discrete UP = 0 RIGHT = 1 D…
Summary:
1. Key concepts 2. Code: # coding: utf-8 import numpy as np import tensorflow as tf import pickle import matplotlib.pyplot as plt get_ipython().run_line_magic('matp…