Summary: # Python 3.6.5 :: Anaconda, Inc. import numpy as np import pandas as pd import time np.random.seed(2) N_STATUS = 5 ACTIONS = ['left', 'right'] EPSILON = 0.9 ALPHA = 0.1 LAMBDA = 0.9 MAX_EPISODES =...
posted @ 2018-08-23 10:04 路边的十元钱硬币 Views (313) Comments (0) Recommendations (0)