

# Bellman Equation
## Calculating return
1. Direct calculation
2. Bootstrapping (returns rely on each other)
### Bellman equation
- Calculate returns in bootstrapp
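
To make the two ways of calculating returns concrete, here is a minimal sketch under stated assumptions: a hypothetical 4-state deterministic cycle, a discount factor `gamma = 0.9`, and made-up rewards, none of which come from the post. Bootstrapping means each return is written in terms of the next one, $v = r + \gamma P v$, which can be solved in closed form as $v = (I - \gamma P)^{-1} r$ or by repeated substitution.

```python
import numpy as np

gamma = 0.9  # discount factor (assumed for illustration)

# Immediate reward collected when leaving each state (assumed values).
r = np.array([0.0, 1.0, 1.0, 1.0])

# P[i, j] = 1 means state i always moves to state j.
# Hypothetical cycle: s0 -> s1 -> s2 -> s3 -> s0.
P = np.array([
    [0, 1, 0, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 1],
    [1, 0, 0, 0],
], dtype=float)

# Closed form: solve (I - gamma * P) v = r.
v_closed = np.linalg.solve(np.eye(4) - gamma * P, r)

# Bootstrapping by iteration: start from zeros and substitute repeatedly.
v_iter = np.zeros(4)
for _ in range(1000):
    v_iter = r + gamma * P @ v_iter

print(v_closed)
print(v_iter)
```

Both vectors should agree up to floating-point error, since the iteration contracts by a factor of `gamma` at each step.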

# Basic Concepts
## State
$$ s_i\quad, \quad S = \{s_i\} $$
- Denotes a state and the state space (a set)
## Action
$$ a_i \quad , \quad A = \{a_i\} $$
- Denotes an action and the action space (a set)
- Can use Tabular representa

One Hot Encoding

One-hot encoding is a method of converting categorical variables into convenient numeric variables (e.g. 0-1) using dummy variables. Pandas Get dummy columns dummies
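
As a small illustration of the dummy-variable idea, the sketch below uses `pandas.get_dummies` on a made-up `color` column. The DataFrame, the column name, and the variable `encoded` are assumptions for this example and are not taken from the post.

```python
import pandas as pd

# Hypothetical data: one categorical column with three distinct values.
df = pd.DataFrame({"color": ["red", "green", "blue", "green"]})

# One 0/1 indicator column per category; dtype=int gives 0/1 instead of booleans.
dummies = pd.get_dummies(df["color"], dtype=int)

# Attach the indicator columns to the original frame.
encoded = pd.concat([df, dummies], axis=1)
print(encoded)
```

Each row has exactly one indicator set to 1, which is what makes the encoding "one hot".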

Neural Network

A neural network consists of many layers with coefficients. Divide one part into several subparts and repeat this step an appropriate number of times. Train Make a ran

Gradient Descent

Gradient descent uses loops and a delta to reduce the difference between y_predict and y.
import pandas as pd
import numpy as np
import matplotlib.pyplot
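
The loop-and-delta idea can be sketched as follows for a straight-line fit. The synthetic data, the learning rate, the iteration count, and the names `w`, `b`, `grad_w`, `grad_b` are all assumptions chosen for illustration, not the post's own code.

```python
import numpy as np

# Synthetic data generated from y = 2x + 1 (assumed for illustration).
x = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y = 2.0 * x + 1.0

w, b = 0.0, 0.0        # initial guesses for slope and intercept
learning_rate = 0.05   # step size (assumed; too large a value diverges)

for _ in range(5000):
    y_predict = w * x + b
    error = y_predict - y
    # Gradients of the mean squared error with respect to w and b.
    grad_w = 2.0 * np.mean(error * x)
    grad_b = 2.0 * np.mean(error)
    # Step against the gradient to shrink the difference.
    w -= learning_rate * grad_w
    b -= learning_rate * grad_b

print(w, b)
```

With noise-free data like this, `w` and `b` should approach the slope and intercept used to generate `y`.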

Linear Regression packages import pandas as pd import numpy as np import matplotlib.pyplot as plt from sklearn import linear_model model reg = Linear_

Stack Method Pros: simple code. Cons: not necessarily the shortest path. My own implementation: maze = [ [1, 1, 1, 1, 1, 1, 1, 1, 1, 1], [1, 0, 1, 0, 0, 1, 1, 1, 0, 1], [1, 0, 0, 0, 1, 1, 0, 0, 0, 1], [1, 0,
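
For reference, a stack-based (depth-first) maze search can be sketched as below. This is not the post's implementation: the function name `solve_maze`, the 5x5 `small_maze`, and the start and goal cells are hypothetical, with `1` meaning wall and `0` meaning open cell as in the excerpt.

```python
def solve_maze(maze, start, goal):
    """Depth-first search with an explicit stack; returns a path or None."""
    rows, cols = len(maze), len(maze[0])
    stack = [start]      # the stack holds the current path from start
    visited = {start}
    while stack:
        r, c = stack[-1]
        if (r, c) == goal:
            return list(stack)
        # Try up, right, down, left; take the first open, unvisited neighbor.
        for dr, dc in ((-1, 0), (0, 1), (1, 0), (0, -1)):
            nr, nc = r + dr, c + dc
            if (0 <= nr < rows and 0 <= nc < cols
                    and maze[nr][nc] == 0 and (nr, nc) not in visited):
                visited.add((nr, nc))
                stack.append((nr, nc))
                break
        else:
            # Dead end: backtrack by popping the current cell.
            stack.pop()
    return None


# Hypothetical small maze (1 = wall, 0 = open).
small_maze = [
    [1, 1, 1, 1, 1],
    [1, 0, 0, 0, 1],
    [1, 0, 1, 0, 1],
    [1, 0, 0, 0, 1],
    [1, 1, 1, 1, 1],
]
path = solve_maze(small_maze, (1, 1), (3, 3))
print(path)
```

Because the search commits to the first open neighbor it finds, the returned path depends on the direction order and is not guaranteed to be the shortest, which matches the drawback noted above.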

YOLOv7 CSDN article. Using Git Bash, enter the commands; keypoint.py ran successfully. #!/usr/bin/env python # coding: utf-8 # In[ ]: import matplotlib.pyplot as plt import torch import cv2 f

School Java Week9

Week9 W9L1 Static Variable: the member belongs to the type itself, rather than to an instance of that type. Array of Objects: Just like int or

School Java Week7

Week7 W7L1 Java Virtual Machine (JVM) JDK (Development Kit) JRE (Runtime Environment) JDB (Debugger) .java -> [javac compiler] -> .class -> [JVM] -> U

