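Given a and b, the extended Euclidean algorithm computes their greatest common divisor and, at the same time, produces integers x, y satisfying ax+by=gcd(a,b). Suppose $$ d_{i-2}=d_{i-1} c_i+d_i \\ d_{i-1}=d_ic_{i+1}+d_{i+1} $$ Let $g$ be the greatest common divisor of a and b. When $d_{i-1}=0$ at some step, $1d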
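As a rough illustration of the idea in this excerpt, here is a minimal iterative sketch in Python. The function name `extended_gcd` and the variable names are hypothetical (they do not come from the post), and the sketch assumes non-negative integer inputs. It tracks the remainders together with the coefficients that express each remainder as a combination of a and b.

```python
def extended_gcd(a, b):
    """Return (g, x, y) such that a*x + b*y == g == gcd(a, b).

    Assumes a and b are non-negative integers (an assumption of this sketch).
    """
    # Invariant: old_r == a*old_x + b*old_y and r == a*x + b*y.
    old_r, r = a, b
    old_x, x = 1, 0
    old_y, y = 0, 1
    while r != 0:
        q = old_r // r  # quotient of the current division step
        old_r, r = r, old_r - q * r
        old_x, x = x, old_x - q * x
        old_y, y = y, old_y - q * y
    return old_r, old_x, old_y


g, x, y = extended_gcd(240, 46)
print(g, x, y)  # 2 -9 47, since 240*(-9) + 46*47 == 2
```

The loop stops when the remainder reaches zero; the previous remainder is then the gcd, and the coefficients carried along with it give one integer solution of ax+by=gcd(a,b).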
Independent Components Analysis Ambiguity ICA is ambiguous up to scaling and permutation, but usually this doesn't matter. As long as the data is not Gauss
Principal Components Analysis Intuition PCA tries to identify the subspace in which the data approximately lies. Intuitively, we choose a direction fo
Decision Tree Greedy, Top-down, Recursive Classification Tree Misclassification loss is not a suitable loss for decision trees, because $$ L(R_p) - (\lam
Factor Analysis Background When m (number of samples) < n (dimension of samples), the sample covariance matrix is singular. So it is necessary to find
EM Algorithm Jensen's inequality convex function: $f''(x) \ge 0$ or $H \succeq 0$ (Hessian matrix when x is a vector) $$ E[f(X)] \ge f(E[X]) $$ EM Algorithm
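A quick numerical check of Jensen's inequality may help make the statement concrete. The sketch below is only an illustration: the choice of $f(x)=x^2$, the Gaussian samples, and all variable names are assumptions made for this example, not part of the post.

```python
import random


def f(x):
    # A convex function: f''(x) = 2 >= 0 everywhere.
    return x * x


random.seed(0)  # fixed seed so the run is repeatable
samples = [random.gauss(1.0, 2.0) for _ in range(10_000)]

mean_x = sum(samples) / len(samples)                 # estimate of E[X]
mean_fx = sum(f(x) for x in samples) / len(samples)  # estimate of E[f(X)]

# Jensen's inequality says E[f(X)] >= f(E[X]) for convex f.
jensen_gap = mean_fx - f(mean_x)
print(jensen_gap >= 0)  # True
```

For this particular $f$ the gap equals the empirical variance of the samples, which is why it can never be negative.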
Reinforcement Learning Background Credit Assignment Problem: determine how each action in an action sequence contributes to the final outcome. MDP (Markov D
Learning Mathematical Theorem Feature Selection If we have n features, the VC dimension of the hypothesis class would be O(n); if n>>m, over-fitting would
Support Vector Machine Two Margins Functional Margin $\gamma$ and Geometric Margin $\hat{\gamma}$ $$ \hat{\gamma}^{(i)}=\frac{\gamma^{(i)}}{\|w\|} $
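The relation between the two margins can be checked with a small sketch. It follows the notation of this excerpt (functional margin $\gamma$, geometric margin $\hat{\gamma}$) and assumes the usual definition of the functional margin, $y(w^\top x+b)$, which is not visible in the excerpt itself. The function names and the sample point are hypothetical.

```python
import math


def functional_margin(w, b, x, y):
    # Assumed standard definition: y * (w . x + b), with label y in {-1, +1}.
    return y * (sum(wi * xi for wi, xi in zip(w, x)) + b)


def geometric_margin(w, b, x, y):
    # Functional margin divided by the Euclidean norm of w.
    norm_w = math.sqrt(sum(wi * wi for wi in w))
    return functional_margin(w, b, x, y) / norm_w


w, b = [3.0, 4.0], -5.0
x, y = [3.0, 4.0], 1

print(functional_margin(w, b, x, y))  # 20.0
print(geometric_margin(w, b, x, y))   # 4.0

# Rescaling (w, b) changes the functional margin but not the geometric one.
w2, b2 = [2 * wi for wi in w], 2 * b
print(functional_margin(w2, b2, x, y))  # 40.0
print(geometric_margin(w2, b2, x, y))   # 4.0
```

The second pair of prints shows why the geometric margin is the quantity worth maximizing: it is invariant to rescaling of the parameters.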
Error Theorem in ML Basic Concepts $$ \varepsilon(h) = \text{Generalization Error} = E_{(x,y)\sim D}[1(h(x) \neq y)] $$ $$ \hat{\varepsilon}(h)=\text{
