[Spring 2020] Hung-yi Lee Machine Learning (Gradient Descent)

https://www.bilibili.com/video/av94519857?p=5
https://www.bilibili.com/video/av94519857?p=6
https://www.bilibili.com/video/av94519857?p=7

Why does SGD converge faster than GD?
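
The usual argument is that GD makes a single update after seeing all the examples, while SGD updates after every example, so one pass over 20 examples gives 20 (noisier) updates instead of 1. Below is a minimal sketch of that comparison on a toy linear regression. The data, the learning rate, and the function names are all assumptions made for illustration, not taken from the lecture.

```python
import numpy as np

def loss(w, b, x, y):
    """Mean squared error of the linear model y ~ w * x + b."""
    return float(np.mean((y - (w * x + b)) ** 2))

def gd_epoch(w, b, x, y, lr):
    """One pass over the data = ONE update, using the gradient averaged over all examples."""
    err = y - (w * x + b)
    grad_w = -2.0 * np.mean(err * x)
    grad_b = -2.0 * np.mean(err)
    return w - lr * grad_w, b - lr * grad_b

def sgd_epoch(w, b, x, y, lr):
    """One pass over the data = one update PER example."""
    for xi, yi in zip(x, y):
        err = yi - (w * xi + b)
        # Gradient of the single-example squared error is (-2*err*xi, -2*err).
        w, b = w + lr * 2.0 * err * xi, b + lr * 2.0 * err
    return w, b

# Hypothetical toy data: 20 noiseless examples of y = 3x + 1.
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, size=20)
y = 3.0 * x + 1.0
lr = 0.1  # assumed learning rate

w_gd, b_gd = gd_epoch(0.0, 0.0, x, y, lr)
w_sgd, b_sgd = sgd_epoch(0.0, 0.0, x, y, lr)

gd_loss = loss(w_gd, b_gd, x, y)
sgd_loss = loss(w_sgd, b_sgd, x, y)
print("loss after one pass, GD :", gd_loss)
print("loss after one pass, SGD:", sgd_loss)
```

On this toy problem the SGD run ends the pass with a lower loss than the GD run, simply because it has taken 20 steps rather than 1. Each individual SGD step is noisier, so this is a statement about progress per pass over the data, not about the quality of a single step.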

Feature Scaling
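
A common way to do feature scaling is to standardize each feature dimension: subtract its mean and divide by its standard deviation, so that all dimensions end up with a similar range. A minimal sketch, assuming NumPy and a made-up data matrix whose rows are examples and whose columns are features:

```python
import numpy as np

def standardize(X):
    """Scale every column (feature) to zero mean and unit standard deviation."""
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    return (X - mean) / std, mean, std

# Hypothetical data: the second feature is 100x larger than the first.
X = np.array([
    [1.0, 100.0],
    [2.0, 200.0],
    [3.0, 300.0],
    [4.0, 400.0],
])

X_scaled, mean, std = standardize(X)
print(X_scaled)
```

The returned `mean` and `std` should be reused to scale any new data the same way. Note that this sketch does not guard against a constant feature, whose standard deviation would be zero.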

The math behind GD
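
The standard justification for the GD update is a first-order Taylor expansion of the loss around the current point. A sketch for two parameters, where the symbols (a, b), s, u, v, d and η are notation chosen here for illustration:

```latex
% Current parameters: (a, b). Inside a small circle of radius d around (a, b):
L(\theta) \approx s + u(\theta_1 - a) + v(\theta_2 - b),
\qquad
s = L(a, b),\quad
u = \left.\frac{\partial L}{\partial \theta_1}\right|_{(a,b)},\quad
v = \left.\frac{\partial L}{\partial \theta_2}\right|_{(a,b)}

% Minimizing the linear approximation subject to
% (\theta_1 - a)^2 + (\theta_2 - b)^2 \le d^2
% means moving in the direction opposite to (u, v):
\begin{bmatrix} \theta_1 \\ \theta_2 \end{bmatrix}
=
\begin{bmatrix} a \\ b \end{bmatrix}
- \eta
\begin{bmatrix} u \\ v \end{bmatrix},
\qquad \eta > 0
```

The approximation only holds when the circle is small enough, which is why the learning rate η has to be small for each update to be guaranteed to lower the loss.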

Limitations of GD

  • stuck at saddle point
  • stuck at local minima
  • very slow at the plateau
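
All three cases come from the same fact: the update is proportional to the gradient, so GD slows down or stops wherever the gradient is (nearly) zero, whether or not that point is a good minimum. A minimal sketch of the saddle-point case, assuming the toy function f(x, y) = x^2 - y^2 and a hand-picked starting point, neither of which is from the lecture:

```python
import numpy as np

def grad(p):
    """Gradient of f(x, y) = x**2 - y**2, which has a saddle point at (0, 0)."""
    x, y = p
    return np.array([2.0 * x, -2.0 * y])

def gradient_descent(p0, lr=0.1, steps=100):
    """Plain gradient descent with a fixed learning rate."""
    p = np.array(p0, dtype=float)
    for _ in range(steps):
        p = p - lr * grad(p)
    return p

# Starting exactly on the x-axis, the y-gradient is zero at every step,
# so GD slides along x into the saddle point and never leaves it.
p_final = gradient_descent([1.0, 0.0])
grad_norm = float(np.linalg.norm(grad(p_final)))
print("final point:", p_final)
print("gradient norm:", grad_norm)
```

The function has no minimum at all (it decreases without bound along y), yet GD ends up at (0, 0) with a vanishing gradient. A start with any nonzero y would eventually escape, but the closer the start is to the x-axis, the longer GD lingers near the saddle.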

posted @ 2020-08-22 12:19  ZH奶酪