06 2015 档案

摘要:1. vanish of gradientRNN的error相对于某个时间点t的梯度为:\(\frac{\partial E_t}{\partial W}=\sum_{k=1}^{t}\frac{\partial E_t}{\partial y_t}\frac{\partial y_t}{\part... 阅读全文
posted @ 2015-06-02 15:36 冷处理场烟囱 阅读(889) 评论(1) 推荐(1) 编辑