PP: Neural ordinary differential equations

Instead of specifying a discrete sequence of hidden layers, we parameterize the derivative of the hidden state using a neural network.

Before: a discrete sequence of hidden layers.

After: the derivative of the hidden state.

Traditional methods: residual networks, RNN decoders, and normalizing flows build complicated transformations by composing a sequence of transformations to a hidden state.

we parameterize the continuous dynamics of hidden units using an ordinary differential equation (ODE) 常微分函数.

将h(t) 看作一个函数，可以用一个neural network学习h(t)的分布，然后输入层h(0) ----> 输出层h(T);

posted @ 2020-02-12 01:21 keeps_you_warm 阅读(220) 评论(0) 编辑收藏举报

努力加载评论中...

刷新页面返回顶部

PP: Neural ordinary differential equations

公告