Table of Contents
- Predicting Structured Data
- 2006
- paper by Yann LeCun et al.
- http://yann.lecun.com/exdb/publis/orig/lecun-06.pdf
- A tutorial on energy-based learning, which provides a common theoretical framework for many models.
1 Introduction: Energy-Based Models
- assigns a scalar energy to each configuration of the variables
- inference: clamp the observed variables, find values of the remaining variables that minimize the energy (see the sketch after this list)
- learning: find an energy function that gives low energies to correct values and higher energies to incorrect ones
- loss functional: measures the quality of candidate energy functions
- covers both probabilistic and non-probabilistic approaches
- no normalization constant required, so more flexibility in designing the energy function
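
A minimal sketch of the inference step, assuming a discrete answer set and a toy energy function (both `toy_energy` and the label set are made up for illustration):

```python
import numpy as np

def infer(energy, x, labels):
    """Energy-based inference: clamp the observed X, return the
    label Y with the lowest energy E(Y, X)."""
    energies = np.array([energy(y, x) for y in labels])
    return labels[int(np.argmin(energies))]

# Hypothetical energy: low when the label agrees with the sign
# of the input's mean.
def toy_energy(y, x):
    return -y * np.mean(x)

x = np.array([0.3, 1.2, -0.1])
print(infer(toy_energy, x, labels=[-1, +1]))  # -> 1 (lower energy)
```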
1.1 Energy-Based Inference
- example: image pixels \(X\) -> object label \(Y\)
- the energy \(E(Y,X)\) is also called a contrast function, value function, or negative log-likelihood function
- \(Y\) and \(X\) can be discrete or continuous, of any dimension or structure
- inference can use whatever optimization technique suits \(\mathcal Y\): exhaustive search, gradient-based methods, dynamic programming, etc. (sketch below)
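
When \(Y\) is continuous, one of those optimization techniques is plain gradient descent on the energy; a sketch under a hypothetical quadratic energy \(E(Y,X)=\|Y-AX\|^2\):

```python
import numpy as np

def infer_continuous(grad_E, x, y0, lr=0.1, steps=100):
    """Minimize E(y, x) over a continuous y by gradient descent."""
    y = y0.astype(float).copy()
    for _ in range(steps):
        y -= lr * grad_E(y, x)
    return y

# Hypothetical energy E(y, x) = ||y - A x||^2; its gradient in y
# is 2 (y - A x), so the minimizer is y* = A x.
A = np.array([[1.0, 0.5], [0.0, 2.0]])
grad_E = lambda y, x: 2.0 * (y - A @ x)

x = np.array([1.0, -1.0])
print(infer_continuous(grad_E, x, y0=np.zeros(2)))  # ~ A @ x = [0.5, -2.0]
```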
1.2 What Questions Can a Model Answer?
- "What is the Y that is most compatible with this X?" -> prediction, classification, decision-making
- ranking (compare the energies of two answers), detection (compare an answer's energy to a threshold), conditional probabilities (needed when the output is passed to a human or to another system); a sketch follows this list
- \(X\) high-dimensional, \(Y\) low-dimensional: the common case; the converse: image restoration, computer graphics, generation; both high-dimensional: hard! e.g. super-resolution
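
Ranking and detection reduce to simple comparisons of energies; a minimal sketch (the `energy` argument and `threshold` value are whatever the model and application provide):

```python
def rank(energy, x, y1, y2):
    """Ranking: return the answer more compatible with x,
    i.e. the one with lower energy."""
    return y1 if energy(y1, x) < energy(y2, x) else y2

def detect(energy, x, y, threshold):
    """Detection: accept y only if its energy falls below a
    calibrated threshold."""
    return energy(y, x) < threshold
```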
1.3 Decision Making versus Probabilistic Modeling
- energies are uncalibrated and not commensurate across models; to obtain probabilities, use the Gibbs distribution \(P(Y|X) = \frac{e^{-\beta E(Y,X)}}{\int_{y\in\mathcal Y} e^{-\beta E(y,X)}}\), with \(\beta\) an inverse temperature and the denominator the partition function (terms borrowed from statistical physics)
- the integral must converge, which restricts the energy functions that can be used; the partition function is often intractable to compute
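
A sketch of converting energies to probabilities with the Gibbs distribution, in the easy case where \(\mathcal Y\) is a small discrete set so the partition function is a tractable sum:

```python
import numpy as np

def gibbs(energies, beta=1.0):
    """P(y|x) = exp(-beta * E(y,x)) / Z over a discrete answer set,
    where Z (the partition function) is the normalizing sum."""
    logits = -beta * np.asarray(energies, dtype=float)
    logits -= logits.max()        # subtract max for numerical stability
    p = np.exp(logits)
    return p / p.sum()            # divide by the partition function Z

print(gibbs([0.1, 1.0, 3.0]))            # low energy -> high probability
print(gibbs([0.1, 1.0, 3.0], beta=10.0)) # larger beta -> sharper distribution
```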
2 Energy-Based Training: Architecture and Loss Function
- a family of energy functions indexed by a parameter \(W\)
- architecture, the internal structure of the parameterized function \(E(W, Y, X)\)
- e.g. for real-vector inputs, a linear combination of basis functions (as in kernel methods)
- or a neural network
- training samples \(\mathcal S\) plus prior knowledge select the best energy function via a loss functional \(\mathcal L(E,\mathcal S)\); since the family is indexed by \(W\), this becomes a loss function \(\mathcal L(W,\mathcal S)\) (written out in the sketch after this list)
- \(W^* = \arg\min_{W\in \mathcal W}\mathcal L(W,\mathcal S)\)
- \(\mathcal L(E,\mathcal S) = \frac 1P\sum_{i=1}^P L(Y^i, E(W,\mathcal Y, X^i)) + R(W)\)
- \(Y^i\): the desired answer for sample \(i\), fixed; \(\mathcal Y\): the set of answers over which the energy is evaluated as \(Y\) varies
- \(R(W)\): regularizer, encodes prior knowledge about which energy functions are preferable
- standard results from statistical learning theory apply
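
The loss functional above, written out for a discrete answer set; everything here (the linear energy, the regularizer, the data) is a hypothetical stand-in:

```python
import numpy as np

def loss_functional(W, samples, answers, energy, per_sample_loss, R):
    """L(W, S) = (1/P) * sum_i L(Y^i, E(W, ., X^i)) + R(W)."""
    total = 0.0
    for x, y_true in samples:
        # The per-sample loss may inspect the energies of *all*
        # candidate answers, not just the desired one.
        energies = {y: energy(W, y, x) for y in answers}
        total += per_sample_loss(y_true, energies)
    return total / len(samples) + R(W)

# Hypothetical pieces: a linear energy, the "energy loss" of
# section 2.2 (just reads off E(W, Y^i, X^i)), an L2 regularizer.
energy = lambda W, y, x: -y * float(W @ x)
energy_loss = lambda y_true, energies: energies[y_true]
R = lambda W: 0.01 * float(W @ W)

W = np.array([0.5, -0.2])
samples = [(np.array([1.0, 2.0]), +1), (np.array([-1.0, 0.5]), -1)]
print(loss_functional(W, samples, [-1, +1], energy, energy_loss, R))
```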
2.1 Designing a Loss Functional
- the loss shapes the energy surface: push down the energy of correct answers, pull up the energies of incorrect ones (sketch after this list)
- four elements: the architecture (model), the loss function, the learning algorithm (these three are shared with conventional ML), plus the inference algorithm
- prior knowledge enters through the architecture and through the loss function (e.g. the regularizer)
- a loss must be both effective (produces the desired energy surface) and efficient (tractable to minimize)
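
A sketch of one push/pull update, perceptron-style: push down the energy of the correct answer and pull up the energy of the model's current best guess (the linear energy is a hypothetical stand-in):

```python
import numpy as np

def push_pull_step(W, x, y_true, answers, energy, grad_W, lr=0.1):
    """One gradient step that lowers E(W, y_true, x) and raises the
    energy of the most offending incorrect answer."""
    y_hat = min(answers, key=lambda y: energy(W, y, x))  # inference
    if y_hat == y_true:
        return W  # nothing to pull up
    return W - lr * (grad_W(W, y_true, x) - grad_W(W, y_hat, x))

# Hypothetical linear energy E(W, y, x) = -y * W.x and its W-gradient.
energy = lambda W, y, x: -y * float(W @ x)
grad_W = lambda W, y, x: -y * x

W = np.zeros(2)
W = push_pull_step(W, np.array([1.0, 2.0]), +1, [-1, +1], energy, grad_W)
print(W)  # [0.2, 0.4]: energy of +1 pushed down, of -1 pulled up
```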
2.2 Examples of Loss Functions
- concentrate on the data-dependent part \(L(Y^i, E(W,\mathcal Y, X^i))\)
- discuss which standard loss functions are 'good' and which are 'bad'
- energy loss: \(L_{\text{energy}}(Y^i, E(W,\mathcal Y, X^i)) = E(W, Y^i, X^i)\); it only pushes down on the correct answer and never pulls up on incorrect ones, so the energy surface can collapse (e.g. become zero everywhere)
- it works only when the architecture pulls up other energies automatically, e.g. \(E(W, Y, X)=\|Y-G(W,X)\|^2\): lowering the energy at \(Y^i\) raises it elsewhere; inference is trivial (\(Y^* = G(W,X)\)) and the energy loss reduces to MSE (sketch below)
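
A minimal sketch of the case where the energy loss does work: \(E(W,Y,X)=\|Y-G(W,X)\|^2\) with a hypothetical linear \(G\), so minimizing the energy loss is plain MSE regression and inference is just \(Y^*=G(W,X)\):

```python
import numpy as np

# G(W, x) = W @ x, so E(W, y, x) = (y - W.x)^2 and the energy loss
# over the training set is the mean squared error.
def energy_loss_grad(W, x, y):
    """Gradient in W of E(W, y, x) = (y - W.x)^2."""
    return -2.0 * (y - W @ x) * x

rng = np.random.default_rng(0)
W_true = np.array([2.0, -1.0])
X = rng.normal(size=(200, 2))
Y = X @ W_true

W = np.zeros(2)
for x, y in zip(X, Y):              # SGD on the energy loss (= MSE)
    W -= 0.05 * energy_loss_grad(W, x, y)
print(W)                            # approaches W_true = [2, -1]

x_new = np.array([1.0, 1.0])
print(W @ x_new)                    # inference is trivial: Y* = G(W, x_new)
```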