Deep Learning Computed Tomography [rough paper translation + my notes]

abstract

The paper shows that image reconstruction can be expressed as a neural network; specifically, filtered back-projection (the classic CT algorithm) can be mapped onto a deep neural network architecture. Implementing the back-projection naively as a learnable matrix multiplication, as in iterative reconstruction, does not work well; it is more effective to treat the back-projection layer as a fixed function and to compute its gradient as a projection operation. This yields a data-driven way to do joint optimization of correction steps in the projection domain and the image domain. The authors demonstrate that weightings and additional filter layers can be learned which reduce the reconstruction error in a limited-angle setting by a factor of two (i.e. the error is roughly halved), at the same computational complexity as filtered back-projection.

introduction

The paper proposes a back-projection layer for parallel-beam and fan-beam projection, as well as a weighting layer, enabling end-to-end training; heuristics applied in the projection domain before back-projection can then be learned. Hand-crafted correction methods can thus be replaced by learned correction strategies and combined with other correction steps.

methodology

Discrete FBP:

\[f(u, v) \approx \frac{\pi}{N} \sum_{n=1}^{N} q\left(u \cos \left(\theta_{n}\right)+v \sin \left(\theta_{n}\right), \theta_{n}\right) \]

1D interpolation:

\[f(u, v) \approx \frac{\pi}{N} \sum_{n=1}^{N} \sum_{m=1}^{M} w_{m}\left(u, v, \theta_{n}\right) \cdot q_{\left\lceil u \cos \left(\theta_{n}\right)+v \sin \left(\theta_{n}\right)-\frac{M+2}{2}+m\right\rceil, n} \]
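A minimal numpy sketch of this discrete back-projection sum (my own code, not the paper's; `sino` is assumed to be an already filtered sinogram of shape `(N, M)` with detector bins centered at zero, and `np.interp` supplies the linear-interpolation weights $w_m$):

```python
import numpy as np

def backproject(sino, thetas, size):
    """Discrete back-projection:
    f(u, v) ~ pi/N * sum_n q(u cos(theta_n) + v sin(theta_n), theta_n).

    sino   : filtered sinogram, shape (N, M) -- N angles, M detector bins
    thetas : projection angles in radians, shape (N,)
    size   : side length of the square output image
    """
    N, M = sino.shape
    # pixel coordinates (u, v), centered at the image origin
    c = (size - 1) / 2.0
    u, v = np.meshgrid(np.arange(size) - c, np.arange(size) - c)
    det = np.arange(M) - (M - 1) / 2.0        # detector bin positions
    f = np.zeros((size, size))
    for n, theta in enumerate(thetas):
        s = u * np.cos(theta) + v * np.sin(theta)   # detector coordinate hit by each pixel
        # 1-D linear interpolation of the filtered projection = the w_m weights above
        f += np.interp(s, det, sino[n], left=0.0, right=0.0)
    return np.pi / N * f
```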

With bias $= 0$ and the identity activation $f(x) = x$, a standard neuron computes $f(y_i) = \sum_{j=1}^{N} w_{ij} x_j$. Reinterpreting the output index so that $f(x, y)$ denotes the pixel at $(x, y)$ gives:

\[f\left(x_{i}, y_{j}\right)=\sum_{n=1}^{N} \sum_{m=1}^{M} w_{i+(j-1) \cdot I, m+(n-1) \cdot M} \cdot q_{m, n} \]

The same interpolation is applied (with the interpolation size made big enough to cover the length of the detector, zero-padding the signal as needed):

\[f\left(u_{i}, v_{j}\right) \approx \frac{\pi}{N} \sum_{n=1}^{N} \sum_{m=1}^{M} w_{m}\left(u_{i}, v_{j}, \theta_{n}\right) \cdot q_{m, n} \]

Linear interpolation yields at most two non-zero coefficients out of every $M$ interpolation coefficients, resulting in an extremely sparse matrix.
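Because only the two detector bins bracketing each pixel's projection coordinate receive non-zero linear-interpolation weights, the whole back-projection can be stored as one sparse matrix. A sketch of that construction with scipy.sparse (my own construction, same geometry assumptions as above):

```python
import numpy as np
from scipy.sparse import coo_matrix

def backprojection_matrix(thetas, size, M):
    """Build the sparse matrix W with f = W @ q, q being the flattened (N, M) sinogram.

    Each row (one pixel) has at most 2 non-zeros per angle: the linear
    interpolation weights of the two neighbouring detector bins.
    """
    N = len(thetas)
    c = (size - 1) / 2.0
    u, v = np.meshgrid(np.arange(size) - c, np.arange(size) - c)
    u, v = u.ravel(), v.ravel()               # one entry per pixel
    rows, cols, vals = [], [], []
    for n, theta in enumerate(thetas):
        s = u * np.cos(theta) + v * np.sin(theta) + (M - 1) / 2.0  # detector coordinate
        m0 = np.floor(s).astype(int)          # left neighbouring bin
        w1 = s - m0                           # weight of the right neighbour
        for m, w in ((m0, 1.0 - w1), (m0 + 1, w1)):
            ok = (m >= 0) & (m < M)           # zero-padding: drop bins off the detector
            rows.append(np.nonzero(ok)[0])
            cols.append(n * M + m[ok])
            vals.append(np.pi / N * w[ok])
    rows, cols, vals = map(np.concatenate, (rows, cols, vals))
    return coo_matrix((vals, (rows, cols)), shape=(size * size, N * M)).tocsr()
```

`(W @ sino.ravel()).reshape(size, size)` then reproduces the interpolation loop above in a single sparse matrix-vector product.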

Finally we obtain:

\[f\left(x_{i}, y_{j}\right)=\max \left[0, \sum_{n=1}^{N} \sum_{m=1}^{M} \frac{\pi}{N} w_{m}\left(u_{i}, v_{j}, \theta_{n}\right) \cdot\left(\sum_{k=-M / 2}^{M / 2} w_{k} \cdot p_{m-k, n}\right)\right] \]
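Read as a computation graph, this formula is exactly a three-stage forward pass: a 1-D convolution per projection (the inner sum over $k$), the weighted back-projection (the sums over $n, m$), and a ReLU. A short sketch tying the stages together (reusing `backprojection_matrix` from above; the kernel here is a placeholder, not the learned one):

```python
import numpy as np

def fbp_network_forward(p, kernel, W, size):
    """p: raw sinogram (N, M); kernel: 1-D filter (the inner sum over k);
    W: sparse back-projection matrix from above; returns the (size, size) image."""
    # convolution layer: filter every projection row independently
    q = np.apply_along_axis(np.convolve, 1, p, kernel, mode="same")
    # fully connected back-projection layer, then the ReLU max[0, .]
    return np.maximum(0.0, (W @ q.ravel()).reshape(size, size))
```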

Each step of FBP is mapped to one layer of the network:

[Figure: the FBP pipeline mapped onto network layers]

Parallel-Beam Back-Projection Layer

A fully connected layer. This layer has no adjustable parameters: the coefficients are computed once in the forward pass, \(\mathbf{y}_{l}=\mathbf{W}_{l} \mathbf{y}_{l-1}\), and back-propagation is \(\mathbf{E}_{l-1}=\mathbf{W}_{l}^{T} \mathbf{E}_{l}\). [So the part that actually gets trained is the "filtered" stage?]
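The point of treating back-projection as a fixed function is that $\mathbf{W}$ never has to be stored as trainable weights or updated: the forward pass runs a back-projector, and the backward pass runs the matching projector on the error. A minimal `torch.autograd.Function` sketch of such a fixed layer (my own illustration; here $\mathbf{W}$ is an explicit matrix, but it could equally be a matrix-free back-projector):

```python
import torch

class FixedBackProjection(torch.autograd.Function):
    """Back-projection layer with no trainable parameters:
    forward  y_l = W y_{l-1},  backward  E_{l-1} = W^T E_l."""

    @staticmethod
    def forward(ctx, q, W):                   # W is a plain tensor, not a Parameter
        ctx.save_for_backward(W)
        return W @ q                          # fixed back-projection

    @staticmethod
    def backward(ctx, grad_out):
        (W,) = ctx.saved_tensors
        return W.t() @ grad_out, None         # gradient = forward projection; no grad for W
```

Usage is `f = FixedBackProjection.apply(q, W)`; since W carries no gradient, the optimizer never touches it.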

The back-projection operator is the transpose of the projection operator. [This can arguably be read off from the formulas above.]
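With the sparse matrix from the earlier sketch this is directly visible: applying $\mathbf{W}^T$ to an image vector sums pixel values along each ray, i.e. it acts as a forward projector (up to the $\pi/N$ scaling baked into W). A toy check under the same assumptions:

```python
import numpy as np

thetas = np.linspace(0.0, np.pi, 180, endpoint=False)
image = np.zeros((64, 64)); image[28:36, 28:36] = 1.0   # toy square phantom
W = backprojection_matrix(thetas, size=64, M=95)        # sparse matrix from above
sino_hat = (W.T @ image.ravel()).reshape(180, 95)       # W^T behaves as a forward projector
```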

Extension to Fan-Beam Reconstruction

[Figure: the fan-beam reconstruction network]

Weighting Layer: extremely sparse. [A diagonal matrix?]
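Written as a matrix, an element-wise weighting of the sinogram is indeed diagonal, hence extremely sparse; in practice it is just a learnable per-entry multiplication. A PyTorch sketch (my own; the initialization uses the standard flat-detector fan-beam cosine weighting $D/\sqrt{D^2+s^2}$, and all names and default values are mine):

```python
import torch
import torch.nn as nn

class WeightingLayer(nn.Module):
    """Element-wise (i.e. diagonal-matrix) weighting of the sinogram."""

    def __init__(self, n_angles, n_dets, src_det_dist=1000.0, det_spacing=1.0):
        super().__init__()
        s = (torch.arange(n_dets, dtype=torch.float32) - (n_dets - 1) / 2.0) * det_spacing
        # cosine weighting for a flat fan-beam detector: D / sqrt(D^2 + s^2)
        cos_w = src_det_dist / torch.sqrt(src_det_dist ** 2 + s ** 2)
        self.weight = nn.Parameter(cos_w.expand(n_angles, n_dets).clone())

    def forward(self, sino):                  # sino: (n_angles, n_dets)
        return self.weight * sino
```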

Fan-Beam Back-Projection Layer: [only a short paragraph in the paper, and I did not fully follow it.]

Convergence and Overfitting

The convolutional layer is initialized with the ramp filter, and the weighting layer accounts for cosine weighting or redundancy weighting. These initializations can be used for very effective pre-training.
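The discrete ramp (Ram-Lak) filter used for this initialization has a well-known closed form; a numpy sketch (my own helper, `tau` is the detector spacing):

```python
import numpy as np

def ramlak_kernel(half_width, tau=1.0):
    """Discrete ramp (Ram-Lak) filter, the classic initialization of the conv layer:
    h[0] = 1/(4 tau^2), h[n] = -1/(pi^2 n^2 tau^2) for odd n, 0 for even n != 0."""
    n = np.arange(-half_width, half_width + 1)
    h = np.zeros_like(n, dtype=float)
    h[n == 0] = 1.0 / (4.0 * tau ** 2)
    odd = n % 2 != 0
    h[odd] = -1.0 / (np.pi ** 2 * n[odd] ** 2 * tau ** 2)
    return h
```

Loading this kernel into the convolution layer (and cosine/redundancy weights into the weighting layer) makes the untrained network reproduce plain FBP, so training starts from a sensible solution rather than from random weights.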

experiment & conclusion

Nothing particularly interesting here.

My take: what is actually being trained is the filter kernel; the other parts are essentially fixed. (Not necessarily correct.) See my previous post for CT basics. In the fan-beam case, the weighting layer corresponds to the fan-to-parallel correction step, the convolution layer corresponds to the filter, and the fully connected layer corresponds to back-projection. As for the final rectification I was unsure about: $\max[0,\cdot]$ is a ReLU, so it does clamp values below zero to zero, which is consistent with attenuation coefficients being non-negative.

posted @ 2021-10-08 20:09 zyx_45889