【NeurIPS2022】Cross Aggregation Transformer for Image Restoration

请添加图片描述

研究动机：当前方法 Transformer 方法把图像分成8x8的小块处理，the square window lacks inter-window interaction, leading to the slow increase of the receptive field。同时，the channel-wise attention mechanism may lose some spatial information。影响了 Transformer 方法在图像修复里的应用。

为此，作者提出了 Cross Aggregation Transformer，架构如下图所示，主干网络为RCAN（超分辨率中用的非常多的网络），中间是多个 CAT block 的堆叠。CAT block 的核心是作者提出的注意力机制：Rectangle-Window Self-Attention（Rwin-SA）。

请添加图片描述