Summary: > [Su J., Lu Y., Pan S., Murtadha A., Wen B. and Liu Y. RoFormer: Enhanced transformer with rotary position embedding. ](http://arxiv.org/abs/21 …
posted @ 2023-07-24 17:38 馒头and花卷 Views (248) Comments (0) Recommendations (0)
Summary: > [Zhang B. and Sennrich R. Root mean square layer normalization. NIPS, 2019.](http://arxiv.org/abs/1910.07467) ## Overview RMSNorm reduces computation time. ## RMSNorm - …
posted @ 2023-07-24 10:44 馒头and花卷 Views (1144) Comments (2) Recommendations (0)