摘要:
[TOC] > [Su J., Lu Y., Pan S., Murtadha A., Wen B. and Liu Y. RoFormer: Enhanced transformer with rotary position embedding. ](http://arxiv.org/abs/21 阅读全文
摘要:
[TOC] > [Zhang B. and Sennrich R. Root mean square layer normalization. NIPS, 2019.](http://arxiv.org/abs/1910.07467) ## 概 RMSNorm 节省时间. ## RMSNorm - 阅读全文