Coursera, Deep Learning 5, Sequence Models, week4, Transformer Network

 

 

 

 self-attention

 multi-head attention

 

posted @ 2021-11-10 11:16  mashuai_191  阅读(39)  评论(0编辑  收藏  举报