Bert结构
2023-02-27 14:44:12 14 举报
基于Transformer的Bert模型基本结构
作者其他创作
大纲/内容
BertLayer
TokenEmbedding
BertAttention
BertIntermediateFeed Forward
SegmentEmbedding
BertSelfOutputLayer Normal & Dropout
Layer Normal & Dropout
BertEncoder
Downstream Tasks
BertOutputFeed ForwardLayer Normal & Dropout
Input Embedding
Position Ids
Input Ids
× N
Transformer Blocks
Token Type Ids
PositionalEmbedding
BertSelfAttentionMulti-head Attention
收藏
收藏
0 条评论
下一页