Question
Multiple-select question

What operations are part of a standard Transformer block? (Select all that apply.)

Options
A. Layer normalization
B. Convolutional layer
C. Residual connection
D. Self-attention layer

Analysis
The question asks which operations are part of a standard Transformer block, and it is a select-all-that-apply question. Option A: Layer normalization. In a typical Transformer block, the outputs of the multi-head self-attention and feed-forward sublayers are usually passed through la...
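To make the structure of such a block concrete, here is a minimal pure-Python sketch. It assumes the post-normalization arrangement (each sublayer is followed by a residual addition and then layer normalization), uses a single attention head instead of multiple heads, omits the learned scale and shift in layer normalization, and fills the weights with random numbers. All function and parameter names (`transformer_block`, `w_q`, `w1`, and so on) are hypothetical and chosen only for illustration.

```python
import math
import random


def layer_norm(x, eps=1e-5):
    """Normalize one token vector to zero mean and roughly unit variance."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]


def matmul(a, b):
    """Multiply an (n x k) matrix by a (k x m) matrix, both stored as lists of rows."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one row of attention scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention (simplified from multi-head)."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    scale = math.sqrt(len(q[0]))
    scores = [[sum(a * b for a, b in zip(q_row, k_row)) / scale for k_row in k]
              for q_row in q]
    weights = [softmax(r) for r in scores]
    return matmul(weights, v)


def feed_forward(x, w1, w2):
    """Position-wise feed-forward network with a ReLU in between (biases omitted)."""
    hidden = [[max(0.0, h) for h in row] for row in matmul(x, w1)]
    return matmul(hidden, w2)


def add(a, b):
    """Element-wise sum of two matrices, used for the residual connections."""
    return [[u + v for u, v in zip(ra, rb)] for ra, rb in zip(a, b)]


def transformer_block(x, params):
    # Sublayer 1: self-attention, then residual connection, then layer normalization.
    attn = self_attention(x, params["w_q"], params["w_k"], params["w_v"])
    x = [layer_norm(row) for row in add(x, attn)]
    # Sublayer 2: feed-forward, then residual connection, then layer normalization.
    ff = feed_forward(x, params["w1"], params["w2"])
    return [layer_norm(row) for row in add(x, ff)]


def random_matrix(rows, cols, rng):
    """Random weights for illustration only; a real model would learn these."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
d_model, d_ff, seq_len = 4, 8, 3  # assumed toy sizes
params = {
    "w_q": random_matrix(d_model, d_model, rng),
    "w_k": random_matrix(d_model, d_model, rng),
    "w_v": random_matrix(d_model, d_model, rng),
    "w1": random_matrix(d_model, d_ff, rng),
    "w2": random_matrix(d_ff, d_model, rng),
}
x = random_matrix(seq_len, d_model, rng)  # three tokens, four features each
y = transformer_block(x, params)
```

The output `y` has the same shape as the input `x`, which is what lets blocks be stacked. The sketch is built only from self-attention, a feed-forward network, residual additions, and layer normalization; no convolutional layer appears anywhere in it.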
