Question

CS-7643-O01, OAN, OSZ Quiz #4: Module 3

Matching question

Find the BEST match for the PER-LAYER computational complexity of each layer type (i.e., the complexity of a single layer with respect to the variables below).

n denotes the sequence length
d denotes the representation dimension

1: Self-Attention (vanilla)
2: Recurrent (vanilla)

Options
A. O(1)
B. O(n^2 * d)
C. O(n^2 * d^2)
D. O(n * d)
E. O(n)
F. O(n * d^2)

Analysis
We need to match the per-layer computational complexity of two architectures, expressed in terms of n and d. The options cover different growth rates, so we evaluate each in turn.

Option: O(1)
This is incorrect for both self-attention and recurrent layers. Each must process its inputs and generate outputs, so its cost grows linearly or quadratically with n and d. An O(1) cost would mean constant work regardless of sequence length or representation size, which contradicts the mechanics of both layers.

Option: O(n^2 * d)
This matches the standard result for vanilla self-attention: at each layer, attention scores are computed for every pair of posi......
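The growth rates discussed above can be checked by counting multiply-add operations directly. The following is a minimal sketch under stated assumptions, not the quiz's official solution: the function names are hypothetical, the self-attention count covers only the pairwise score and weighted-sum steps (the Q/K/V projections are left out), and the recurrent layer is assumed to have a d-dimensional input and a d-dimensional hidden state.

```python
def self_attention_macs(n: int, d: int) -> int:
    """Multiply-adds for the core of one vanilla self-attention layer.

    Assumption: only the pairwise score and weighted-sum steps are counted;
    the Q/K/V projections are omitted from this illustration.
    """
    scores = n * n * d        # one d-dimensional dot product per pair of positions
    weighted_sum = n * n * d  # each of n outputs mixes n value vectors of size d
    return scores + weighted_sum


def recurrent_macs(n: int, d: int) -> int:
    """Multiply-adds for one vanilla recurrent layer over a length-n sequence.

    Assumption: input and hidden state are both d-dimensional.
    """
    per_step = d * d + d * d  # hidden-to-hidden plus input-to-hidden matrix-vector products
    return n * per_step


if __name__ == "__main__":
    n, d = 128, 64
    # Doubling n: self-attention grows by 4x, recurrent by 2x.
    print(self_attention_macs(2 * n, d) / self_attention_macs(n, d))
    print(recurrent_macs(2 * n, d) / recurrent_macs(n, d))
    # Doubling d: self-attention grows by 2x, recurrent by 4x.
    print(self_attention_macs(n, 2 * d) / self_attention_macs(n, d))
    print(recurrent_macs(n, 2 * d) / recurrent_macs(n, d))
```

Under these assumptions the self-attention count scales as n^2 * d and the recurrent count as n * d^2, which is the scaling usually quoted for the two layer types.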
