Question

11785/11685/11485 Quiz-09

Single-choice question

If you train a seq-to-seq model for translating English to French, how can the probability distribution for the second predicted French word be written if the input English words are and the output French words are , and is the second output of the network? Hint: Refer to Lecture 17 Slides 57-65 (general concepts, not really the specific content)


Analysis
The question asks for the probability distribution over the second predicted French word in a seq-to-seq translation setup, where the input English sequence is (e1, e2, e3) and the output generated so far is (f1), with f2 to be predicted next. First, consider what the model conditions on when predicting the next target token in a standard encoder-decoder (seq-to-seq) model trained with teacher forcing. The second French word f2 is predicted based on the entire source se...
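
The conditioning described above can be illustrated with a toy sketch. It is not the lecture's model: the function name `second_word_distribution`, the random score tables, and the order-insensitive summed "encoder" are all hypothetical stand-ins, chosen only to show that the distribution for the second target word takes both the whole source sentence and the first target word as inputs.

```python
import math
import random


def softmax(scores):
    """Turn a list of real-valued scores into a probability distribution."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def second_word_distribution(source_ids, first_target_id, vocab_size=5, seed=0):
    """Toy stand-in for the distribution over f2 given (e1, e2, e3) and f1.

    All parameters are random and hypothetical; a real seq-to-seq model
    would use a learned recurrent or attention-based encoder and decoder.
    """
    rng = random.Random(seed)
    # Hypothetical score tables: one row per source token id (0..9)
    # and one row per previous target token id.
    src_table = [[rng.gauss(0, 1) for _ in range(vocab_size)] for _ in range(10)]
    tgt_table = [[rng.gauss(0, 1) for _ in range(vocab_size)] for _ in range(vocab_size)]

    # "Encoder": summarise the entire source sentence (here, a plain sum).
    context = [sum(src_table[i][k] for i in source_ids) for k in range(vocab_size)]

    # "Decoder", step 2: combine the source summary with the previous word f1.
    scores = [context[k] + tgt_table[first_target_id][k] for k in range(vocab_size)]
    return softmax(scores)


# Distribution over the toy French vocabulary for the second word,
# given source ids (1, 2, 3) and first target word id 0.
print(second_word_distribution([1, 2, 3], 0))
```

Changing either the source ids or the first target id changes the returned distribution, which is the dependence the analysis describes. The summed context ignores word order, so it should be read only as a placeholder for a real encoder.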
