Question

CS-7643-O01, OAN, OSZ Quiz #5: Module 4

Single-choice question

For the traditional dynamic programming algorithms used for solving MDPs, which of the following statements holds true?

Options
A. Neither Q-Iteration nor Value Iteration require a discrete action space.
B. V-Iteration requires a discrete action space but Q-Iteration does not.
C. Both Q-Iteration and Value Iteration require a discrete action space.
D. Q-Iteration requires a discrete action space but Value Iteration does not.
Analysis
When evaluating traditional dynamic programming approaches for MDPs, we need to consider what each algorithm does at each backup step. Option A: 'Neither Q-Iteration nor Value Iteration require a discrete action space.' In standard finite MDPs, both methods take a maximum over the possible actions at each state (Q-Iteration when updating Q, Value Iteration when updating V). If the action space were continuous, these maximizations could becom...
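To make the backup step concrete, here is a minimal sketch of tabular Value Iteration and Q-Iteration in Python. The two-state, two-action MDP, the discount factor, and all names (`P`, `GAMMA`, `backup`, `value_iteration`, `q_iteration`) are assumptions invented for illustration; none of them come from the quiz. The point to notice is that both updates call `max` over an enumerated list of actions.

```python
# Hypothetical 2-state, 2-action MDP, invented purely for illustration.
# P[s][a] is a list of (probability, next_state, reward) triples.
P = {
    0: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 1.0)]},
    1: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 2.0)]},
}
GAMMA = 0.9          # assumed discount factor
STATES = list(P)
ACTIONS = [0, 1]     # a finite set, so max() can enumerate it


def backup(s, a, V):
    """Expected one-step return of taking action a in state s under values V."""
    return sum(p * (r + GAMMA * V[s2]) for p, s2, r in P[s][a])


def value_iteration(sweeps=500):
    V = {s: 0.0 for s in STATES}
    for _ in range(sweeps):
        # The max enumerates every action in the current state.
        V = {s: max(backup(s, a, V) for a in ACTIONS) for s in STATES}
    return V


def q_iteration(sweeps=500):
    Q = {(s, a): 0.0 for s in STATES for a in ACTIONS}
    for _ in range(sweeps):
        # The same max over actions, here taken at the next state.
        V = {s: max(Q[(s, a)] for a in ACTIONS) for s in STATES}
        Q = {(s, a): backup(s, a, V) for s in STATES for a in ACTIONS}
    return Q


if __name__ == "__main__":
    print(value_iteration())
    print(q_iteration())
```

In this tabular form, neither loop can be written without iterating over `ACTIONS`. With a continuous action space, each `max` would have to be replaced by some other optimization or approximation step, which is outside what this sketch covers.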
