Questions
Single choice
Question4 If you use a decision tree to classify the input matrix for the output then which of the following splits at the root node gives the highest information gain? Assume f1 denotes the first feature, corresponding to the first column of X, and f2 denotes the second feature, corresponding to the second column. (select one) f2> 5 f1 > 6 f2> 2 f1> 7 ResetMaximum marks: 2 Flag question undefined
Options
A.f2> 5
B.f1 > 6
C.f2> 2
D.f1> 7

View Explanation
Verified Answer
Please login to view
Step-by-Step Analysis
We are asked to compare possible root splits for a decision tree using two features, f1 (first column of X) and f2 (second column of X), with thresholds given in the options. Since the actual class labels y and the exact distribution of X are not shown here, we reason conceptually about how information gain would behave for each split.
Option 1: f2 > 5
- This split uses the second feature with a threshold of 5. The effectiveness of this split depends on how many samples with f2 > 5 belong to each class versus how many samples with f2 <= 5 belong to each class.
- If the data is such that f2 > 5 c......Login to view full explanationLog in for full answers
We've collected over 50,000 authentic exam questions and detailed explanations from around the globe. Log in now and get instant access to the answers!
Similar Questions
In a decision tree, variables used closer to the root are generally:
Decision Trees split data to minimize which metric in classification?
Which of the following numbers is closest to the best parameter setting for the Decision Tree algorithm on the income data?
Question11 We will use the dataset below to learn a decision tree which predicts if a patient has COVID-19 (Yes, or No), based on the Temperature (High, Medium, or Low) and whether the patient has dry cough (Yes, or No). (note: [math] )[table] Temp. | Cough | COVID-19 Low | No | No Low | Yes | Yes Medium | No | No Medium | Yes | Yes High | No | Yes High | Yes | Yes [/table]Assuming that H(COVID-19) = 0.8 and Gain(S, Temperature) = 0.3 and Gain(S, Cough) = 0.5, which one of the following would be the full decision tree, learnt for this dataset? (select one) ResetMaximum marks: 1.5 Flag question undefined
More Practical Tools for Students Powered by AI Study Helper
Making Your Study Simpler
Join us and instantly unlock extensive past papers & exclusive solutions to get a head start on your studies!