Questions
Questions

2261 BUSQOM 0102 SEC1200 Project 2 Quiz

Single choice

Why was model-building strategy #1 used to estimate generalization performance on the income data for the decision tree, logistic regression, and random forest models?

Options
A.There were 48,842 rows of data, so nested cross validation would have taken a long time to run
B.Model-building strategy #1 leads to a better estimate of generalization performance, because the estimate is based on more splits of the data
C.Model-building strategy #1 is more appropriate than model-building strategy #2 for logistic regression and decision tree models
D.In Model-building strategy #1 the splits are always stratified, which is the best approach for classification models
View Explanation

View Explanation

Verified Answer
Please login to view
Step-by-Step Analysis
Start by identifying what the question is asking: why was model-building strategy #1 chosen to estimate generalization performance for the income data across three models (decision tree, logistic regression, random forest). Option 1: 'There were 48,842 rows of data, so nested cross validation would have taken a long time to run' — This reflects a practical concern about computational cost. Nested cross-validation is more intensive because it involves an inner loop for hyperparameter tuning and an outer loop for performance estimation. When the dataset is large (e.g., tens of thousands of ......Login to view full explanation

Log in for full answers

We've collected over 50,000 authentic exam questions and detailed explanations from around the globe. Log in now and get instant access to the answers!

Similar Questions

More Practical Tools for Students Powered by AI Study Helper

Join us and instantly unlock extensive past papers & exclusive solutions to get a head start on your studies!