Questions
2261 BUSQOM 0102 SEC1200 Project 2 Quiz
Single choice
Why was model-building strategy #1 used to estimate generalization performance on the income data for the decision tree, logistic regression, and random forest models?
Options
A.There were 48,842 rows of data, so nested cross validation would have taken a long time to run
B.Model-building strategy #1 leads to a better estimate of generalization performance, because the estimate is based on more splits of the data
C.Model-building strategy #1 is more appropriate than model-building strategy #2 for logistic regression and decision tree models
D.In Model-building strategy #1 the splits are always stratified, which is the best approach for classification models
View Explanation
Verified Answer
Please login to view
Step-by-Step Analysis
Start by identifying what the question is asking: why was model-building strategy #1 chosen to estimate generalization performance for the income data across three models (decision tree, logistic regression, random forest).
Option 1: 'There were 48,842 rows of data, so nested cross validation would have taken a long time to run' — This reflects a practical concern about computational cost. Nested cross-validation is more intensive because it involves an inner loop for hyperparameter tuning and an outer loop for performance estimation. When the dataset is large (e.g., tens of thousands of ......Login to view full explanationLog in for full answers
We've collected over 50,000 authentic exam questions and detailed explanations from around the globe. Log in now and get instant access to the answers!
Similar Questions
In k-fold cross-validation, what happens in each iteration?
What is one advantage of k-fold cross-validation over a single holdout set?
Which of the following is TRUE about cross validation?
Question at position 39 Multiple Choice Question: Which of the following is NOT an advantage of cross-validation over simple train-test split? It utilizes the data more effectively than a single train-test split.It reduces the sensitivity of the model's performance to the train-test split.It provides the distribution of a metric to evaluate the model's performance.It always improves the model's generalization to unseen data.
More Practical Tools for Students Powered by AI Study Helper
Making Your Study Simpler
Join us and instantly unlock extensive past papers & exclusive solutions to get a head start on your studies!