Questions
FINTECH 540.01.Fa25 Quiz 5 - Unsupervised Clustering Methods
Single choice
When applying K-means clustering to a dataset consisting of multiple numerical features, consider the following actions and their impact: I The cluster centroids are computed as the mean of the observations assigned to each cluster. II The K-means algorithm can be directly applied to datasets with mixed data types (e.g., numerical and categorical) without any preprocessing. III Running K-means multiple times with different random initializations causes overfitting in any case. IV K-means always finds the same final cluster configuration regardless of the initial centroids chosen.
Options
A.III and IV
B.I and II
C.II, III and IV
D.I
View Explanation
Verified Answer
Please login to view
Step-by-Step Analysis
Start by identifying what each statement is claiming about K-means behavior and data handling.
Option A: III and IV. This combination claims that running K-means multiple times with different random initializations always leads to overfitting, and that K-means always converges to the same final configuration regardless of starting centroids. In reality, multiple restarts are u......Login to view full explanationLog in for full answers
We've collected over 50,000 authentic exam questions and detailed explanations from around the globe. Log in now and get instant access to the answers!
Similar Questions
Which of the following statements about the K-means clustering algorithm is true?
K-Means involves computing distances in input space and assigning data points to the nearest prototype points. What can one say about these prototype points? I Every cluster has its data assigned to the nearest prototype point. II They are not restricted to being data points in the training set. III They are necessarily included in the training set. IV They are usually referred to as centroids.
A retail chain wants to group its 2 million customers based on purchase behavior. Analysts consider using K-Means because they need fast runtime and easy interpretability, but the leadership team worries whether the method assumes overly simple cluster shapes. What limitation of K-Means should the team be most concerned about?
Your retail analytics team is trying to group customers into clear segments based on buying behavior, purchasing channels, and store proximity. The leadership team wants each customer assigned to exactly one group—similar to the customer profiles shown below—so that marketing and pricing strategies can be targeted without any overlap. Which clustering method best fits this requirement?
More Practical Tools for Students Powered by AI Study Helper
Making Your Study Simpler
Join us and instantly unlock extensive past papers & exclusive solutions to get a head start on your studies!