Questions
IS 4487-006 Fall 2025 Final Exam December 10 from 10:30 to 12:30
Single choice
A retail chain wants to group its 2 million customers based on purchase behavior. Analysts consider using K-Means because they need fast runtime and easy interpretability, but the leadership team worries whether the method assumes overly simple cluster shapes. What limitation of K-Means should the team be most concerned about?
Options
A.It assumes clusters are spherical and similar in size
B.It requires Gaussian distributions to work
C.It always produces overlapping clusters
D.It is too computationally slow for large datasets
View Explanation
Verified Answer
Please login to view
Step-by-Step Analysis
To evaluate the options, start by recalling a core property of K-Means clustering: it assigns points to clusters by minimizing within-cluster variance using a Euclidean distance metric, which implicitly favors clusters that are roughly spherical and of similar size.
Option 1: 'It assumes clusters are spherical and similar in size' aligns with this li......Login to view full explanationLog in for full answers
We've collected over 50,000 authentic exam questions and detailed explanations from around the globe. Log in now and get instant access to the answers!
Similar Questions
Which of the following statements about the K-means clustering algorithm is true?
K-Means involves computing distances in input space and assigning data points to the nearest prototype points. What can one say about these prototype points? I Every cluster has its data assigned to the nearest prototype point. II They are not restricted to being data points in the training set. III They are necessarily included in the training set. IV They are usually referred to as centroids.
Your retail analytics team is trying to group customers into clear segments based on buying behavior, purchasing channels, and store proximity. The leadership team wants each customer assigned to exactly one group—similar to the customer profiles shown below—so that marketing and pricing strategies can be targeted without any overlap. Which clustering method best fits this requirement?
Question14 Which of the following aspects do not affect the performance of the standard k-means clustering? (you can choose more than one) Choice of initial cluster centres Probability of data samples belonging to each cluster Distribution of data samples Number of clusters ResetMaximum marks: 1.5 Flag question undefined
More Practical Tools for Students Powered by AI Study Helper
Making Your Study Simpler
Join us and instantly unlock extensive past papers & exclusive solutions to get a head start on your studies!