This item requires the dataset EastWestAirlinesCluster.xls which can be found on the subject Interact site.
The dataset EastWestAirlinesCluster.xls contains information on 3999 passengers who belong to an airline’s frequent flier program. For each passenger the data include information on their mileage history and on different ways they accrued or spent miles in the last year. The goal is to try to identify clusters of passengers that have similar characteristics for the purpose of targeting different segments for different types of mileage offers.
a) Apply hierarchical clustering with Euclidean distance and Ward's method. Make sure to normalize the data first. How many clusters appear?
b) What would happen if the data were not normalized?
c) Compare the cluster centroid to characterize the different clusters, and try to give each cluster a label.
d) Use K-means clustering with the number of clusters that you found above. Does the same picture emerge?
e) Which clusters would you target for offers, and what types of offers would you target to customers in that cluster?