Or so I thought. I'm working on a cluster analysis project. There are multiple data sets, they are massive, and there are several variable subsets by which one could plausibly cluster the observations. Agglomerative hierarchical clustering is the way to go when you don't have any notion of how many clusters there should be, but [...]