Cluster Analysis
The Wolfram Language has broad support for non-hierarchical and hierarchical cluster analysis, allowing data that is similar to be clustered together. There is general support for all forms of data, including numerical, textual, and image data. The system implements efficient versions of both classic and modern machine learning-based clustering analysis methods.
Non-hierarchical Clustering
FindClusters — find a way to divide data into a list of clusters
ClusteringComponents — label data with the index of the cluster it is in
ClusterClassify — use data to create a function to classify new data into clusters
ClusteringMeasurements — analyze the result of a clustering process
MeanShift ▪ NearestNeighborGraph
Hierarchical Clustering
Dendrogram — plot a dendrogram of hierarchical clustering of data
ClusteringTree — generate a symbolic tree of hierarchical clustering of data
Clustering in Graphs »
FindGraphCommunities — find closely connected communities in graphs
CommunityGraphPlot — plot graphs showing their community structure
ConnectedComponents ▪ FindGraphPartition ▪ FindClique ▪ FindKClique
Clustering in Images »
ImageForestingComponents ▪ WatershedComponents ▪ GrowCutComponents ▪ ...
Options
DistanceFunction — how to compute distances between elements
ClusterDissimilarityFunction — how to compute dissimilarity between clusters
Weights — weights for different data elements
PerformanceGoal — whether to optimize for speed, memory, quality, training time, etc.
CriterionFunction — how to assess automatically selected methods
Method — manual override for automatic method selection