Mathematica efficiently implements state-of-the-art data classification algorithms, allowing you to visualize distributions, search for nearest neighbors, and do cluster analysis on large datasets.
Histogram,
Histogram3D — create histograms from data
BinCounts — count the number of elements that lie in bins
FindClusters — find clusters in numerical, discrete, and string data
Nearest — find nearest neighbors in any space of data
MeanShift — use mean shift for feature space analysis
Cases,
Count — list, count elements matching a pattern