Wolfram Language & System 11.0 (2016)|Legacy Documentation

This is documentation for an earlier version of the Wolfram Language.View current documentation (Version 11.2)

ClusteringTree

ClusteringTree[{e1,e2,}]
constructs a weighted tree from the hierarchical clustering of the elements e1, e2, .

ClusteringTree[{e1v1,e2v2,}]
represents ei with vi in the constructed graph.

ClusteringTree[{e1,e2,}{v1,v2,}]
represents ei with vi in the constructed graph.

ClusteringTree[label1e1,label2e2]
represents ei using labels labeli in the constructed graph.

ClusteringTree[data,h]
constructs a weighted tree from the hierarchical clustering of data by joining subclusters at distance less than h.

Details and OptionsDetails and Options

  • The data elements ei can be numbers; numeric lists, matrices, or tensors; lists of Boolean elements; strings or images; geo positions or geographical entities; or colors. If the ei are lists, matrices, or tensors, each must have the same dimensions.
  • The result from ClusteringTree is a binary weighted tree, where the weight of each vertex indicates the distance between the two subtrees that have that vertex as root:
  • By default, the following distance functions are used for different types of elements:
  • ColorDistancecolors
    EditDistancestrings
    EuclideanDistancenumeric data
    GeoDistancegeospatial data/geographical entities
    ImageDistanceimages
    JaccardDissimilarityBoolean data
  • ClusteringTree has the same options as Graph, with the following additions and changes:
  • ClusterDissimilarityFunctionAutomaticthe clustering linkage algorithm to use
    DistanceFunctionAutomaticthe distance or dissimilarity to use
  • ClusterDissimilarityFunction defines the intercluster dissimilarity, given the dissimilarities between member elements.
  • Possible settings for ClusterDissimilarityFunction include:
  • "Average"average intercluster dissimilarity
    "Centroid"distance from cluster centroids
    "Complete"largest intercluster dissimilarity
    "Median"distance from cluster medians
    "Single"smallest intercluster dissimilarity
    "Ward"Ward's minimum variance dissimilarity
    "WeightedAverage"weighted average intercluster dissimilarity
    a pure function
  • The function f defines a distance from any two clusters.
  • The function f needs to be a real-valued function of the DistanceMatrix.

ExamplesExamplesopen allclose all

Basic Examples  (5)Basic Examples  (5)

Obtain a cluster hierarchy from a list of numbers:

In[1]:=
Click for copyable input
Out[1]=

Unify clusters at distance less than 2:

In[2]:=
Click for copyable input
Out[2]=

Obtain a cluster hierarchy from a list of strings:

In[1]:=
Click for copyable input
Out[1]=

Obtain a cluster hierarchy from a list of images:

In[1]:=
Click for copyable input
Out[1]=

Obtain a cluster hierarchy from a list of cities:

In[1]:=
Click for copyable input
Out[1]=

Obtain a cluster hierarchy from a list of Boolean entries:

In[1]:=
Click for copyable input
Out[1]=
Introduced in 2016
(10.4)