PRODUCTS
Products Overview
Mathematica
Mathematica Student Edition
Mathematica Home Edition
Wolfram
CDF Player
(free download)
Computable Document Format (CDF)
web
Mathematica
grid
Mathematica
Wolfram
Workbench
Wolfram
SystemModeler
Wolfram
Finance Platform
Mathematica
Add-Ons
Wolfram|Alpha Products
SOLUTIONS
Solutions Overview
Engineering
Aerospace Engineering & Defense
Chemical Engineering
Control Systems
Electrical Engineering
Image Processing
Industrial Engineering
Materials Science
Mechanical Engineering
Operations Research
Optics
Petroleum Engineering
Biotechnology & Medicine
Bioinformatics
Medical Imaging
Finance, Statistics & Business Analysis
Actuarial Sciences
Data Analysis & Mining
Econometrics
Economics
Financial Engineering & Mathematics
Financial Risk Management
Statistics
Software Engineering & Content Delivery
Authoring & Publishing
Interface Development
Software Engineering
Web Development
Science
Astronomy
Biological Sciences
Chemistry
Environmental Sciences
Geosciences
Social & Behavioral Sciences
Design, Arts & Entertainment
Game Design, Special Effects & Generative Art
Education
STEM Education Initiative
Higher Education
Community & Technical College Education
Primary & Secondary Education
Students
Technology
Computable Document Format (CDF)
High-Performance & Parallel Computing (HPC)
See Also: Technology Guide
PURCHASE
Online Store
Other Ways to Buy
Volume & Site Licensing
Contact Sales
Software
Service
Upgrades
Training
Books
Merchandise
SUPPORT
Support Overview
Mathematica
Documentation
Knowledge Base
Learning Center
Technical Services
Community & Forums
Training
Does My Site Have a License?
Wolfram User Portal
COMPANY
About Wolfram Research
News
Events
Wolfram Blog
Partnerships
Employment Opportunities
History of
Mathematica
Stephen Wolfram's Home Page
Contact Us
OUR SITES
All Sites
Wolfram|Alpha
Demonstrations Project
MathWorld
Integrator
Wolfram Functions Site
Mathematica Journal
Wolfram Media
Wolfram
Tones
Wolfram Science
Stephen Wolfram
THIS IS DOCUMENTATION FOR AN OBSOLETE PRODUCT.
SEE THE
DOCUMENTATION CENTER
FOR THE LATEST INFORMATION.
DOCUMENTATION CENTER SEARCH
New to
Mathematica
?
Find your learning path
»
Mathematica
>
Data Manipulation
>
Numerical Data
>
Exploratory Data Analysis
>
FindClusters
>
Mathematica
>
Data Manipulation
>
Statistical Data Analysis
>
Exploratory Data Analysis
>
FindClusters
>
Mathematica
>
Mathematics and Algorithms
>
Statistical Data Analysis
>
Exploratory Data Analysis
>
FindClusters
>
BUILT-IN MATHEMATICA SYMBOL
Partitioning Data into Clusters
Tutorials »
|
ClusteringComponents
Partition
Split
Gather
Nearest
FindShortestTour
DistanceTransform
MeanShift
See Also »
|
Boolean Computation
Computational Geometry
Exploratory Data Analysis
Handling Arrays of Data
Logic & Boolean Algebra
Numerical Data
Processing Textual Data
Sequence Alignment & Comparison
Statistical Data Analysis
More About »
FindClusters
FindClusters
partitions the
into clusters of similar elements.
FindClusters
returns the
corresponding to the
in each cluster.
FindClusters
gives the same result.
FindClusters
partitions the
into exactly
n
clusters.
MORE INFORMATION
FindClusters
[{
e
1
,
e
2
,
...
},
DistanceFunction
->
f
]
treats pairs of elements as being less similar when their distances
are larger.
If the
are vectors of numbers,
FindClusters
by default in effect uses the Euclidean distance function
EuclideanDistance
.
If the
are lists of
True
and
False
,
FindClusters
by default uses a distance function based on the normalized fraction of elements that disagree.
If the
are strings,
FindClusters
by default uses a distance function based on the number of point changes needed to get from one string to another.
A
Method
option can be used to specify different methods of clustering. Possible settings include:
"Agglomerate"
find clustering hierarchically
"Optimize"
find clustering by local optimization
EXAMPLES
CLOSE ALL
Basic Examples
(3)
Find clusters of nearby values:
Find exactly four clusters:
Represent clustered elements with the right-hand sides of each rule:
Find clusters of nearby values:
In[1]:=
Out[1]=
Find exactly four clusters:
In[1]:=
Out[1]=
Represent clustered elements with the right-hand sides of each rule:
In[1]:=
Out[1]=
Scope
(5)
Cluster vectors of real values:
Cluster data of any precision:
Cluster Boolean 0, 1 or
True
,
False
data:
Cluster string data:
Find clusters in
five-dimensional vectors:
Options
(5)
Use
ManhattanDistance
as the measure of distance for continuous data:
Clusters obtained with the default
SquaredEuclideanDistance
:
Use
DiceDissimilarity
as the measure of distance for Boolean data:
Clusters obtained with the default
JaccardDissimilarity
:
Use
HammingDistance
as the measure of distance for string data:
Clusters obtained with the default
EditDistance
:
Define a distance function as a pure function:
Cluster the data hierarchically:
Clusters obtained with the default method:
Applications
(2)
Find and visualize clusters in bivariate data:
Cluster genomic sequences based on the number of element-wise differences:
Properties & Relations
(1)
FindClusters
groups data while
Nearest
gives the elements closest to a given value:
Possible Issues
(1)
The order of elements can have an effect on the clusters found:
Neat Examples
(2)
Divide a square into
n
segments by clustering uniformly distributed random points:
Cluster words beginning with "ax" in the English dictionary:
SEE ALSO
ClusteringComponents
Partition
Split
Gather
Nearest
FindShortestTour
DistanceTransform
MeanShift
TUTORIALS
Partitioning Data into Clusters
MORE ABOUT
Boolean Computation
Computational Geometry
Exploratory Data Analysis
Handling Arrays of Data
Logic & Boolean Algebra
Numerical Data
Processing Textual Data
Sequence Alignment & Comparison
Statistical Data Analysis
New in 6