PRODUCTS
Products Overview
Mathematica
Mathematica for Students
Mathematica Home Edition
Wolfram
CDF Player
(free download)
Computable Document Format (CDF)
web
Mathematica
grid
Mathematica
Wolfram
Workbench
Mathematica
Add-Ons
Wolfram|Alpha Products
SOLUTIONS
Solutions Overview
Engineering
Aerospace Engineering & Defense
Chemical Engineering
Control Systems
Electrical Engineering
Image Processing
Industrial Engineering
Materials Science
Mechanical Engineering
Operations Research
Optics
Petroleum Engineering
Biotechnology & Medicine
Bioinformatics
Medical Imaging
Finance, Statistics & Business Analysis
Actuarial Sciences
Data Analysis & Mining
Econometrics
Economics
Financial Engineering & Mathematics
Financial Risk Management
Statistics
Software Engineering & Content Delivery
Authoring & Publishing
Interface Development
Software Engineering
Web Development
Science
Astronomy
Biological Sciences
Chemistry
Environmental Sciences
Geosciences
Social & Behavioral Sciences
Design, Arts & Entertainment
Game Design, Special Effects & Generative Art
Education
STEM Education Initiative
Higher Education
Community & Technical College Education
Primary & Secondary Education
Students
Technology
Computable Document Format (CDF)
High-Performance & Parallel Computing (HPC)
See Also: Technology Guide
PURCHASE
Online Store
Other Ways to Buy
Volume & Site Licensing
Contact Sales
Software
Service
Upgrades
Training
Books
SUPPORT
Support Overview
Knowledge Base
Learning Center
Community & Forums
Training & Free Seminars
Does My Site Have a License?
Wolfram User Portal
COMPANY
About Wolfram Research
News & Events
Wolfram Blog
Partnerships
Employment Opportunities
History of
Mathematica
Stephen Wolfram's Home Page
Contact Us
OUR SITES
All Sites
Wolfram|Alpha
Demonstrations Project
MathWorld
Integrator
Wolfram Functions Site
Mathematica Journal
Wolfram Media
Wolfram
Tones
Wolfram Science
Stephen Wolfram
DOCUMENTATION CENTER SEARCH
New to
Mathematica
?
Find your learning path
»
Mathematica
>
Data Manipulation
>
Numerical Data
>
Exploratory Data Analysis
>
FindClusters
>
Mathematica
>
Data Manipulation
>
Statistical Data Analysis
>
Exploratory Data Analysis
>
FindClusters
>
Mathematica
>
Mathematics and Algorithms
>
Statistical Data Analysis
>
Exploratory Data Analysis
>
FindClusters
>
BUILT-IN MATHEMATICA SYMBOL
Partitioning Data into Clusters
Tutorials »
|
ClusteringComponents
Partition
Split
Gather
Nearest
FindShortestTour
DistanceTransform
MeanShift
See Also »
|
Boolean Computation
Computational Geometry
Exploratory Data Analysis
Handling Arrays of Data
Logic & Boolean Algebra
Numerical Data
Processing Textual Data
Sequence Alignment & Comparison
Statistical Data Analysis
More About »
FindClusters
FindClusters
partitions the
into clusters of similar elements.
FindClusters
returns the
corresponding to the
in each cluster.
FindClusters
gives the same result.
FindClusters
partitions the
into exactly
n
clusters.
MORE INFORMATION
FindClusters
[{
e
1
,
e
2
,
...
},
DistanceFunction
->
f
]
treats pairs of elements as being less similar when their distances
are larger.
If the
are vectors of numbers,
FindClusters
by default in effect uses the Euclidean distance function
EuclideanDistance
.
If the
are lists of
True
and
False
,
FindClusters
by default uses a distance function based on the normalized fraction of elements that disagree.
If the
are strings,
FindClusters
by default uses a distance function based on the number of point changes needed to get from one string to another.
A
Method
option can be used to specify different methods of clustering. Possible settings include:
"Agglomerate"
find clustering hierarchically
"Optimize"
find clustering by local optimization
EXAMPLES
CLOSE ALL
Basic Examples
(3)
Find clusters of nearby values:
Find exactly four clusters:
Represent clustered elements with the right-hand sides of each rule:
Find clusters of nearby values:
In[1]:=
Out[1]=
Find exactly four clusters:
In[1]:=
Out[1]=
Represent clustered elements with the right-hand sides of each rule:
In[1]:=
Out[1]=
Scope
(5)
Cluster vectors of real values:
Cluster data of any precision:
Cluster Boolean 0, 1 or
True
,
False
data:
Cluster string data:
Find clusters in
five-dimensional vectors:
Options
(5)
Use
ManhattanDistance
as the measure of distance for continuous data:
Clusters obtained with the default
SquaredEuclideanDistance
:
Use
DiceDissimilarity
as the measure of distance for Boolean data:
Clusters obtained with the default
JaccardDissimilarity
:
Use
HammingDistance
as the measure of distance for string data:
Clusters obtained with the default
EditDistance
:
Define a distance function as a pure function:
Cluster the data hierarchically:
Clusters obtained with the default method:
Applications
(2)
Find and visualize clusters in bivariate data:
Cluster genomic sequences based on the number of element-wise differences:
Properties & Relations
(1)
FindClusters
groups data while
Nearest
gives the elements closest to a given value:
Possible Issues
(1)
The order of elements can have an effect on the clusters found:
Neat Examples
(2)
Divide a square into
n
segments by clustering uniformly distributed random points:
Cluster words beginning with "ax" in the English dictionary:
SEE ALSO
ClusteringComponents
Partition
Split
Gather
Nearest
FindShortestTour
DistanceTransform
MeanShift
TUTORIALS
Partitioning Data into Clusters
MORE ABOUT
Boolean Computation
Computational Geometry
Exploratory Data Analysis
Handling Arrays of Data
Logic & Boolean Algebra
Numerical Data
Processing Textual Data
Sequence Alignment & Comparison
Statistical Data Analysis
New in 6