Wolfram Language & System Documentation Center

Search for all pages containing Classify

Classify   

Listing of Built-in Classifiers »

Classify[{in₁class₁,in₂class₂,…}]

generates a ClassifierFunction that attempts to predict class_i from the example in_i.

Classify[data,input]

attempts to predict the output associated with input from the training examples given.

Classify[data,input,prop]

computes the specified property prop relative to the prediction.

Details and Options

Classify is used to train algorithms to categorize data into classes based on observed patterns.
Classification is a supervised learning approach commonly employed for tasks such as email filtering, image and handwriting recognition, medical diagnosis based on pattern identification, and predicting customer behavior in business analytics.

Classify can be used on many types of data, including numerical, textual, sounds, and images, as well as combinations of these.
Complex expressions are automatically converted to simpler features like numbers or classes.

The final model type and hyperparameter values are selected using cross-validation on the training data.

The training data can have the following structure:

	{in₁out₁,in₂out₂,…}	a list of Rule between input and output
	{in₁,in₂,…}{out₁,out₂,…}	a Rule between inputs and corresponding outputs
	{list₁,list₂,…}n	the nth element of each List as the output
	{assoc₁,assoc₂,…}"key"	the "key" element of each Association as the output
	Dataset[…]column	the specified column of Dataset as the output
	Tabular[…]column	the specified column of Tabular as the output

In addition, special forms of data include:

	"name"	a built-in classifier function
	FittedModel[…]	a fitted model converted into a ClassifierFunction
	NetChain[…],NetGraph[…]	convert a net representing a classifier into a ClassifierFunction

Each example input in_i can be a single data element, a list {feature₁, …} or an association <|"feature₁"value₁,…|> .
Each example output out_i can be any atomic expression like a string, an integer or a boolean.
The prediction properties prop are the same as in ClassifierFunction. They include:

	"Decision"	best class according to probabilities and utility function
	"TopProbabilities"	probabilities for most likely classes
	"TopProbabilities"n	probabilities for the n most likely classes
	"Probability"class	probability for a specific class
	"Probabilities"	association of probabilities for all possible classes
	"SHAPValues"	Shapley additive feature explanations for each example
	"Properties"	list of all properties available

"SHAPValues" assesses the contribution of features by comparing predictions with different sets of features removed and then synthesized. The option MissingValueSynthesis can be used to specify how the missing features are synthesized. SHAP explanations are given as odds ratio multipliers with respect to the class training prior. "SHAPValues"n can be used to control the number of samples used for the numeric estimations of SHAP explanations.
Examples of built-in classifier functions include:

	"CountryFlag"	which country a flag image is for
	"FacebookTopic"	which topic a Facebook post is about
	"FacialAge"	estimated age from a face
	"FacialExpression"	what type of expression a face displays
	"FacialGender"	what gender a face appears to be
	"Language"	which natural language text is in
	"LanguageExtended"	language of a text, including rare languages
	"NameGender"	which gender a first name is
	"NotablePerson"	what notable person an image is of
	"NSFWImage"	whether an image is considered "not safe for work"
	"Profanity"	whether text contains profanity
	"ProgrammingLanguage"	which programming language text is in
	"Sentiment"	sentiment of a social media post
	"Spam"	whether email is spam
	"SpokenLanguage"	which natural language audio recording is in

The following options can be given:

AnomalyDetector	None	anomaly detector used by the classifier
AcceptanceThreshold	Automatic	rarer-probability threshold for anomaly detector
ClassPriors	Automatic	explicit prior probabilities for classes
FeatureExtractor	Identity	how to extract features from which to learn
FeatureNames	Automatic	feature names to assign for input data
FeatureTypes	Automatic	feature types to assume for input data
IndeterminateThreshold	0	below what probability to return Indeterminate
Method	Automatic	which classification algorithm to use
MissingValueSynthesis	Automatic	how to synthesize missing values
PerformanceGoal	Automatic	aspects of performance to try to optimize
RandomSeeding	1234	what seeding of pseudorandom generators should be done internally
RecalibrationFunction	Automatic	how to post-process class probabilities
TargetDevice	"CPU"	the target device on which to perform training
TimeGoal	Automatic	how long to spend training the classifier
TrainingProgressReporting	Automatic	how to report progress during training
UtilityFunction	Automatic	utility as function of actual and predicted class
ValidationSet	Automatic	the set of data on which to evaluate the model during training

Possible settings for Method include:

		"ClassDistributions"	classify using learned distributions
		"DecisionTree"	classify using a decision tree
		"GradientBoostedTrees"	classify using an ensemble of trees trained with gradient boosting
		"LogisticRegression"	classify using probabilities from linear combinations of features
		"Markov"	classify using a Markov model on the sequence of features (only for text, bag of token, etc.)
		"NaiveBayes"	classify by assuming probabilistic independence of features
		"NearestNeighbors"	classify from nearest neighbor examples
		"NeuralNetwork"	classify using an artificial neural network
		"RandomForest"	classify using Breiman–Cutler ensembles of decision trees
		"SupportVectorMachine"	classify using a support vector machine

Using FeatureExtractor"Minimal" indicates that the internal preprocessing should be as simple as possible.
Possible settings for PerformanceGoal include:

	"DirectTraining"	train directly on the full dataset, without model searching
	"Memory"	minimize storage requirements of the classifier
	"Quality"	maximize accuracy of the classifier
	"Speed"	maximize speed of the classifier
	"TrainingSpeed"	minimize time spent producing the classifier
	Automatic	automatic tradeoff among speed, accuracy, and memory
	{goal₁,goal₂,…}	automatically combine goal₁, goal₂, etc.

The following settings for TrainingProgressReporting can be used:

	"Panel"	show a dynamically updating graphical panel
	"Print"	periodically report information using Print
	"ProgressIndicator"	show a simple ProgressIndicator
	"SimplePanel"	dynamically updating panel without learning curves
	None	do not report any information

Possible settings for RandomSeeding include:

	Automatic	automatically reseed every time the function is called
	Inherited	use externally seeded random numbers
	seed	use an explicit integer or strings as a seed

In Classify[ClassifierFunction[…],FeatureExtractorfe], the FeatureExtractorFunction[…] fe will be prepended to the existing feature extractor.
Information can be used on the ClassifierFunction[…] obtained.

Examples

open all close all

Basic Examples (2)

Train a classifier function on labeled examples:

Wolfram Language code: c = Classify[{1 -> "A", 2 -> "A", 3.5 -> "B", 4 -> "B"}]

Use the classifier function to classify a new unlabeled example:

Wolfram Language code: c[3.1]

Plot the probability that the class of an example is "B" as a function of the feature:

Wolfram Language code: Plot[c[x, "Probability" -> "B"], {x, 1, 4}]

Train a classifier with multiple features:

Wolfram Language code:

c = Classify[{{1.5, Blue} -> "A", {3.2, Blue} -> "A", {4.1, Red} -> "B", {5.3, Red} -> "B", {10., Green} -> "C", {12.4, Red} -> "C"}]

Classify a new examples that may contain missing features:

Wolfram Language code: c[{{10.1, Blue}, {1.2, Missing[]}}]

Scope (33)

Data Format (7)

Specify the training set as a list of rules between an input example and the output value:

Wolfram Language code: Classify[{0.63 -> "A", -0.78 -> "B", 0.58 -> "A", -0.62 -> "B", -0.52 -> "B", -0.87 -> "B"}]

Each example can contain a list of features:

Wolfram Language code: Classify[{{0.63, -0.78} -> "A", {0.58, -0.62} -> "A", {-0.52, -0.87} -> "B", {0.08, -0.54} -> "A", {-0.21, 0.4} -> "B"}]

Each example can contain an association of features:

Wolfram Language code:

Classify[{<|"f1" -> 0.63, "f2" -> -0.78|> -> "A", <|"f1" -> 0.58, "f2" -> -0.62|> -> "A", <|"f1" -> -0.52, "f2" -> -0.87|> -> "B", <|"f1" -> 0.08, "f2" -> -0.54|> -> "A", <|"f1" -> -0.21, "f2" -> 0.4|> -> "B"}]

Specify the training set as a list of rules between a list of inputs and a list of outputs:

Wolfram Language code: Classify[{0.63, -0.78, 0.58, -0.62, -0.52, -0.87} -> {"A", "B", "A", "B", "B", "B"}]

Specify all the data in a matrix and mark the output column:

Wolfram Language code: Classify[{{0.63, "A"}, {-0.78, "B"}, {0.58, "A"}, {-0.62, "B"}, {-0.52, "B"}, {-0.87, "B"}} -> 2]

Specify all the data in a list of associations and mark the output key:

Wolfram Language code:

Classify[{<|"f1" -> 0.63, "f2" -> "A"|>, <|"f1" -> -0.78, "f2" -> "B"|>, <|"f1" -> 0.58, "f2" -> "A"|>, <|"f1" -> -0.62, "f2" -> "B"|>, <|"f1" -> -0.52, "f2" -> "B"|>, <|"f1" -> -0.87, "f2" -> "B"|>} -> "f2"]

Specify all the data in a dataset and mark the output column:

Wolfram Language code:

Classify[Dataset[{Association["f1" -> 0.63, "f2" -> "A"], Association["f1" -> -0.78, "f2" -> "B"], 
  Association["f1" -> 0.58, "f2" -> "A"], Association["f1" -> -0.62, "f2" -> "B"], 
  Association["f1" -> -0.52, "f2" -> "B"], Association["f1" -> -0.87, "f2" -> "B"]}] -> "f2"]

Data Types (13)

Numerical (3)

Predict a variable from a number:

Wolfram Language code: Classify[{0.63 -> "A", -0.78 -> "B", 0.58 -> "A", -0.62 -> "B", -0.52 -> "B", -0.87 -> "B"}]

Predict a variable from a numerical vector:

Wolfram Language code: Classify[{{0.63, -0.78} -> "A", {0.58, -0.62} -> "A", {-0.52, -0.87} -> "B", {0.08, -0.54} -> "A", {-0.21, 0.4} -> "B"}]

Predict a variable from a numerical array of arbitrary depth:

Wolfram Language code:

Classify[{{{-0.58, 0.5}, {-0.15, -0.51}} -> "B", {{0.95, 0.65}, {0.85, 0.16}} -> "A", {{-0.41, -0.58}, {0.16, -0.74}} -> "B", {{-0.39, 0.42}, {-0.22, 0.64}} -> "A", {{-0.35, 0.19}, {0.04, -0.66}} -> "B"}]

Nominal (3)

Predict a class from a nominal value:

Wolfram Language code: Classify[{"XXX" -> "A", "YYY" -> "B", "XXX" -> "A", "YYY" -> "B", "YYY" -> "B"}]

Predict a class from several nominal values:

Wolfram Language code:

c = Classify[<|"Treatment" -> {"A", "B", "A", "C", "B", "C", "A", "B", "C", "A"}, "Severity" -> {"High", "Medium", "Low", "High", "Low", "Medium", "Medium", "High", "Low", "High"}, "RecoveryTime" -> {8, 6, 4, 9, 5, 7, 6, 8, 5, 8}|> -> "Treatment"]

Wolfram Language code: c[<|"RecoveryTime" -> 4, "Severity" -> "High"|>]

Predict a class from a mixture of nominal and numerical values:

Wolfram Language code:

c = Classify[Dataset[{Association["Age" -> 35, "Gender" -> "Male", "BloodPressure" -> 120, 
   "CholesterolLevel" -> 180, "Diabetes" -> "No"], Association["Age" -> 42, "Gender" -> "Female", 
   "BloodPressure" -> 130, "CholesterolLevel" -> 210, "Diabetes" -> "Yes"], 
  Association["Age" -> 55, "Gender" -> "Male", "BloodPressure" -> 140, "CholesterolLevel" -> 240, 
   "Diabetes" -> "No"], Association["Age" -> 28, "Gender" -> "Female", "BloodPressure" -> 115, 
   "CholesterolLevel" -> 190, "Diabetes" -> "No"], Association["Age" -> 68, "Gender" -> "Male", 
   "BloodPressure" -> 150, "CholesterolLevel" -> 280, "Diabetes" -> "Yes"], 
  Association["Age" -> 48, "Gender" -> "Female", "BloodPressure" -> 125, "CholesterolLevel" -> 200, 
   "Diabetes" -> "No"]}] -> "Diabetes"]

Wolfram Language code: c[<|"Age" -> 45, "Gender" -> "Male", "BloodPressure" -> 180, "CholesterolLevel" -> 230|>]

Quantities (1)

Train a classifier on data including Quantity objects:

Wolfram Language code:

c = Classify[Dataset[{Association["Neighborhood" -> "Sunnypoint", "Area" -> Quantity[1500, "Feet"^2], 
   "Price" -> Quantity[300000, "USDollars"]], Association["Neighborhood" -> "Moonbrook", 
   "Area" -> Quantity[1800, "Feet"^2], "Price" -> Quantity[360000, "USDollars"]], 
  Association["Neighborhood" -> "Sunnypoint", "Area" -> Quantity[1700, "Feet"^2], 
   "Price" -> Quantity[340000, "USDollars"]], Association["Neighborhood" -> "Starville", 
   "Area" -> Quantity[2000, "Feet"^2], "Price" -> Quantity[500000, "USDollars"]], 
  Association["Neighborhood" -> "Moonbrook", "Area" -> Quantity[1600, "Feet"^2], 
   "Price" -> Quantity[320000, "USDollars"]], Association["Neighborhood" -> "Starville", 
   "Area" -> Quantity[2200, "Feet"^2], "Price" -> Quantity[550000, "USDollars"]], 
  Association["Neighborhood" -> "Sunnypoint", "Area" -> Quantity[1400, "Feet"^2], 
   "Price" -> Quantity[280000, "USDollars"]], Association["Neighborhood" -> "Moonbrook", 
   "Area" -> Quantity[1900, "Feet"^2], "Price" -> Quantity[380000, "USDollars"]], 
  Association["Neighborhood" -> "Starville", "Area" -> Quantity[2100, "Feet"^2], 
   "Price" -> Quantity[520000, "USDollars"]], Association["Neighborhood" -> "Sunnypoint", 
   "Area" -> Quantity[1800, "Feet"^2], "Price" -> Quantity[360000, "USDollars"]]}] -> "Neighborhood"]

Use the classifier on a new example:

Wolfram Language code: c[<|"Price" -> Quantity[290000, "USDollars"], "Area" -> Quantity[1000, "Feet"^2]|>]

Predict the most likely price when only the "Price" is known:

Wolfram Language code: c[<|"Price" -> Quantity[800000, "USDollars"]|>]

Text (1)

Train a classifier on textual data:

Wolfram Language code:

c = Classify[{"the cat is grey" -> [image], "my cat is fast" -> [image], "this dog is scary" -> [image] , "the big dog" -> [image]}]

Classify new examples:

Wolfram Language code: c[{"nice cat", "what a dog"}]

Colors (1)

Predict a variable from a color expression:

Wolfram Language code:

Classify[{RGBColor[1., 0.8431372549019608, 0.] -> "warm", RGBColor[1., 0.27058823529411763, 0.] -> "warm", RGBColor[0., 0.807843137254902, 0.8196078431372549] -> "cold", RGBColor[1., 0.6274509803921569, 0.47843137254901963] -> "warm", RGBColor[1., 0.4117647058823529, 0.7058823529411765] -> "warm", RGBColor[0.12549019607843137, 0.6980392156862745, 0.6666666666666666] -> "cold", RGBColor[0.2549019607843137, 0.4117647058823529, 0.8823529411764706] -> "cold", RGBColor[0.5294117647058824, 0.807843137254902, 0.9215686274509803] -> "cold", RGBColor[0.6901960784313725, 0.8784313725490196, 0.9019607843137255] -> "cold", RGBColor[0.4980392156862745, 1., 0.] -> "warm", RGBColor[0.20392156862745098, 0.596078431372549, 0.8588235294117647] -> "cold", RGBColor[1., 0.3411764705882353, 0.2] -> "warm"}, {Red, Green, Blue}]

Images (1)

Train a predictor to predict an animal species from its picture:

Wolfram Language code:

c = Classify[{[image] -> "dromedar", [image] -> "dromedar", [image] -> "dromedar", [image] -> "dromedar", [image] -> "dromedar", [image] -> "camel", [image] -> "camel", [image] -> "camel", [image] -> "camel", [image] -> "camel"}]

Wolfram Language code: c[[image]]

Sequences (1)

Train a classifier on data where the feature is a sequence of tokens:

Wolfram Language code:

c = Classify[{{"apple", "banana", "cherry"} -> "fruit", {"dog", "cat"} -> "pet", {"carrot", "lettuce", "tomato", "broccoli"} -> "vegetable", {"apple", "grape", "pear", "kiwi", "strawberry"} -> "fruit", {"dog", "rabbit", "hamster"} -> "pet", {"cucumber", "spinach", "zucchini"} -> "vegetable"}]

Wolfram Language code: c[{"carrot", "zucchini"}]

Missing Data (2)

Train a classifier on a dataset with missing features:

Wolfram Language code:

c = Classify[{{2.3, "male"} -> "A", {4.8, Missing[]} -> "B", {Missing[], "female"} -> "B", {5.2, "female"} -> "C", {Missing[], "male"} -> "B", {1.3, "male"} -> "A"}]

Classify examples containing missing features as well:

Wolfram Language code: c[{{1.2, Missing[]}, {Missing[], "female"}}]

Train a classifier on a dataset with named features. The order of the keys does not matter. Keys can be missing:

Wolfram Language code:

c = Classify[{
	<|"age" -> 32, "height" -> 160|> -> "female", 
	<|"height" -> 183, "age" -> 41|> -> "male", 
	<|"height" -> 123|> -> "female", 
	<|"height" -> 175, "age" -> 21|> -> "male", 
	<|"age" -> 11|> -> "male", 
	<|"age" -> 52, "height" -> 164|> -> "female"}]

Classify examples containing missing features:

Wolfram Language code: c[{<|"height" -> 190|>, <|"age" -> 90|>, <|"age" -> 12, "height" -> 120|>, <||>}]

Information (4)

Extract information from a trained predictor:

Wolfram Language code:

Information[ClassifierFunction[Association["ExampleNumber" -> 6, "ClassNumber" -> 2, 
  "Input" -> Association["Preprocessor" -> MachineLearning`MLProcessor["ToMLDataset", 
      Association["Input" -> Association["Age" -> Association["Type" -> "Numerical"], 
 ...   "Date" -> DateObject[{2023, 11, 6, 15, 41, 52.149659`8.469826447952146}, "Instant", 
      "Gregorian", 1.], "ProcessorCount" -> 10, "ProcessorType" -> "ARM64", 
    "OperatingSystem" -> "MacOSX", "SystemWordLength" -> 64, "Evaluations" -> {}]]]]

Get information about the input features:

Wolfram Language code:

Information[ClassifierFunction[Association["ExampleNumber" -> 6, "ClassNumber" -> 2, 
  "Input" -> Association["Preprocessor" -> MachineLearning`MLProcessor["ToMLDataset", 
      Association["Input" -> Association["Age" -> Association["Type" -> "Numerical"], 
 ...   "Date" -> DateObject[{2023, 11, 6, 15, 41, 52.149659`8.469826447952146}, "Instant", 
      "Gregorian", 1.], "ProcessorCount" -> 10, "ProcessorType" -> "ARM64", 
    "OperatingSystem" -> "MacOSX", "SystemWordLength" -> 64, "Evaluations" -> {}]]], #]& /@ {"FeatureNames", "FeatureNumber", "FeatureTypes"}

Get the feature extractor used to process the input features:

Wolfram Language code:

Information[ClassifierFunction[Association["ExampleNumber" -> 6, "ClassNumber" -> 2, 
  "Input" -> Association["Preprocessor" -> MachineLearning`MLProcessor["ToMLDataset", 
      Association["Input" -> Association["Age" -> Association["Type" -> "Numerical"], 
 ...   "Date" -> DateObject[{2023, 11, 6, 15, 41, 52.149659`8.469826447952146}, "Instant", 
      "Gregorian", 1.], "ProcessorCount" -> 10, "ProcessorType" -> "ARM64", 
    "OperatingSystem" -> "MacOSX", "SystemWordLength" -> 64, "Evaluations" -> {}]]], "FeatureExtractor"]

Get a list of the supported properties:

Wolfram Language code:

Information[ClassifierFunction[Association["ExampleNumber" -> 6, "ClassNumber" -> 2, 
  "Input" -> Association["Preprocessor" -> MachineLearning`MLProcessor["ToMLDataset", 
      Association["Input" -> Association["Age" -> Association["Type" -> "Numerical"], 
 ...   "Date" -> DateObject[{2023, 11, 6, 15, 41, 52.149659`8.469826447952146}, "Instant", 
      "Gregorian", 1.], "ProcessorCount" -> 10, "ProcessorType" -> "ARM64", 
    "OperatingSystem" -> "MacOSX", "SystemWordLength" -> 64, "Evaluations" -> {}]]], "Properties"]

Built-in Classifiers (9)

Use the "Language" built-in classifier to detect the language in which a text is written:

Wolfram Language code: Classify["Language", "the house is blue"]

Use it to detect the language of examples:

Wolfram Language code:

Classify["Language", {"the house is blue", "la maison est bleue", "la casa es azul", "das Haus ist blau", "房子是蓝色的", "المنزل باللون الأزرق", "будинок синій"}]

Obtain the probabilities for the most likely languages:

Wolfram Language code: Classify["Language", "the house is blue, la maison est bleue", "TopProbabilities"]

Restrict the classifier to some languages with the option ClassPriors:

Wolfram Language code:

Classify["Language", {"What is this language?", "¿Qué idioma es ese?", "Quelle est cette langue?"}, ClassPriors -> <|Entity["Language", "English"] -> 0.5, Entity["Language", "Spanish"] -> 0.5|>]

Use the "FacebookTopic" built-in classifier to detect the topic of a Facebook post:

Wolfram Language code: Classify["FacebookTopic", "I love eating carrots in the morning"]

Classify multiple examples:

Wolfram Language code: Classify["FacebookTopic", {"I bought a new computer", "happy birthday!", "this skirt looks nice"}]

Unrecognized topics or languages will return Indeterminate:

Wolfram Language code: Classify["FacebookTopic", {"what is this topic?", "qwe niowqe mwoei!"}]

Use the "CountryFlag" built-in classifier to recognize a country from its flag:

Wolfram Language code: Classify["CountryFlag", {[image], [image], [image], [image], [image]}]

Use the "NameGender" built-in classifier to get the probable sex of a person from their first name:

Wolfram Language code: Classify["NameGender", {"Tom", "Stacy", "John", "Natalie"}]

Use the "NotablePerson" built-in classifier to determine what notable person is depicted in the given image:

Wolfram Language code: Classify["NotablePerson", [image]]

Use the "Sentiment" built-in classifier to infer the sentiment of a social media message:

Wolfram Language code: Classify["Sentiment", {"I love this movie", "I am so sad", "My phone broke again"}]

Use the "Profanity" built-in classifier to return True if a text contains strong language:

Wolfram Language code: Classify["Profanity", Import["http://www.urbandictionary.com/random.php"]]

Use the "Spam" built-in classifier to detect if an email is spam from its content:

Wolfram Language code:

Classify["Spam", "Dear recipient,*** Technologies announces the beginning of a new unprecendented global employment campaign.reviser yeller winers butchery twenties
Due to company's exploding growth *** is expanding business to the European region.During last employment campaign over 1500 people worldwide took part in ***'s business
and more than half of them are currently employed by the company.And now we are offering you
one more opportunity to earn extra money working with *** Technologies.druggists blame classy gentry Aladdin
We are looking for honest,responsible,hard-working people that can dedicate 2-4 hours of their
time per day and earn extra Â$300-500 weekly.All offered positions are currently part-time
and give you a chance to work mainly from home.lovelies hockey Malton meager reordered
Please visit ***'s corporate web site (http://www.***.com/sta/home/0077.htm) for more details regarding these vacancies."]

Use the "SpokenLanguage" built-in classifier to detect the language in which a text is written:

Wolfram Language code: Classify["SpokenLanguage", \!\(\*AudioBox["![Embedded Audio Player](audio://content-m3v5b)"]\)]

Options (23)

AcceptanceThreshold (1)

Create a classifier with an anomaly detector:

Wolfram Language code: c = Classify[{1 -> "A", 2 -> "A", 3.5 -> "B", 4 -> "B"}, AnomalyDetector -> Automatic]

Change the value of the acceptance threshold when evaluating the classifier:

Wolfram Language code: c[6, AcceptanceThreshold -> 0.01]

Wolfram Language code: c[6, AcceptanceThreshold -> 0.0001]

Permanently change the value of the acceptance threshold in the classifier:

Wolfram Language code: c2 = Classify[c, AcceptanceThreshold -> 0.01]

Wolfram Language code: c2[6, AcceptanceThreshold -> 0.01]

AnomalyDetector (1)

Create a classifier and specify that an anomaly detector should be included:

Wolfram Language code: c = Classify[{1 -> "A", 2 -> "A", 3.5 -> "B", 4 -> "B"}, AnomalyDetector -> Automatic]

Evaluate the classifier on an non-anomalous input:

Wolfram Language code: c[1.2]

Evaluate the classifier on an anomalous input:

Wolfram Language code: c[100000.2]

The "Probabilities" property is not affected by the anomaly detector:

Wolfram Language code: c[100000.2, "Probabilities"]

Temporarily remove the anomaly detector from the classifier:

Wolfram Language code: c[10000.2, AnomalyDetector -> None]

Permanently remove the anomaly detector from the classifier:

Wolfram Language code: c2 = Classify[c, AnomalyDetector -> None]

Wolfram Language code: c2[10000.2]

ClassPriors (1)

Train a classifier on an imbalanced dataset:

Wolfram Language code: data = {1 -> True, 2 -> True, 3 -> True, 4 -> True, 5 -> False, 6 -> True};

Wolfram Language code: c = Classify[data, Method -> "LogisticRegression"]

The training example 5False is classified as True:

Wolfram Language code: c[5]

Wolfram Language code: c[5, "Probabilities"]

Classify this example with a uniform prior over classes instead of the imbalanced training prior:

Wolfram Language code: c[5, ClassPriors -> <|False -> 0.5, True -> 0.5|>]

Wolfram Language code: c[5, "Probabilities", ClassPriors -> <|False -> 0.5, True -> 0.5|>]

The class priors can be specified during the training:

Wolfram Language code: c2 = Classify[data, Method -> "LogisticRegression", ClassPriors -> <|False -> 0.5, True -> 0.5|>]

Wolfram Language code: c2[5]

Wolfram Language code: c2[5, "Probabilities"]

Wolfram Language code: c2[5, ClassPriors -> <|False -> 0.2, True -> 0.8|>]

The class priors of a classifier can also be changed after training:

Wolfram Language code: c3 = Classify[c2, ClassPriors -> <|False -> 1 / 6, True -> 5 / 6|>]

Wolfram Language code: c2[5, "Probabilities"]

Wolfram Language code: c3[5, "Probabilities"]

FeatureExtractor (3)

Train a FeatureExtractorFunction on a simple dataset:

Wolfram Language code: dataset = {{1.4, "A"}, {1.5, "A"}, {2.3, "B"}, {5.4, "B"}};

Wolfram Language code: fe = FeatureExtraction[dataset]

Use the feature extractor function as a preprocessing step in Classify:

Wolfram Language code: Classify[dataset -> {"Yes", "No", "No", "No"}, FeatureExtractor -> fe]

Train a classifier on texts preprocessed by custom functions and an extractor method:

Wolfram Language code:

c = Classify[{"The cat is grey." -> [image], "My cat is fast." -> [image], "This dog is scary." -> [image] , "The big dog." -> [image]}, FeatureExtractor -> {ToUpperCase, RemoveDiacritics, "SegmentedWords"}]

Wolfram Language code: c[{"Nice CAT", "What a dög"}]

Create a feature extractor and extract features from a dataset of texts:

Wolfram Language code:

{features, fe} = FeatureExtraction[{"The cat is grey.", "My cat is fast.", "This dog is scary.", "The big dog."}, {ToUpperCase, RemoveDiacritics, "SegmentedWords"}, {"ExtractedFeatures", "ExtractorFunction"}]

Train a classifier on the extracted features:

Wolfram Language code: c = Classify[features -> {[image], [image], [image], [image]}]

Join the feature extractor to the classifier:

Wolfram Language code: c2 = Classify[c, FeatureExtractor -> fe]

The classifier can now be used on the initial input type:

Wolfram Language code: c2["Nice CAT"]

FeatureNames (2)

Train a classifier and give a name to each feature:

Wolfram Language code:

c = Classify[{{2.3, "male"} -> "a", {4.8, Missing[]} -> "b", {Missing[], "female"} -> "a", {5.2, "female"} -> "b"}, FeatureNames -> {"age", "gender"}]

Use the association format to predict a new example:

Wolfram Language code: c[<|"age" -> 3.3, "gender" -> "male"|>]

The list format can still be used:

Wolfram Language code: c[{3.3, "male"}]

Train a classifier on a training set with named features and use FeatureNames to set their order:

Wolfram Language code:

c = Classify[{<|"age" -> 2.3, "gender" -> "male"|> -> "a", <|"age" -> 4.6|> -> "b", <|"gender" -> "female"|> -> "a", <|"gender" -> "female", "age" -> 5.2|> -> "b"}, FeatureNames -> {"gender", "age"}]

Features are ordered as specified:

Wolfram Language code: Information[c, FeatureNames]

Classify a new example from a list:

Wolfram Language code: c[{"female", 6.5}]

FeatureTypes (2)

Train a classifier on data where the feature is intended to be a sequence of tokens:

Wolfram Language code: c = Classify[{{"butter", "sugar"} -> "bad", {"flour", "butter"} -> "good", {"tomato", "salt"} -> "good"}]

Classify wrongly assumed that examples contained two different text features:

Wolfram Language code: Information[c, FeatureTypes]

The following classification will output an error message:

Wolfram Language code: c[{"butter", "tomato", "apple"}]

Force Classify to interpret the feature as a "NominalSequence":

Wolfram Language code:

c2 = Classify[{{"butter", "sugar"} -> "bad", {"flour", "butter"} -> "good", {"tomato", "salt"} -> "good"}, FeatureTypes -> "NominalSequence"]

Wolfram Language code: Information[c2, FeatureTypes]

Classify a new example:

Wolfram Language code: c2[{"butter", "tomato", "apple"}]

Train a classifier with named features:

Wolfram Language code:

trainingset = {
	<|"age" -> 32, "gender" -> 1|> -> "tall", 
	<|"age" -> 41, "gender" -> 2|> -> "short", 
	<|"age" -> 17, "gender" -> 2|> -> "short", 
	<|"age" -> 11, "gender" -> 1|> -> "tall"};

Wolfram Language code: c = Classify[trainingset]

Both features have been considered numerical:

Wolfram Language code: Information[c, FeatureTypes]

Specify that the feature "gender" should be considered nominal:

Wolfram Language code: c = Classify[trainingset, FeatureTypes -> <|"gender" -> "Nominal"|>]

Wolfram Language code: Information[c, FeatureTypes]

IndeterminateThreshold (1)

Specify a probability threshold when training the classifier:

Wolfram Language code: data = {1 -> "B", 2 -> "B", 3 -> "A", 4 -> "B", 5 -> "A", 6 -> "A"};

Wolfram Language code: c = Classify[data, IndeterminateThreshold -> 0.9]

Obtain class probabilities for an example:

Wolfram Language code: c[5, "Probabilities"]

As there are no class probabilities above 0.9, no prediction is made:

Wolfram Language code: c[5]

Specifying a threshold when classifying supersedes the trained threshold:

Wolfram Language code: c[5, IndeterminateThreshold -> 0.5]

Update the value of the threshold in the classifier:

Wolfram Language code: c2 = Classify[c, IndeterminateThreshold -> 0.5]

Wolfram Language code: c2[5]

Method (3)

Train a logistic classifier:

Wolfram Language code: trainingset = {1, 2, 3, 4, 5, 6, 7} -> {"a", "a", "b", "a", "b", "b", "b"};

Wolfram Language code: logistic = Classify[trainingset, Method -> "LogisticRegression"]

Train a random forest classifier:

Wolfram Language code: rf = Classify[trainingset, Method -> "RandomForest"]

Plot the probability of class "a" given the feature for both classifiers:

Wolfram Language code:

Plot[{
	logistic[x, "Probability" -> "a"], 
	rf[x, "Probability" -> "a"]
	}, {x, 0, 8}, Exclusions -> None]

Train a nearest neighbors classifier:

Wolfram Language code: trainingset = ExampleData[{"MachineLearning", "UCILetter"}, "TrainingData"];

Wolfram Language code: c1 = Classify[trainingset, Method -> "NearestNeighbors"]

Find the classification accuracy on a test set:

Wolfram Language code: testset = ExampleData[{"MachineLearning", "UCILetter"}, "TestData"];

Wolfram Language code: ClassifierMeasurements[c1, testset, "Accuracy"]

In this example, using a naive Bayes classifier reduces the classification accuracy:

Wolfram Language code: c2 = Classify[trainingset, Method -> "NaiveBayes"]

Wolfram Language code: ClassifierMeasurements[c2, testset, "Accuracy"]

However, using a naive Bayes classifier reduces the classification time:

Wolfram Language code: First[AbsoluteTiming[c1[testset[[All, 1]]]]]

Wolfram Language code: First[AbsoluteTiming[c2[testset[[All, 1]]]]]

MONK's problems consist of synthetic binary classification datasets used for comparing the performance of different classifiers. Generate the dataset for the second MONK problem:

Wolfram Language code: data = Map[# -> Count[#, 1] == 1&, Tuples[{{1, 2, 3}, {1, 2, 3}, {1, 2}, {1, 2, 3}, {1, 2, 3, 4}, {1, 2}}]];

Test the accuracy of each available classifier by training on 169 examples and testing on the entire dataset:

Wolfram Language code: trainingset = RandomSample[data, 169];

Wolfram Language code:

AssociationMap[ ClassifierMeasurements[
	Classify[trainingset, Method -> #], data, "Accuracy"]&, {"RandomForest", "NaiveBayes", "SupportVectorMachine", "NearestNeighbors", "LogisticRegression"}]

MissingValueSynthesis (1)

Train a classifier with two input features:

Wolfram Language code:

x = {{1, 3}, {2, 4}, {3, 5}, {4, 4}, {5, 8}, {6, 9}, {7, 4}, {8, 6}, {9, 12}};
y = {"A", "B", "A", "B", "B", "B", "A", "B", "A"};
c = Classify[x -> y]

Get class probabilities for an example that has a missing value:

Wolfram Language code: c[{5, Missing[]}, "Probabilities"]

Set the missing value synthesis to replace each missing variable with its estimated most likely value given known values (which is the default behavior):

Wolfram Language code: c[{5, Missing[]}, "Probabilities", MissingValueSynthesis -> "ModeFinding"]

Replace missing variables with random samples conditioned on known values:

Wolfram Language code: c[{5, Missing[]}, "Probabilities", MissingValueSynthesis -> "RandomSampling"]

Averaging over many random imputations is usually the best strategy and allows obtaining the uncertainty caused by the imputation:

Wolfram Language code: MeanAround[Table[c[{5, Missing[]}, "Probabilities", MissingValueSynthesis -> "RandomSampling"], 100]]

Specify a learning method during training to control how the distribution of data is learned:

Wolfram Language code: c = Classify[x -> y, MissingValueSynthesis -> "KernelDensityEstimation"]

Classify an example with missing values using the "KernelDensityEstimation" distribution to condition values:

Wolfram Language code: c[{6, Missing[]}, "Probabilities"]

Provide an existing LearnedDistribution at training to use it when imputing missing values during training and later evaluations:

Wolfram Language code:

dist = LearnDistribution[x, Method -> "Multinormal"];
c = Classify[x -> y, MissingValueSynthesis -> dist];
c[{6, Missing[]}, "Probabilities"]

Specify an existing LearnedDistribution to synthesize missing values for an individual evaluation:

Wolfram Language code:

dist2 = LearnDistribution[x, Method -> "KernelDensityEstimation"];
c[{6, Missing[]}, "Probabilities", MissingValueSynthesis -> dist2]

Control both the learning method and the evaluation strategy by passing an association at training:

Wolfram Language code:

c = Classify[x -> y, MissingValueSynthesis -> 
	<|"LearningMethod" -> "Multinormal", "EvaluationStrategy" -> "RandomSampling"|>];
c[{6, Missing[]}, "Probabilities"]

RecalibrationFunction (1)

Load the MNIST dataset:

Wolfram Language code:

training = RandomSample[ResourceData["MNIST", "TrainingData"], 1000];
test = ResourceData["MNIST", "TestData"];

Train a random forest classifier without any recalibration:

Wolfram Language code: c = Classify[training, Method -> "RandomForest", RecalibrationFunction -> None]

Visualize the calibration curve on a test set:

Wolfram Language code: ClassifierMeasurements[c, test, "CalibrationCurve"]

Train a random forest classifier with recalibration:

Wolfram Language code: c2 = Classify[training, Method -> "RandomForest", RecalibrationFunction -> All]

Visualize the calibration curve on a test set:

Wolfram Language code: ClassifierMeasurements[c2, test, "CalibrationCurve"]

PerformanceGoal (1)

Train a classifier with an emphasis on training speed:

Wolfram Language code: trainingset = ExampleData[{"MachineLearning", "Satellite"}, "TrainingData"];

Wolfram Language code: c1 = Classify[trainingset, PerformanceGoal -> "TrainingSpeed"]

Wolfram Language code: Information[c1, "TrainingTime"]

Compute the classification accuracy on a test set:

Wolfram Language code: testset = ExampleData[{"MachineLearning", "Satellite"}, "TestData"];

Wolfram Language code: ClassifierMeasurements[c1, testset, "Accuracy"]

By default, a compromise between classification speed and performance is sought:

Wolfram Language code: c2 = Classify[trainingset]

Wolfram Language code: Information[c2, "TrainingTime"]

Wolfram Language code: ClassifierMeasurements[c2, testset, "Accuracy"]

With the same data, train a classifier with an emphasis on training speed and memory:

Wolfram Language code: c3 = Classify[trainingset, PerformanceGoal -> {"TrainingSpeed", "Memory"}]

The classifier uses less memory, but is also less accurate:

Wolfram Language code: ByteCount /@ {c2, c3}

Wolfram Language code: ClassifierMeasurements[c3, testset, "Accuracy"]

TargetDevice (1)

Train a classifier on the system's default GPU using a neural network and look at the AbsoluteTiming:

Wolfram Language code:

n = 10000;
trainingData = RandomReal[1, {n, 4}] -> RandomChoice[{1, 2, 3}, n];
AbsoluteTiming[classifier = Classify[trainingData, Method -> "NeuralNetwork", TargetDevice -> "GPU"]]

Compare the previous result with the one achieved by using the default CPU computation:

Wolfram Language code: AbsoluteTiming[classifier = Classify[trainingData, Method -> "NeuralNetwork"]]

TimeGoal (2)

Train a classifier while specifying a total training time of 5 seconds:

Wolfram Language code: c = Classify[{1, 2, 3, 4} -> {"A", "A", "B", "B"}, TimeGoal -> 5]

Wolfram Language code: Information[c, "TrainingTime"]

Load the "Mushroom" dataset:

Wolfram Language code: dataset = ExampleData[{"MachineLearning", "Mushroom"}, "Data"];

Train a classifier while specifying a target training time of 0.1 seconds:

Wolfram Language code: c = Classify[dataset, TimeGoal -> .1]

The classifier reached an accuracy of about 90%:

Wolfram Language code: Information[c]

Train a classifier while specifying a target training time of 5 seconds:

Wolfram Language code: c = Classify[dataset, TimeGoal -> 5]

The classifier reached an accuracy of about 99%:

Wolfram Language code: Information[c]

TrainingProgressReporting (1)

Load the "UCILetter" dataset:

Wolfram Language code: dataset = ExampleData[{"MachineLearning", "UCILetter"}, "Data"];

Show training progress interactively during training of a classifier:

Wolfram Language code: Classify[dataset, TrainingProgressReporting -> "Panel"];

Show training progress interactively without plots:

Wolfram Language code: Classify[dataset, TrainingProgressReporting -> "SimplePanel"];

Print training progress periodically during training:

Wolfram Language code: Classify[dataset, TrainingProgressReporting -> "Print"];

Show a simple progress indicator:

Wolfram Language code: Classify[dataset, TrainingProgressReporting -> "ProgressIndicator"];

Do not report progress:

Wolfram Language code: Classify[dataset, TrainingProgressReporting -> None];

UtilityFunction (1)

Train a classifier:

Wolfram Language code: trainingset = {1, 2, 3, 4} -> {"yes", "yes", "no", "no"};

Wolfram Language code: c1 = Classify[trainingset]

By default, the most probable class is predicted:

Wolfram Language code: c1[2.55, "Probabilities"]

Wolfram Language code: c1[2.55]

This corresponds to the following utility specification:

Wolfram Language code: Information[c1, UtilityFunction]

Train a classifier that penalizes examples of class "yes" being misclassified as "no":

Wolfram Language code:

c2 = Classify[trainingset, UtilityFunction -> <|"no" -> <|"no" -> 1, "yes" -> 0|>, "yes" -> <|"no" -> -100, "yes" -> 1|> |>]

The classifier decision is different despite the probabilities being unchanged:

Wolfram Language code: c2[2.55, "Probabilities"]

Wolfram Language code: c2[2.55]

Specifying a utility function when classifying supersedes the utility function specified at training:

Wolfram Language code: c2[2.55, UtilityFunction -> <|"no" -> <|"no" -> 1, "yes" -> 0|>, "yes" -> <|"no" -> 0, "yes" -> 1|> |>]

Update the value of the utility function in the classifier:

Wolfram Language code: c3 = Classify[c2, UtilityFunction -> <|"no" -> <|"no" -> 1, "yes" -> 0|>, "yes" -> <|"no" -> 0, "yes" -> 1|> |>]

Wolfram Language code: c3[2.55]

ValidationSet (1)

Train a logistic regression classifier on the Fisher iris data:

Wolfram Language code: trainingset = ExampleData[{"MachineLearning", "FisherIris"}, "TrainingData"];

Wolfram Language code: c1 = Classify[trainingset, Method -> "LogisticRegression"]

Obtain the L2 regularization coefficient of the trained classifier:

Wolfram Language code: Information[c1, "L2Regularization"]

Specify a validation set:

Wolfram Language code: validationset = ExampleData[{"MachineLearning", "FisherIris"}, "TestData"][[ ;; 10]];

Wolfram Language code: c2 = Classify[trainingset, ValidationSet -> validationset, Method -> "LogisticRegression"]

A different L2 regularization coefficient has been selected:

Wolfram Language code: Information[c2, "L2Regularization"]

Applications (10)

Titanic Survival (2)

Load the "Titanic" dataset, which contains a list of Titanic passengers with their age, sex, ticket class, and survival:

Wolfram Language code: dataset = ExampleData[{"MachineLearning", "Titanic"}, "Data"];

Visualize a sample of the dataset:

Wolfram Language code: RandomSample[dataset, 10] // TableForm

Train a logistic classifier on this dataset:

Wolfram Language code: c = Classify[dataset, Method -> "LogisticRegression"]

Calculate the survival probability of a 10-year-old girl traveling in third class:

Wolfram Language code: c[{"3rd", 10, "female"}, "Probability" -> "survived"]

Plot the survival probability as a function of age for some combinations of "class" and "sex":

Wolfram Language code: p[class_, age_, sex_] := c[{class, age, sex}, {"Probability", "survived"}];

Wolfram Language code:

Plot[{p["1st", x, "female"], p["3rd", x, "female"], p["1st", x, "male"], p["3rd", x, "male"]}, {x, 0, 100}, PlotLegends -> {"female, 1st class", "female, 3rd class", "male, 1st class", "male, 3rd class"}, Frame -> True, FrameLabel -> {"Age (years)", "Survival probability"}, Exclusions -> None]

Train a classifier to predict a person's odds of surviving or dying in the Titanic crash:

Wolfram Language code:

titanic = ResourceData["Sample Data: Titanic Survival"];
c = Classify[titanic -> "SurvivalStatus", Method -> "NearestNeighbors"]

Calculate the prior odds of a passenger dying:

Wolfram Language code:

baseProbability = Information[c, "TrainingClassPriors"]["died"];
priorOdds = baseProbability / (1 - baseProbability)

Use the classifier to predict the odds of a person dying:

Wolfram Language code:

dyingProb = c[{"1st", Quantity[80, "Years"], "male"}, "Probability" -> "died"];
 dyingOdds = dyingProb / (1 - dyingProb)

Get an explanation of how each feature multiplied the model's predicted odds of a class:

Wolfram Language code: shaps = c[{"1st", Quantity[80, "Years"], "male"}, "SHAPValues"]["died"]

Compare the model's explanation of feature impact to the base rate odds:

Wolfram Language code:

priorOdds * shaps["Class"] * shaps["Age"] * shaps["Sex"]
priorOdds * shaps["Class"] * shaps["Age"] * shaps["Sex"] == dyingOdds

Fisher's Iris (3)

Train a classifier on the Fisher iris dataset to predict the species of Iris:

Wolfram Language code: c = Classify[ExampleData[{"MachineLearning", "FisherIris"}, "TrainingData"]]

Predict the species of Iris from a list of features:

Wolfram Language code: c[{4.3, 3.1, 1.2, 0.3}]

Test the accuracy of the classifier on a test set:

Wolfram Language code: cm = ClassifierMeasurements[c, ExampleData[{"MachineLearning", "FisherIris"}, "TestData"]];

Wolfram Language code: cm["Accuracy"]

Generate a confusion matrix of the classifier on this test set:

Wolfram Language code: cm["ConfusionMatrixPlot"]

Train a classifier that classifies movie review snippets as "positive" or "negative":

Wolfram Language code: c = Classify[ExampleData[{"MachineLearning", "MovieReview"}, "TrainingData"]]

Classify an unseen movie review snippet:

Wolfram Language code:

c["the gorgeously elaborate continuation of \" the lord of the rings \" trilogy is so huge that a column of words cannot adequately describe co-writer/director peter jackson's expanded vision of j . r . r . tolkien's middle-earth . "]

Test the accuracy of the classifier on a test set:

Wolfram Language code: ClassifierMeasurements[c, ExampleData[{"MachineLearning", "MovieReview"}, "TestData"], "Accuracy"]

Import examples of the writings of Shakespeare, Oscar Wilde, and Victor Hugo to train a classifier:

Wolfram Language code:

Othello = Import["http://www.gutenberg.org/cache/epub/2267/pg2267.txt"];
Hamlet = Import["http://www.gutenberg.org/cache/epub/2265/pg2265.txt"];
Macbeth = Import["http://www.gutenberg.org/cache/epub/2264/pg2264.txt"];

Wolfram Language code:

TheImportanceOfBeingEarnest = Import["http://www.gutenberg.org/cache/epub/844/pg844.txt"];
ThePictureofDorianGray = Import["http://www.gutenberg.org/cache/epub/174/pg174.txt"];
AnIdealHusband = Import["http://www.gutenberg.org/files/885/885-0.txt"];

Wolfram Language code:

LesMiserables = Import["http://www.gutenberg.org/cache/epub/135/pg135.txt"];
NotreDamedeParis = Import["http://www.gutenberg.org/cache/epub/2610/pg2610.txt"];
TheManWhoLaughs = Import["http://www.gutenberg.org/cache/epub/12587/pg12587.txt"];

Generate an author classifier from these texts:

Wolfram Language code:

author = Classify[<|"William Shakespeare" -> {Othello, Hamlet}, "Oscar Wilde" -> {TheImportanceOfBeingEarnest, ThePictureofDorianGray}, "Victor Hugo" -> {LesMiserables, NotreDamedeParis}|>]

Find which author new texts are from:

Wolfram Language code: author[{Macbeth, AnIdealHusband, TheManWhoLaughs}]

Image Recognition (3)

Train a digit recognizer on 100 examples from the MNIST database of handwritten digits:

Wolfram Language code:

digit = Classify[
	{[image] -> 2, [image] -> 5, [image] -> 8, [image] -> 0, [image] -> 2, [image] -> 7, [image] -> 5, [image] -> 1, [image] -> 3, [image] -> 0, [image] -> 3, [image] -> 9, [image] -> 6, [image] -> 2, [image] -> 8, [image] -> 2, [image] -> 0, [image] -> 6, [image] -> 6, [image] -> 1, [image] -> 1, [image] -> 7, [image] -> 8, [image] -> 5, [image] -> 0, [image] -> 4, [image] -> 7, [image] -> 6, [image] -> 0, [image] -> 2, [image] -> 5, [image] -> 3, [image] -> 1, [image] -> 5, [image] -> 6, [image] -> 7, [image] -> 5, [image] -> 4, [image] -> 1, [image] -> 9, [image] -> 3, [image] -> 6, [image] -> 8, [image] -> 0, [image] -> 9, [image] -> 3, [image] -> 0, [image] -> 3, [image] -> 7, [image] -> 4, [image] -> 4, [image] -> 3, [image] -> 8, [image] -> 0, [image] -> 4, [image] -> 1, [image] -> 3, [image] -> 7, [image] -> 6, [image] -> 4, [image] -> 7, [image] -> 2, [image] -> 7, [image] -> 2, [image] -> 5, [image] -> 2, [image] -> 0, [image] -> 9, [image] -> 8, [image] -> 9, [image] -> 8, [image] -> 1, [image] -> 6, [image] -> 4, [image] -> 8, [image] -> 5, [image] -> 8, [image] -> 0, [image] -> 6, [image] -> 7, [image] -> 4, [image] -> 5, [image] -> 8, [image] -> 4, [image] -> 3, [image] -> 1, [image] -> 5, [image] -> 1, [image] -> 9, [image] -> 9, [image] -> 9, [image] -> 2, [image] -> 4, [image] -> 7, [image] -> 3, [image] -> 1, [image] -> 9, [image] -> 2, [image] -> 9, [image] -> 6}]

Use the classifier to recognize unseen digits:

Wolfram Language code: digit[{[image], [image], [image], [image], [image], [image], [image], [image], [image], [image]}]

Analyze probabilities of a misclassified example:

Wolfram Language code: digit[[image], "TopProbabilities"]

Train a classifier on 32 images of legendary creatures:

Wolfram Language code:

legendary = Classify[<|"Griffin" -> {[image], [image], [image], [image], [image], [image], [image], [image]}, "Centaur" -> {[image], [image], [image], [image], [image], [image], [image], [image]}, "Dragon" -> {[image], [image], [image], [image], [image], [image], [image], [image]}, "Unicorn" -> {[image], [image], [image], [image], [image], [image], [image], [image]}|>]

Use the classifier to recognize unseen creatures:

Wolfram Language code: legendary[{[image], [image], [image], [image]}]

Train a classifier to recognize daytime from nighttime:

Wolfram Language code:

daynight = Classify[
	{[image] -> "Night", [image] -> "Day", [image] -> "Night", [image] -> "Night", [image] -> "Day", [image] -> "Night", [image] -> "Day", [image] -> "Day", [image] -> "Night", [image] -> "Night", [image] -> "Day", [image] -> "Night", [image] -> "Night", [image] -> "Day", [image] -> "Night", [image] -> "Night", [image] -> "Day", [image] -> "Day", [image] -> "Day", [image] -> "Day", [image] -> "Night", [image] -> "Night", [image] -> "Day", [image] -> "Night", [image] -> "Night", [image] -> "Day", [image] -> "Day", [image] -> "Day", [image] -> "Night", [image] -> "Day"}]

Test it on examples:

Wolfram Language code: daynight[{[image], [image], [image], [image], [image]}]

Feature Explanation (1)

Import images of handwritten digits and select the 3s, 5s, and 8s:

Wolfram Language code: images = ResourceData["MNIST"];

Visualize a few of the images:

Wolfram Language code: RandomSample[images, 10]

Convert the images into their pixel values and separate their class:

Wolfram Language code:

pixels = Flatten /@ ImageData /@ images[[All, 1]];
digit = images[[All, 2]];

Train a classifier to identify the digit by its individual pixel values:

Wolfram Language code: c = Classify[pixels -> digit, Method -> "LogisticRegression", PerformanceGoal -> "DirectTraining"]

Learn a simple distribution of the data that treats each pixel as independent (for speed purposes):

Wolfram Language code: dist = LearnDistribution[pixels, Method -> {"Multinormal", "CovarianceType" -> "Diagonal"}]

Use the "SHAPValues" property to estimate how each pixel in an example impacted the predicted class:

Wolfram Language code:

example = Flatten[ImageData[[image]]];
shaps = c[example, "SHAPValues" -> 1, MissingValueSynthesis -> dist];

Take the Log to convert the "odds multiplier" SHAP values onto a scale centered at 0:

Wolfram Language code: pixelimpact = Log[shaps];

Look at the impact of each pixel weighted by its darkness by multiplying by the pixel values:

Wolfram Language code: darkimpact = pixelimpact * (1 - example);

Visualize how the pixels increased (red) or decreased (blue) the model's confidence the digit was a 0 or 6:

Wolfram Language code:

MatrixPlot[ArrayReshape[darkimpact[0], {28, 28}]]
MatrixPlot[ArrayReshape[darkimpact[6], {28, 28}]]

Fraud Detection (1)

Train a classifier to flag suspicious transactions based on a set of features:

Wolfram Language code:

fraudDetector = Classify[Dataset[{Association["Amount" -> Quantity[100.25, "USDollars"], 
   "MerchantCategory" -> "Electronics", "CardType" -> "Credit", "TimeOfDay" -> "Morning", 
   "IsSuspicious" -> False], Association["Amount" -> Quantity[75.5, "USDollars"], 
   "MerchantCategory" -> "Clothing", "CardType" -> "Debit", "TimeOfDay" -> "Afternoon", 
   "IsSuspicious" -> False], Association["Amount" -> Quantity[250., "USDollars"], 
   "MerchantCategory" -> "Jewelry", "CardType" -> "Credit", "TimeOfDay" -> "Evening", 
   "IsSuspicious" -> True], Association["Amount" -> Quantity[55.75, "USDollars"], 
   "MerchantCategory" -> "Groceries", "CardType" -> "Debit", "TimeOfDay" -> "Night", 
   "IsSuspicious" -> False], Association["Amount" -> Quantity[500., "USDollars"], 
   "MerchantCategory" -> "Electronics", "CardType" -> "Credit", "TimeOfDay" -> "Morning", 
   "IsSuspicious" -> True], Association["Amount" -> Quantity[300.2, "USDollars"], 
   "MerchantCategory" -> "Electronics", "CardType" -> "Credit", "TimeOfDay" -> "Afternoon", 
   "IsSuspicious" -> False], Association["Amount" -> Quantity[120.75, "USDollars"], 
   "MerchantCategory" -> "Clothing", "CardType" -> "Debit", "TimeOfDay" -> "Evening", 
   "IsSuspicious" -> False], Association["Amount" -> Quantity[400.5, "USDollars"], 
   "MerchantCategory" -> "Jewelry", "CardType" -> "Credit", "TimeOfDay" -> "Night", 
   "IsSuspicious" -> True], Association["Amount" -> Quantity[85.3, "USDollars"], 
   "MerchantCategory" -> "Groceries", "CardType" -> "Debit", "TimeOfDay" -> "Morning", 
   "IsSuspicious" -> False], Association["Amount" -> Quantity[750., "USDollars"], 
   "MerchantCategory" -> "Electronics", "CardType" -> "Credit", "TimeOfDay" -> "Afternoon", 
   "IsSuspicious" -> True]}] -> "IsSuspicious", Method -> "DecisionTree", FeatureTypes -> <|"Amount" -> "Numerical", "MerchantCategory" -> "Nominal", "CardType" -> "Nominal", "TimeOfDay" -> "Nominal"|>]

Plot the probability of fraud based only on the transaction amount:

Wolfram Language code: Plot[fraudDetector[<|"Amount" -> Quantity[q, "USDollars"]|>, "Probability" -> True], {q, 0, 800}]

Display the probability of fraud based on the card type and the transaction type:

Wolfram Language code:

BubbleChart[Table[{type, time, fraudDetector[<|"CardType" -> type, "TimeOfDay" -> time|>, "Probability" -> True]}, {type, {"Credit", "Debit"}}, {time, {"Morning", "Afternoon", "Evening", "Night"}}], ScalingFunctions -> {NominalScale[Automatic], OrdinalScale[{"Morning", "Afternoon", "Evening", "Night"}], None}, FrameLabel -> {{"TimeOfDay", None}, {"CardType", None}}]

Possible Issues (1)

The RandomSeeding option does not always guarantee reproducibility of the result:

Train several classifiers on the "Titanic" dataset:

Wolfram Language code: dataset = ExampleData[{"MachineLearning", "Titanic"}, "TrainingData"];

Wolfram Language code: classifiers = Table[Classify[dataset, RandomSeeding -> 1234], 4];

Compare the results when tested on a test set:

Wolfram Language code: testset = ExampleData[{"MachineLearning", "Titanic"}, "TestData"];

Wolfram Language code: SameQ@@(#[testset[[All, 1]]]& /@ classifiers)

Neat Examples (2)

Define and plot clusters sampled from normal distributions:

Wolfram Language code:

gaussian[μ_, σ_, n_] := RandomVariate[MultinormalDistribution[μ, {{σ, 0}, {0, σ}}], n];
positions = {{4, 2}, {-2, 2}, {0, -3}, {3, 0}};
sizes = {2, 1, 5, 0.5};
colors = {RGBColor[1, 0, 0], RGBColor[0, 0, 1], RGBColor[0, 1, 0], RGBColor[1., 0.77, 0.]};
nums = {100, 100, 50, 20};
	
clusters = MapThread[gaussian, {positions, sizes, nums}];
clusters = Table[RandomVariate[BinormalDistribution[
	RandomReal[{-3, 3}, 2], 
	RandomReal[{0.5, 2}, 2], 
	RandomReal[{0.2, 0.8}]], RandomInteger[{30, 40}]], {4}];
plot = ListPlot[clusters, PlotStyle -> Darker[colors, 0.1], ImageSize -> 200, PlotRange -> {{-5, 5}, {-5, 5}}, Frame -> True, AspectRatio -> 1, PlotLabel -> "data"]

Blend colors to reflect the probability density of the different classes for each method:

Wolfram Language code:

line = Range[-5, 5, 0.25];
points = Tuples[line, 2];

Wolfram Language code:

makecolormap[probs_]  := Transpose @ Partition[
	Map[Blend[Keys[#], Values[#]]&, probs], 
	Length[line]];

Wolfram Language code:

data = AssociationThread[colors, clusters];
methods = {"LogisticRegression", "NaiveBayes", "NearestNeighbors", "NeuralNetwork", "RandomForest", "SupportVectorMachine"};
Table[
	ArrayPlot[
	makecolormap @ Classify[data, points, "Probabilities", Method -> method], 
	PlotLabel -> method, DataReversed -> True, ImageSize -> 150], 
	{method, methods}]~Multicolumn~2 ~Legended~plot

Draw in the box to test a logistic classifier trained on the dataset ExampleData[{"MachineLearning","MNIST"}]:

Top

Classify

Details and Options

Examples

Basic Examples (2)

Scope (33)

Data Format (7)

Data Types (13)

Numerical (3)

Nominal (3)

Quantities (1)

Text (1)

Colors (1)

Images (1)

Sequences (1)

Missing Data (2)

Information (4)

Built-in Classifiers (9)

Options (23)

AcceptanceThreshold (1)

AnomalyDetector (1)

ClassPriors (1)

FeatureExtractor (3)

FeatureNames (2)

FeatureTypes (2)

IndeterminateThreshold (1)

Method (3)

MissingValueSynthesis (1)

RecalibrationFunction (1)

PerformanceGoal (1)

TargetDevice (1)

TimeGoal (2)

TrainingProgressReporting (1)

UtilityFunction (1)

ValidationSet (1)

Applications (10)

Titanic Survival (2)

Fisher's Iris (3)

Image Recognition (3)

Feature Explanation (1)

Fraud Detection (1)

Possible Issues (1)

Neat Examples (2)

See Also

Related Guides

Related Links

History

Text

CMS

APA

BibTeX

BibLaTeX

Classify