Wolfram Language & System Documentation Center

Search for all pages containing Predict

Predict

Predict[{in₁out₁,in₂out₂,…}]

generates a PredictorFunction that attempts to predict out_i from the example in_i.

Predict[data,input]

attempts to predict the output associated with input from the training examples given.

Predict[data,input,prop]

computes the specified property prop relative to the prediction.

Details and Options

Predict is used to model the relationship between a scalar variable and examples of many types of data, including numerical, textual, sounds and images.
This type of modelling, also known as regression analysis, is typically used for tasks like customer behavior analysis, healthcare outcomes prediction, credit risk assessment and more.

Complex expressions are automatically converted to simpler features like numbers or classes.

The final model type and hyperparameter values are selected using cross-validation on the training data.

The training data can have the following structure:

	{in₁out₁,in₂out₂,…}	a list of Rule between input and output
	{in₁,in₂,…}{out₁,out₂,…}	a Rule between inputs and corresponding outputs
	{list₁,list₂,…}n	the nth element of each List as the output
	{assoc₁,assoc₂,…}"key"	the "key" element of each Association as the output
	Dataset[…]column	the specified column of Dataset as the output
	Tabular[…]column	the specified column of Tabular as the output

In addition, special form of data include:
"name" a built-in prediction function

FittedModel[…] a fitted model converted into a PredictorFunction[…]
Each example input in_i can be a single data element, a list {feature₁, …} or an association <|"feature₁"value₁,…|> .
Each example output out_i must be a numerical value.
The prediction properties prop are the same as in PredictorFunction. They include:

	"Decision"	best prediction according to distribution and utility function
	"Distribution"	distribution of value conditioned on input
	"SHAPValues"	Shapley additive feature explanations for each example
	"SHAPValues"n	SHAP explanations using n samples
	"Properties"	list of all properties available

"SHAPValues" assesses the contribution of features by comparing predictions with different sets of features removed and then synthesized. The option MissingValueSynthesis can be used to specify how the missing features are synthesized. SHAP explanations are given as deviation from the training output mean.
Examples of built-in predictor functions include:
"NameAge" age of a person, given their first name
The following options can be given:

AnomalyDetector	None	anomaly detector used by the predictor
AcceptanceThreshold	Automatic	rarer probability threshold for anomaly detector
FeatureExtractor	Identity	how to extract features from which to learn
FeatureNames	Automatic	feature names to assign for input data
FeatureTypes	Automatic	feature types to assume for input data
IndeterminateThreshold	0	below what probability density to return Indeterminate
Method	Automatic	which regression algorithm to use
MissingValueSynthesis	Automatic	how to synthesize missing values
PerformanceGoal	Automatic	aspects of performance to try to optimize
RecalibrationFunction	Automatic	how to post-process predicted value
RandomSeeding	1234	what seeding of pseudorandom generators should be done internally
TargetDevice	"CPU"	the target device on which to perform training
TimeGoal	Automatic	how long to spend training the classifier
TrainingProgressReporting	Automatic	how to report progress during training
UtilityFunction	Automatic	utility as function of actual and predicted value
ValidationSet	Automatic	data on which to validate the model generated

Using FeatureExtractor"Minimal" indicates that the internal preprocessing should be as simple as possible.
Possible settings for Method include:

		"DecisionTree"	predict using a decision tree
		"GradientBoostedTrees"	predict using an ensemble of trees trained with gradient boosting
		"LinearRegression"	predict from linear combinations of features
		"NearestNeighbors"	predict from nearest neighboring examples
		"NeuralNetwork"	predict using an artificial neural network
		"RandomForest"	predict from Breiman–Cutler ensembles of decision trees
		"GaussianProcess"	predict using a Gaussian process prior over functions

Possible settings for PerformanceGoal include:

	"DirectTraining"	train directly on the full dataset, without model searching
	"Memory"	minimize storage requirements of the predictor
	"Quality"	maximize accuracy of the predictor
	"Speed"	maximize speed of the predictor
	"TrainingSpeed"	minimize time spent producing the predictor
	Automatic	automatic tradeoff among speed, accuracy and memory
	{goal₁,goal₂,…}	automatically combine goal₁, goal₂, etc.

The following settings for TrainingProgressReporting can be used:

	"Panel"	show a dynamically updating graphical panel
	"Print"	periodically report information using Print
	"ProgressIndicator"	show a simple ProgressIndicator
	"SimplePanel"	dynamically updating panel without learning curves
	None	do not report any information

Information can be used on the PredictorFunction[…] obtained.

Examples

open all close all

Basic Examples (2)

Learn to predict the third column of a matrix using the features in the first two columns:

Wolfram Language code:

p = Predict[(⁠|      |     |      |
| ---- | --- | ---- |
| 1.3  | "P" | 1    |
| 1.8  | "Q" | 2.5  |
| 1.9  | "Q" | 3    |
| 0.2  | "P" | 1    |
| -3.2 | "P" | -4.2 |
| 0.3  | "Q" | 2    |⁠) -> 3]

Predict the value of a new example, given its features:

Wolfram Language code: p[{1.8, "Q"}]

Predict the value of a new example that has a missing feature:

Wolfram Language code: p[{1.8, Missing[]}]

Predict the value of a multiple examples at the same time:

Wolfram Language code: p[{{1.8, "Q"}, {.4, "P"}}]

Train a linear regression on a set of examples:

Wolfram Language code: data = {1 -> 1.3, 2 -> 2.4, 3 -> 4.4, 4 -> 5.1, 6 -> 7.3};p = Predict[data, Method -> "LinearRegression"]

Get the conditional distribution of the predicted value, given the example feature:

Wolfram Language code: 𝒟 = p[1.5, "Distribution"]

Plot the probability density of the distribution:

Wolfram Language code: Plot[PDF[𝒟, x], {x, 0, 4}]

Plot the prediction with a confidence band together with the training data:

Wolfram Language code:

Show[
	ListPlot[List@@@data, PlotStyle -> {Red, PointSize@Large}], 
	ListLinePlot[Table[{x, Around@@p[x, "Distribution"]}, {x, 0, 10}], IntervalMarkers -> "Bands"]
	]

Scope (24)

Data Format (7)

Specify the training set as a list of rules between an input examples and the output value:

Wolfram Language code: Predict[{1 -> -1.14, 2 -> -0.34, 3 -> 0.46, 4 -> 1.26, 5 -> 2.06}]

Each example can contain a list of features:

Wolfram Language code:

Predict[{{-0.78, 0.58} -> 0.86, {-0.62, -0.52} -> 2.28, {-0.87, 0.08} -> 1.18, {-0.54, -0.21} -> 2.13, {0.4, -0.58} -> 4.38}]

Each example can contain an association of features:

Wolfram Language code:

Predict[{<|"f1" -> -0.78, "f2" -> 0.58|> -> 0.86, <|"f1" -> -0.62, "f2" -> -0.52|> -> 2.28, <|"f1" -> -0.87, "f2" -> 0.08|> -> 1.18, <|"f1" -> -0.54, "f2" -> -0.21|> -> 2.13, <|"f1" -> 0.4, "f2" -> -0.58|> -> 4.38}]

Specify the training set a list of rule between a list of input and a list of output:

Wolfram Language code: Predict[{1, 2, 3, 4, 5} -> {-1.14, -0.34, 0.46, 1.26, 2.06}]

Specify all the data in a matrix and mark the output column:

Wolfram Language code: Predict[{{1, -1.14}, {2, -0.34}, {3, 0.46}, {4, 1.26}, {5, 2.06}} -> 2]

Specify all the data in a list of associations and mark the output key:

Wolfram Language code:

Predict[{<|"f1" -> 1, "f2" -> -1.14|>, <|"f1" -> 2, "f2" -> -0.34|>, <|"f1" -> 3, "f2" -> 0.46|>, <|"f1" -> 4, "f2" -> 1.26|>, <|"f1" -> 5, "f2" -> 2.06|>} -> "f2"]

Specify all the data in a dataset and mark the output column:

Wolfram Language code:

Predict[Dataset[{Association["f1" -> 1, "f2" -> -1.1400000000000001], 
  Association["f1" -> 2, "f2" -> -0.34], Association["f1" -> 3, "f2" -> 0.46], 
  Association["f1" -> 4, "f2" -> 1.26], Association["f1" -> 5, "f2" -> 2.06]}] -> "f2"]

Data Types (13)

Numerical (3)

Predict a variable from a number:

Wolfram Language code: Predict[{0.72 -> -1.56, 0.36 -> -2.28, -0.18 -> -3.36, -0.4 -> -3.8, 0.06 -> -2.88}]

Predict a variable from a numerical vector:

Wolfram Language code:

Predict[{{0.19, 0.44} -> 2.94, {0.82, -0.99} -> 5.63, {-0.82, 0.83} -> 0.53, {-0.25, -0.27} -> 2.77, {-0.9, 0.81} -> 0.39}]

Predict a variable from a numerical array or arbitrary depth:

Wolfram Language code:

Predict[{{{0.08, 0.73}, {0.33, -0.45}} -> -0.37, {{0.28, 0.4}, {-0.34, -0.92}} -> -0.64, {{-0.82, 0.35}, {-0.17, 0.93}} -> 0.11, {{0.46, -0.33}, {0.67, 0.82}} -> 1.28, {{0.39, -0.6}, {0.7, 0.34}} -> 0.73}]

Nominal (3)

Predict a variable from a nominal value:

Wolfram Language code: Predict[{"B" -> 0.61, "A" -> -0.93, "B" -> 0.26, "B" -> 0.65, "B" -> 0.47}]

Predict a variable from several nominal values:

Wolfram Language code:

p = Predict[<|"Treatment" -> {"A", "B", "A", "C", "B", "C", "A", "B", "C", "A"}, "Severity" -> {"High", "Medium", "Low", "High", "Low", "Medium", "Medium", "High", "Low", "High"}, "Recovery Time" -> {8, 6, 4, 9, 5, 7, 6, 8, 5, 8}|> -> "Recovery Time"]

Wolfram Language code: p[<|"Treatment" -> "C", "Severity" -> "High"|>]

Predict a variable from a mixture of nominal and numerical values:

Wolfram Language code:

p = Predict[Dataset[{Association["Treatment" -> "A", "Severity" -> "High", "Patient's History" -> "Yes", 
   "Age" -> 55], Association["Treatment" -> "B", "Severity" -> "Medium", 
   "Patient's History" -> "No", "Age" -> 45], Association["Treatment" -> "A", "Severity" -> "Low", 
   "Patient's History" -> "Yes", "Age" -> 30], Association["Treatment" -> "C", 
   "Severity" -> "High", "Patient's History" -> "No", "Age" -> 60], 
  Association["Treatment" -> "B", "Severity" -> "Low", "Patient's History" -> "Yes", "Age" -> 35], 
  Association["Treatment" -> "C", "Severity" -> "Medium", "Patient's History" -> "No", 
   "Age" -> 50]}] -> Dataset[{Association["Recovery Time" -> 8], Association["Recovery Time" -> 6], 
  Association["Recovery Time" -> 4], Association["Recovery Time" -> 9], 
  Association["Recovery Time" -> 5], Association["Recovery Time" -> 7]}]]

Wolfram Language code: p[<|"Treatment" -> "C", "Age" -> 42, "Patient's History" -> "No"|>]

Quantities (1)

Train a predictor on data including Quantity objects:

Wolfram Language code:

p = Predict[Dataset[{Association["Neighborhood" -> "Sunnypoint", "Area" -> Quantity[1500, "Feet"^2], 
   "Price" -> Quantity[300000, "USDollars"]], Association["Neighborhood" -> "Moonbrook", 
   "Area" -> Quantity[1800, "Feet"^2], "Price" -> Quantity[360000, "USDollars"]], 
  Association["Neighborhood" -> "Sunnypoint", "Area" -> Quantity[1700, "Feet"^2], 
   "Price" -> Quantity[340000, "USDollars"]], Association["Neighborhood" -> "Starville", 
   "Area" -> Quantity[2000, "Feet"^2], "Price" -> Quantity[500000, "USDollars"]], 
  Association["Neighborhood" -> "Moonbrook", "Area" -> Quantity[1600, "Feet"^2], 
   "Price" -> Quantity[320000, "USDollars"]], Association["Neighborhood" -> "Starville", 
   "Area" -> Quantity[2200, "Feet"^2], "Price" -> Quantity[550000, "USDollars"]], 
  Association["Neighborhood" -> "Sunnypoint", "Area" -> Quantity[1400, "Feet"^2], 
   "Price" -> Quantity[280000, "USDollars"]], Association["Neighborhood" -> "Moonbrook", 
   "Area" -> Quantity[1900, "Feet"^2], "Price" -> Quantity[380000, "USDollars"]], 
  Association["Neighborhood" -> "Starville", "Area" -> Quantity[2100, "Feet"^2], 
   "Price" -> Quantity[520000, "USDollars"]], Association["Neighborhood" -> "Sunnypoint", 
   "Area" -> Quantity[1800, "Feet"^2], "Price" -> Quantity[360000, "USDollars"]]}] -> "Price"]

Use the predictor on a new example:

Wolfram Language code: p[<|"Neighborhood" -> "Moonbrook", "Area" -> Quantity[900, "Feet"^2]|>]

Predict the most likely price when only the "Neighborhood" is known:

Wolfram Language code: p[<|"Neighborhood" -> "Moonbrook"|>]

Text (1)

Get some spam probability data:

Wolfram Language code:

data = IconizedObject[«spam score»];
First[data]

Train a predictor on the text:

Wolfram Language code: p = Predict[data]

Use the predictor on new examples:

Wolfram Language code: p[{"Monday meeting rescheduled", "You have won a yacht!"}]

Colors (1)

Predict a variable from a color expression:

Wolfram Language code:

Predict[{RGBColor[0.374533954339318, 0.07318946913456625, 0.34948076712998266], RGBColor[0.7085138807325013, 0.5508514126801727, 0.270668676806443], RGBColor[0.482424250096966, 0.9866683639813978, 0.8763664701273515], RGBColor[0.9143901752719417, 0.5933340939498908, 0.12490074751913904], RGBColor[0.5435848746538026, 0.3320246736743131, 0.10546267186910474], RGBColor[0.4838618865454345, 0.1408408308353548, 0.4182003855819225], RGBColor[0.09486596097451105, 0.4061453278622089, 0.19070989282627604], RGBColor[0.9978640282936782, 0.9850053326406092, 0.6079629315168287], RGBColor[0.7423358808871543, 0.14441876671667964, 0.44156741749306994], RGBColor[0.11647231911978806, 0.14156287966553882, 0.7585884183769138], RGBColor[0.413441306192059, 0.35488417001075656, 0.4547933589911124], RGBColor[0.6479088100736257, 0.06575533518383936, 0.33956768046537045], RGBColor[0.43617515288692754, 0.395321922119783, 0.03284974966398546], RGBColor[0.08988572988782773, 0.45387554090065807, 0.565445988533821], RGBColor[0.5962404964416488, 0.7115775209469246, 0.5937470103610984], RGBColor[0.5425615924830165, 0.9052557429558079, 0.7950524706591187], RGBColor[0.258709017611374, 0.05898221147135585, 0.04550888101080042], RGBColor[0.733150150511475, 0.5972827779202361, 0.34440914488265517], RGBColor[0.9604086664357256, 0.9762505577301217, 0.6004232657263842], RGBColor[0.10316670441111997, 0.3667737206326349, 0.46355431351130827]} -> {0.22, 0.61, 0.91, 0.7, 0.42, 0.31, 0.38, 0.97, 0.44, 0.26, 0.41, 0.37, 0.43, 0.45, 0.71, 0.85, 0.13, 0.65, 0.96, 0.37}, {Red, Green, Blue}]

Images (1)

Train a predictor to predict the colored area of an image:

Wolfram Language code:

Predict[{[image] -> 40.2, [image] -> 8.9, [image] -> 11., [image] -> 4.9, [image] -> 13.6, [image] -> 15.6, [image] -> 14.7, [image] -> 3.8, [image] -> 34.7, [image] -> 10.8, [image] -> 4., [image] -> 16.1, [image] -> 3.3, [image] -> 8.3, [image] -> 12.6}]

Sequences (1)

Train a predictor on data where the feature is a sequence of tokens:

Wolfram Language code: Predict[{{"butter", "sugar", "flour"} -> 0.2, {"flour", "butter"} -> 1.4, {"tomato", "salt"} -> 0.9}]

Missing Data (2)

Train on a dataset containing missing features:

Wolfram Language code:

Predict[{{2.3, "male"} -> 1, {4.8, Missing[]} -> 2.5, {Missing[], "female"} -> 8.4, {5.2, "female"} -> -2, {Missing[], "male"} -> -4.2, {1.3, "male"} -> 10}]

Train a predictor on a dataset with named features. The order of the keys does not matter. Keys can be missing:

Wolfram Language code:

p = Predict[{
	<|"age" -> 24, "sex" -> "female"|> -> 10.4, 
	<|"sex" -> "male", "age" -> 13|> -> 5.2, 
	<|"age" -> 57|> -> 23.3, 
	<|"sex" -> "male"|> -> 14.3}]

Predict examples containing missing features:

Wolfram Language code: p[{<|"age" -> 31|>, <|"sex" -> "male"|>, <||>}]

Information (4)

Extract information from a trained predictor:

Wolfram Language code:

Information[PredictorFunction[Association["ExampleNumber" -> 6, 
  "Input" -> Association["Preprocessor" -> MachineLearning`MLProcessor["ToMLDataset", 
      Association["Input" -> Association["f1" -> Association["Type" -> "Numerical"], 
         "f2" -> Assoc ...   "Date" -> DateObject[{2023, 7, 26, 16, 30, 52.286118`8.470961373666972}, "Instant", 
      "Gregorian", 2.], "ProcessorCount" -> 10, "ProcessorType" -> "ARM64", 
    "OperatingSystem" -> "MacOSX", "SystemWordLength" -> 64, "Evaluations" -> {}]]]]

Get information about the input features:

Wolfram Language code:

Information[PredictorFunction[Association["ExampleNumber" -> 6, 
  "Input" -> Association["Preprocessor" -> MachineLearning`MLProcessor["ToMLDataset", 
      Association["Input" -> Association["f1" -> Association["Type" -> "Numerical"], 
         "f2" -> Assoc ...   "Date" -> DateObject[{2023, 7, 26, 16, 30, 52.286118`8.470961373666972}, "Instant", 
      "Gregorian", 2.], "ProcessorCount" -> 10, "ProcessorType" -> "ARM64", 
    "OperatingSystem" -> "MacOSX", "SystemWordLength" -> 64, "Evaluations" -> {}]]], #]& /@ {"FeatureNames", "FeatureNumber", "FeatureTypes"}

Get the feature extractor used to process the input features:

Wolfram Language code:

Information[PredictorFunction[Association["ExampleNumber" -> 6, 
  "Input" -> Association["Preprocessor" -> MachineLearning`MLProcessor["ToMLDataset", 
      Association["Input" -> Association["f1" -> Association["Type" -> "Numerical"], 
         "f2" -> Assoc ...   "Date" -> DateObject[{2023, 7, 26, 16, 30, 52.286118`8.470961373666972}, "Instant", 
      "Gregorian", 2.], "ProcessorCount" -> 10, "ProcessorType" -> "ARM64", 
    "OperatingSystem" -> "MacOSX", "SystemWordLength" -> 64, "Evaluations" -> {}]]], "FeatureExtractor"]

Get a list of the supported properties

Wolfram Language code:

Information[PredictorFunction[Association["ExampleNumber" -> 6, 
  "Input" -> Association["Preprocessor" -> MachineLearning`MLProcessor["ToMLDataset", 
      Association["Input" -> Association["f1" -> Association["Type" -> "Numerical"], 
         "f2" -> Assoc ...   "Date" -> DateObject[{2023, 7, 26, 16, 30, 52.286118`8.470961373666972}, "Instant", 
      "Gregorian", 2.], "ProcessorCount" -> 10, "ProcessorType" -> "ARM64", 
    "OperatingSystem" -> "MacOSX", "SystemWordLength" -> 64, "Evaluations" -> {}]]], "Properties"]

Options (23)

AcceptanceThreshold (1)

Create a predictor with an anomaly detector:

Wolfram Language code: p = Predict[{1 -> 1.2, 2 -> 3.5, 3.5 -> 5.4, 4 -> 2.3}, AnomalyDetector -> Automatic]

Change the value of the acceptance threshold when evaluating the predictor:

Wolfram Language code: p[6, AcceptanceThreshold -> 0.01]

Wolfram Language code: p[6, AcceptanceThreshold -> 0.0001]

Permanently change the value of the acceptance threshold in the predictor:

Wolfram Language code: p2 = Predict[p, AcceptanceThreshold -> 0.01]

Wolfram Language code: p2[6]

AnomalyDetector (1)

Create a predictor and specify that an anomaly detector should be included:

Wolfram Language code: p = Predict[{1 -> 1.2, 2 -> 3.5, 3.5 -> 5.4, 4 -> 2.3}, AnomalyDetector -> Automatic]

Evaluate the predictor on a non-anomalous input:

Wolfram Language code: p[1.2]

Evaluate the predictor on an anomalous input:

Wolfram Language code: p[100000.2]

The "Distribution" property is not affected by the anomaly detector:

Wolfram Language code: p[100000.2, "Distribution"]

Temporarily remove the anomaly detector from the predictor:

Wolfram Language code: p[10000.2, AnomalyDetector -> None]

Permanently remove the anomaly detector from the predictor:

Wolfram Language code: p2 = Predict[p, AnomalyDetector -> None]

Wolfram Language code: p2[10000.2]

FeatureExtractor (2)

Generate a predictor function using FeatureExtractor to preprocess the data using a custom function:

Wolfram Language code:

data = {DateObject[{2014, 5, 5}, TimeObject[{9, 53, 6.30158}, TimeZone -> -5.], TimeZone -> -5.] -> 1, DateObject[{2000, 1, 1}, TimeObject[{0, 0, 0.}, TimeZone -> -5.], TimeZone -> -5.] -> 2, DateObject[{2007, 8, 23}] -> 3, DateObject[{2016, 4, 4}, TimeObject[{15, 59, 18.2738}, TimeZone -> -4.], TimeZone -> -4.] -> 4};

Wolfram Language code: p = Predict[data, FeatureExtractor -> ({AbsoluteTime[#], #["Year"]}&)]

Add the "StandardizedVector" method to the preprocessing pipeline:

Wolfram Language code: p = Predict[data, FeatureExtractor -> {{AbsoluteTime[#], #["Year"]}&, "StandardizedVector"}]

Use the predictor on new data:

Wolfram Language code: p[DateObject[{2017, 1, 18}, TimeObject[{23, 24, 10.099}, TimeZone -> -5.], TimeZone -> -5.]]

Create a feature extractor and extract features from a dataset:

Wolfram Language code:

{features, fe} = FeatureExtraction[{DateObject[{2014, 5, 5, 9, 53, 6.30158}, "Instant", "Gregorian", -5.], DateObject[{2000, 1, 1, 0, 0, 0.}, "Instant", "Gregorian", -5.], DateObject[{2007, 8, 23}, "Day", "Gregorian", -5.], DateObject[{2016, 4, 4, 15, 59, 18.2738}, "Instant", "Gregorian", -4.]}, {{AbsoluteTime[#], #["Year"]}&, "StandardizedVector"}, {"ExtractedFeatures", "ExtractorFunction"}]

Train a predictor on the extracted features:

Wolfram Language code: p = Predict[features -> {1, 2, 3, 4}]

Join the feature extractor to the predictor:

Wolfram Language code: p2 = Predict[p, FeatureExtractor -> fe]

The predictor can now be used on the initial input type:

Wolfram Language code: p2[DateObject[{2017, 1, 18}, TimeObject[{23, 24, 10.099}, TimeZone -> -5.], TimeZone -> -5.]]

FeatureNames (2)

Train a predictor and give a name to each feature:

Wolfram Language code:

p = Predict[{{2.3, "male"} -> 1, {4.8, Missing[]} -> 2.5, {Missing[], "female"} -> 8.4, {5.2, "female"} -> -2, {Missing[], "male"} -> -4.2, {1.3, "male"} -> 10}, FeatureNames -> {"age", "gender"}]

Use the association format to predict a new example:

Wolfram Language code: p[<|"age" -> 3.3, "gender" -> "male"|>]

The list format can still be used:

Wolfram Language code: p[{3.3, "male"}]

Train a predictor on a training set with named features and use FeatureNames to set their order:

Wolfram Language code:

p = Predict[{<|"age" -> 2.3, "gender" -> "male"|> -> 1, <|"age" -> 4.6|> -> 2.5, <|"gender" -> "female"|> -> 8.4, <|"gender" -> "female", "age" -> 5.2|> -> -2}, FeatureNames -> {"gender", "age"}]

Features are ordered as specified:

Wolfram Language code: Information[p, FeatureNames]

Predict a new example from a list:

Wolfram Language code: p[{"female", 6.5}]

FeatureTypes (2)

Train a predictor on textual and nominal data:

Wolfram Language code: trainingset = {{"example", "a"} -> 1.4, {"example", "a"} -> 2.7, {"an example again", "b"} -> 2.7};

Wolfram Language code: p = Predict[trainingset]

The first feature has been wrongly interpreted as a nominal feature:

Wolfram Language code: Information[p, FeatureTypes]

Specify that the first feature should be considered textual:

Wolfram Language code: p = Predict[trainingset, FeatureTypes -> {"Text", "Nominal"}]

Wolfram Language code: Information[p, FeatureTypes]

Predict a new example:

Wolfram Language code: p[{"a new example", "b"}]

Train a predictor with named features:

Wolfram Language code:

trainingset = {
	<|"age" -> 32, "gender" -> 1|> -> 4.3, 
	<|"age" -> 41, "gender" -> 2|> -> 1.2, 
	<|"age" -> 17, "gender" -> 2|> -> 1.4, 
	<|"age" -> 11, "gender" -> 1|> -> 5.1};

Wolfram Language code: p = Predict[trainingset]

Both features have been considered numerical:

Wolfram Language code: Information[p, FeatureTypes]

Specify that the feature "gender" should be considered nominal:

Wolfram Language code: p = Predict[trainingset, FeatureTypes -> <|"gender" -> "Nominal"|>]

Wolfram Language code: Information[p, FeatureTypes]

IndeterminateThreshold (1)

Specify a probability density threshold when training the predictor:

Wolfram Language code: p = Predict[{1 -> 1.2, 2 -> 1.4, 3 -> 4.5, 4 -> 6.8}, IndeterminateThreshold -> 0.5]

Visualize the probability density for a given example:

Wolfram Language code:

example = 3.4;
pdf = PDF[p[example, "Distribution"]]

Wolfram Language code: Plot[pdf[x], {x, 2, 8}, PlotRange -> All]

As no value has a probability density above 0.5, no prediction is made:

Wolfram Language code: p[example]

Specifying a threshold when predicting supersedes the trained threshold:

Wolfram Language code: p[example, IndeterminateThreshold -> 0.]

Update the value of the threshold in the predictor:

Wolfram Language code: p2 = Predict[p, IndeterminateThreshold -> 0.]

Wolfram Language code: p2[example]

Method (4)

Train a linear predictor:

Wolfram Language code: trainingset = {1, 2, 3, 4, 5, 6} -> {2, 3, 5, 8, 9, 7};

Wolfram Language code: linear = Predict[trainingset, Method -> "LinearRegression"]

Train a nearest-neighbors predictor:

Wolfram Language code: nn = Predict[trainingset, Method -> "NearestNeighbors"]

Plot the predicted value as a function of the feature for both predictors:

Wolfram Language code: Plot[{linear[x], nn[x]}, {x, 0, 7}, Exclusions -> None]

Train a random forest predictor:

Wolfram Language code: trainingset = ExampleData[{"MachineLearning", "BostonHomes"}, "TrainingData"];

Wolfram Language code: p1 = Predict[trainingset, Method -> "RandomForest"]

Find the standard deviation of the residuals on a test set:

Wolfram Language code: testset = ExampleData[{"MachineLearning", "BostonHomes"}, "TestData"];

Wolfram Language code: PredictorMeasurements[p1, testset, "StandardDeviation"]

In this example, using a linear regression predictor increases the standard deviation of the residuals:

Wolfram Language code: p2 = Predict[trainingset, Method -> "LinearRegression"]

Wolfram Language code: PredictorMeasurements[p2, testset, "StandardDeviation"]

However, using a linear regression predictor reduces the training time:

Wolfram Language code: Information[#, "TrainingTime"] & /@ {p1, p2}

Train a linear regression, neural network, and Gaussian process predictor:

Wolfram Language code: data = {1, 2, 3, 4, 5, 6, 7, 8, 9} -> {1, 2, 3, 4, 5, 6, 7, 8, 9} ^ 4;

Wolfram Language code: {neural, linear, gaussprocess} = Predict[data, Method -> #]& /@ {"NeuralNetwork", "LinearRegression", "GaussianProcess"}

These methods produce smooth predictors:

Wolfram Language code: Show[ListPlot@data[[2]], Plot[{neural[x], linear[x], gaussprocess[x]}, {x, 0, 10}, Exclusions -> None]]

Train a random forest and nearest-neighbor predictor:

Wolfram Language code: {nearest, forest} = Predict[data, Method -> #]& /@ {"NearestNeighbors", "RandomForest"}

These methods produce non-smooth predictors:

Wolfram Language code: Show[ListPlot@data[[2]], Plot[{forest[x], nearest[x]}, {x, 0, 10}, Exclusions -> None]]

Train a neural network, a random forest, and a Gaussian process predictor:

Wolfram Language code: data = Table[n -> Sin[n], {n, 1, 10}];

Wolfram Language code:

{neuralnetwork, randomforest, gaussianprocess} = Predict[data, Method -> #]& /@ {"NeuralNetwork", "RandomForest", "GaussianProcess"}

The Gaussian process predictor is smooth and handles small datasets well:

Wolfram Language code:

Show[Plot[{neuralnetwork[x], randomforest[x], gaussianprocess[x]}, {x, 1, 10}, PlotLegends -> {"NeuralNetwork", "RandomForest", "GaussianProcess"}, Frame -> True, Exclusions -> None], ListPlot[List@@@data, PlotStyle -> Directive[PointSize[Medium], Red]]]

MissingValueSynthesis (1)

Train a predictor with two input features:

Wolfram Language code:

x = {{1, 3}, {2, 4}, {3, 5}, {4, 4}, {5, 8}, {6, 9}, {7, 4}, {8, 6}, {9, 12}};
y = {2, 4, 5, 4, 6, 7, 4, 5, 9};
p = Predict[x -> y]

Get the prediction for an example that has a missing value:

Wolfram Language code: p[{5, Missing[]}]

Set the missing value synthesis to replace each missing variable with its estimated most likely value given known values (which is the default behavior):

Wolfram Language code: p[{5, Missing[]}, MissingValueSynthesis -> "ModeFinding"]

Replace missing variables with random samples conditioned on known values:

Wolfram Language code: p[{5, Missing[]}, MissingValueSynthesis -> "RandomSampling"]

Averaging over many random imputations is usually the best strategy and allows obtaining the uncertainty caused by the imputation:

Wolfram Language code: MeanAround[Table[p[{5, Missing[]}, MissingValueSynthesis -> "RandomSampling"], 100]]

Specify a learning method during training to control how the distribution of data is learned:

Wolfram Language code: p = Predict[x -> y, MissingValueSynthesis -> "KernelDensityEstimation"]

Predict an example with missing values using the "KernelDensityEstimation" distribution to condition values:

Wolfram Language code: p[{5, Missing[]}]

Provide an existing LearnedDistribution at training to use it when imputing missing values during training and later evaluations:

Wolfram Language code:

dist = LearnDistribution[x, Method -> "Multinormal"];
p = Predict[x -> y, MissingValueSynthesis -> dist];
p[{5, Missing[]}]

Specify an existing LearnedDistribution to synthesize missing values for an individual evaluation:

Wolfram Language code:

dist2 = LearnDistribution[x, Method -> "KernelDensityEstimation"];
p[{5, Missing[]}, MissingValueSynthesis -> dist2]

Control both the learning method and the evaluation strategy by passing an association at training:

Wolfram Language code:

p = Predict[x -> y, MissingValueSynthesis -> 
	<|"LearningMethod" -> "Multinormal", "EvaluationStrategy" -> "RandomSampling"|>];
p[{5, Missing[]}]

PerformanceGoal (1)

Train a predictor with an emphasis on training speed:

Wolfram Language code: trainingset = ExampleData[{"MachineLearning", "WineQuality"}, "TrainingData"];

Wolfram Language code: p1 = Predict[trainingset, PerformanceGoal -> "TrainingSpeed"]

Wolfram Language code: Information[p1, "TrainingTime"]

Find the standard deviation of the residuals on a test set:

Wolfram Language code: testset = ExampleData[{"MachineLearning", "WineQuality"}, "TestData"];

Wolfram Language code: PredictorMeasurements[p1, testset, "StandardDeviation"]

By default, a compromise between prediction speed and performance is sought:

Wolfram Language code: p2 = Predict[trainingset]

Wolfram Language code: Information[p2, "TrainingTime"]

Wolfram Language code: PredictorMeasurements[p2, testset, "StandardDeviation"]

With the same data, train a predictor with an emphasis on training speed and memory:

Wolfram Language code: p3 = Predict[trainingset, PerformanceGoal -> {"TrainingSpeed", "Memory"}]

The predictor uses less memory, but is also less accurate:

Wolfram Language code: ByteCount /@ {p2, p3}

Wolfram Language code: PredictorMeasurements[p3, testset, "StandardDeviation"]

RecalibrationFunction (1)

Load the Boston Homes dataset:

Wolfram Language code:

training = RandomSample[ResourceData["Sample Data: Boston Homes", "TrainingData"]];
test = ResourceData["Sample Data: Boston Homes", "TestData"];

Train a predictor with model calibration:

Wolfram Language code: p = Predict[training, Method -> "RandomForest", RecalibrationFunction -> All]

Visualize the comparison plot on a test set:

Wolfram Language code: PredictorMeasurements[p, test, "ComparisonPlot"]

Remove the recalibration function from the predictor:

Wolfram Language code: p2 = Predict[p, RecalibrationFunction -> None]

Visualize the new comparison plot:

Wolfram Language code: PredictorMeasurements[p2, test, "ComparisonPlot"]

TargetDevice (1)

Train a predictor on the system's default GPU using a neural network and look at the AbsoluteTiming:

Wolfram Language code:

n = 10000;
trainingData = RandomReal[1, {n, 4}] -> RandomReal[1, n];
AbsoluteTiming[predictor = Predict[trainingData, Method -> "NeuralNetwork", TargetDevice -> "GPU"]]

Compare the previous result with the one achieved by using the default CPU computation:

Wolfram Language code: AbsoluteTiming[predictor = Predict[trainingData, Method -> "NeuralNetwork"]]

TimeGoal (2)

Train a predictor while specifying a total training time of 3 seconds:

Wolfram Language code: p = Predict[{1, 2, 3, 4} -> {1, 2, 3, 4}, TimeGoal -> Quantity[3, "Seconds"]]

Wolfram Language code: Information[p, "TrainingTime"]

Load the "BostonHomes" dataset:

Wolfram Language code:

dataset = ExampleData[{"MachineLearning", "BostonHomes"}, "Data"];
testset = ExampleData[{"MachineLearning", "BostonHomes"}, "TestData"];

Train a predictor while specifying a target training time of 0.1 seconds:

Wolfram Language code: p = Predict[dataset, TimeGoal -> .1]

The predictor reached a standard deviation of about 3.2:

Wolfram Language code: PredictorMeasurements[p, testset, "StandardDeviation"]

Train a classifier while specifying a target training time of 5 seconds:

Wolfram Language code: p = Predict[dataset, TimeGoal -> 5]

The standard deviation of the predictor is now around 2.7:

Wolfram Language code: PredictorMeasurements[p, testset, "StandardDeviation"]

TrainingProgressReporting (1)

Load the "WineQuality" dataset:

Wolfram Language code: dataset = ExampleData[{"MachineLearning", "WineQuality"}, "Data"];

Show training progress interactively during training of a predictor:

Wolfram Language code: Predict[dataset, TrainingProgressReporting -> "Panel"];

Show training progress interactively without plots:

Wolfram Language code: Predict[dataset, TrainingProgressReporting -> "SimplePanel"];

Print training progress periodically during training:

Wolfram Language code: Predict[dataset, TrainingProgressReporting -> "Print"];

Show a simple progress indicator:

Wolfram Language code: Predict[dataset, TrainingProgressReporting -> "ProgressIndicator"];

Do not report progress:

Wolfram Language code: Predict[dataset, TrainingProgressReporting -> None];

UtilityFunction (2)

Train a predictor:

Wolfram Language code: trainingset = {1 -> 1.1, 2 -> 4.4, 3 -> 6.1, 4 -> 7.1, 5 -> 9.2};

Wolfram Language code: p1 = Predict[trainingset]

Visualize the probability density for a given example:

Wolfram Language code:

example = 2.4;
pdf = PDF[p1[example, "Distribution"]]

Wolfram Language code: Plot[pdf[x], {x, 1, 7}, PlotRange -> All]

By default, the value with the highest probability density is predicted:

Wolfram Language code: p1[example]

This corresponds to a Dirac delta utility function:

Wolfram Language code: Information[p1, UtilityFunction]

Define a utility function that penalizes the predicted value's being smaller than the actual value:

Wolfram Language code: utility[a_, p_] := -Piecewise[{{Exp[p - a], a < p}, {Exp[3 * (a - p)], a ≥ p}}]

Plot this function for a given actual value:

Wolfram Language code: Plot[utility[0, p], {p, -1, 2}]

Train a predictor with this utility function:

Wolfram Language code: p2 = Predict[trainingset, UtilityFunction -> utility]

The predictor decision is now changed despite the probability density's being unchanged:

Wolfram Language code: p2[example]

Wolfram Language code: Plot[PDF[p2[example, "Distribution"]][x], {x, 1, 7}, PlotRange -> All]

Specifying a utility function when predicting supersedes the utility function specified at training:

Wolfram Language code: p2[example, UtilityFunction -> (DiracDelta[#2 - #1]&)]

Update the predictor utility:

Wolfram Language code: p3 = Predict[p2, UtilityFunction -> (DiracDelta[#2 - #1]&)]

Wolfram Language code: p3[example]

Visualize the distribution of age for the name "Claire" with the built-in predictor "NameAge":

Wolfram Language code: distribution = Predict["NameAge", "Claire", "Distribution"]

Wolfram Language code: Plot[PDF[distribution, x], {x, 0, 100}, Exclusions -> None, PlotRange -> All]

The most likely value of this distribution is the following:

Wolfram Language code: Predict["NameAge", "Claire"]

Change the utility function to predict the mean value instead of the most likely value:

Wolfram Language code: Predict["NameAge", "Claire", UtilityFunction -> Function[-(#2 - #1) ^ 2], IndeterminateThreshold -> 0]

ValidationSet (1)

Train a linear regression predictor on the "WineQuality" data:

Wolfram Language code: trainingset = ExampleData[{"MachineLearning", "WineQuality"}, "TrainingData"];

Wolfram Language code: p1 = Predict[trainingset, Method -> "LinearRegression"]

Obtain the L2 regularization coefficient of the trained predictor:

Wolfram Language code: Information[p1, "L2Regularization"]

Specify a validation set:

Wolfram Language code: validationset = ExampleData[{"MachineLearning", "WineQuality"}, "TestData"];

Wolfram Language code: p2 = Predict[trainingset, ValidationSet -> validationset, Method -> "LinearRegression"]

A different L2 regularization coefficient has been selected:

Wolfram Language code: Information[p2, "L2Regularization"]

Applications (6)

Basic Linear Regression (1)

Train a predictor that predicts the median value of properties in a neighborhood of Boston, given some features of the neighborhood:

Wolfram Language code: p = Predict[ExampleData[{"MachineLearning", "BostonHomes"}, "TrainingData"], PerformanceGoal -> "Quality"]

Generate a PredictorMeasurementsObject to analyze the performance of the predictor on a test set:

Wolfram Language code: pm = PredictorMeasurements[p, ExampleData[{"MachineLearning", "BostonHomes"}, "TestData"]]

Visualize a scatter plot of the values of the test set as a function of the predicted values:

Wolfram Language code: pm["ComparisonPlot"]

Compute the root mean square of the residuals:

Wolfram Language code: pm["StandardDeviation"]

Weather Analysis (1)

Load a dataset of the average monthly temperature as a function of the city, the year, and the month:

Wolfram Language code:

dataset = RandomSample[{#2, ToExpression[#3], #4} -> (#1 - 32) / 1.8& @@@ExampleData[{"Statistics", "USCityTemperature"}]];

Visualize a sample of the dataset:

Wolfram Language code: RandomSample[dataset, 5] // TableForm

Train a linear predictor on the dataset:

Wolfram Language code: p = Predict[dataset, Method -> "LinearRegression"]

Plot the predicted temperature distribution of the city "Lincoln" in 2020 for different months:

Wolfram Language code:

Plot[{
	PDF[p[{"Lincoln", 2020, "January"}, "Distribution"], x], 
	PDF[p[{"Lincoln", 2020, "May"}, "Distribution"], x], PDF[p[{"Lincoln", 2020, "August"}, "Distribution"], x]
	}, {x, -10, 30}, PlotLegends -> {"January", "May", "August"}]

For every month, plot the predicted temperature and its error bar (standard deviation):

Wolfram Language code:

months = {"January", "February", "March", "April", "May", "June", "July", "August", "September", "October", "November", "December"};

Wolfram Language code:

distributions = MapIndexed[
	{First[#2], p[{"Lincoln", 2020, #1}, "Distribution"]}&, months];

Wolfram Language code:

ListPlot[
	{#1, Around[#2[[1]], #2[[2]]]}&@@@distributions, IconizedObject[«options»]]

Quality Assessment (1)

Load a dataset of wine quality as a function of the wines' physical properties:

Wolfram Language code: trainingset = ExampleData[{"MachineLearning", "WineQuality"}, "TrainingData"];

Visualize a few data points:

Wolfram Language code: RandomSample[trainingset, 3] // TableForm

Get a description of the variables in the dataset:

Wolfram Language code: ExampleData[{"MachineLearning", "WineQuality"}, "VariableDescriptions"]

Visualize the distribution of the "alcohol" and "pH" variables:

Wolfram Language code: {Histogram[trainingset[[All, 1, 11]], PlotLabel -> "alcohol"], Histogram[trainingset[[All, 1, 9]], PlotLabel -> "pH"]}

Train a predictor on the training set:

Wolfram Language code: p = Predict[trainingset]

Predict the quality of an unknown wine:

Wolfram Language code: unknownwine = {7.6, 0.48, 0.31, 9.4, 0.046, 6., 194., 0.99714, 3.07, 0.61, 9.4};

Wolfram Language code: p[unknownwine]

Create a function that predicts the quality of the unknown wine as a function of its pH and alcohol level:

Wolfram Language code: quality[pH_, alcohol_] := p[{7.6, 0.48, 0.31, 9.4, 0.046, 6., 194., 0.99714, pH, 0.61, alcohol}];

Plot this function to have a hint on how to improve this wine:

Wolfram Language code:

Show[Plot3D[quality[pH, alcohol], {pH, 2.8, 3.8} , {alcohol, 8, 14}, AxesLabel -> Automatic, Exclusions -> None], ListPointPlot3D[{{3.07, 9.4, p[unknownwine]}}, PlotStyle -> {Red, PointSize[.05]}]]

Interpretable Machine Learning (1)

Load a dataset of wine quality as a function of the wines' physical properties:

Wolfram Language code: wine = ResourceData["Sample Data: Wine Quality"];

Train a predictor to estimate wine quality:

Wolfram Language code:

p = Predict[wine -> "WineQuality"]
x = KeyDrop[wine, "WineQuality"];

Examine an example bottle:

Wolfram Language code: bottle = Last[x]

Predict the example bottle's quality:

Wolfram Language code: predictedquality = p[bottle]

Calculate how much higher or lower this bottle's predicted quality is than the mean:

Wolfram Language code:

meanquality = Information[p, "TrainingLabelMean"];
predictedquality - meanquality

Get an estimation for how much each feature impacted the predictor's output for this bottle:

Wolfram Language code: impacts = p[bottle, "SHAPValues"]

Visualize these feature impacts:

Wolfram Language code:

BarChart[Sort[impacts], ChartLabels -> Placed[Automatic, After], ImageSize -> Medium, BarOrigin -> Left, PlotLabel -> "Impact of Feature on Prediction"]

Confirm that the Shapley values fully explain the predicted quality:

Wolfram Language code:

Total[impacts]
meanquality + Total[impacts] == predictedquality

Learn a distribution of the data that treats each feature as independent:

Wolfram Language code: dist = LearnDistribution[x, Method -> {"Multinormal", "CovarianceType" -> "Diagonal"}]

Estimate SHAP value feature importance for 100 bottles of wine, using 5 samples for each estimation:

Wolfram Language code:

winebottles = RandomSample[x, 100];
shaps = p[winebottles, "SHAPValues" -> 5, MissingValueSynthesis -> dist];

Calculate how important each feature is to the model:

Wolfram Language code: importance = Mean[Abs[shaps]]

Visualize the model's feature importance:

Wolfram Language code:

BarChart[Sort[importance], ChartLabels -> Placed[Automatic, After], ImageSize -> Medium, BarOrigin -> Left, PlotLabel -> "Average Feature Impact"]

Visualize a nonlinear relationship between a feature's value and its impact on the model's prediction:

Wolfram Language code:

ListPlot[Thread[{Normal[winebottles][[All, "TotalSulfurDioxide"]], shaps[[All, "TotalSulfurDioxide"]]}], AxesLabel -> {"Feature Value", "Feature Impact"}, PlotMarkers -> Automatic]

Computer Vision (1)

Generate images of gauges associated with their values:

Wolfram Language code: trainingset = Image[AngularGauge[#]] -> #& /@ RandomReal[1, 300];

Wolfram Language code: Export["AngularGauge_example.mx", trainingset]

Wolfram Language code: RandomSample[trainingset, 3]

Train a predictor on this dataset:

Wolfram Language code: predictor = Predict[trainingset]

Predict the value of a gauge from its image:

Wolfram Language code: predictor[[image]]

Interact with the predictor using Dynamic:

Wolfram Language code:

Row[{AngularGauge[Dynamic[t]], Style["->", Large], Dynamic[Labeled[AngularGauge[predictor[Image[AngularGauge[t]]]], "(predicted value)"]]}, BaseStyle -> FontFamily -> "Sans Serif"]

Customer Behavior Analysis (1)

Import a dataset with data about customer purchases:

Wolfram Language code: dataset = IconizedObject[«customer data»];

Wolfram Language code: RandomChoice[dataset]

Train a "GradientBoostedTrees" model to predict the total spending based on the other features:

Wolfram Language code: model = Predict[dataset -> "Total_Spent", Method -> "GradientBoostedTrees"]

Use the model to predict the most likely spending by location:

Wolfram Language code: spendingByLocation = AssociationMap[model[<|"Location" -> #|>]&, dataset[Union, "Location"]]

Visualize the data on a map:

Wolfram Language code: GeoBubbleChart[spendingByLocation]

For the top three locations, estimate the spending amount as a function of the customer age:

Wolfram Language code: topCities = Take[spendingByLocation, 3]//Keys//Normal

Define an year range:

Wolfram Language code: years = Range@@dataset[MinMax, "Age"]//Normal;

Compute the model predictions:

Wolfram Language code:

predictions = Table[
	model[Table[<|"Age" -> y, "Location" -> city|>, {y, years}], "Distribution"], 
	{city, topCities}
	] /. {QuantityDistribution -> Quantity, NormalDistribution -> Around};

Create the dataset to plot:

Wolfram Language code: points = Table[Thread[{years, res}], {res, predictions}];

Visualize it:

Wolfram Language code:

ListLinePlot[points, Frame -> True, FrameLabel -> Automatic, IntervalMarkers -> "Bands", ImageSize -> Medium, PlotLegends -> topCities]

Properties & Relations (1)

The linear regression predictor without regularization and LinearModelFit can train equivalent models:

Wolfram Language code: data = Table[{i, RandomReal[{i - 1, i}]}, {i, 10}]

Wolfram Language code: p = Predict[Rule@@@data, Method -> {"LinearRegression", "L2Regularization" -> 0}]

Wolfram Language code: Information[p, "Function"]

Wolfram Language code: LinearModelFit[data, x, x]

Fit and NonlinearModelFit can also be equivalent:

Wolfram Language code: Fit[data, {1, x}, x]

Wolfram Language code: NonlinearModelFit[data, a + b x, {a, b}, x]

Possible Issues (1)

The RandomSeeding option does not always guarantee reproducibility of the result:

Train several predictors on the "WineQuality" dataset:

Wolfram Language code: dataset = ExampleData[{"MachineLearning", "WineQuality"}, "Data"];

Wolfram Language code: predictors = Table[Predict[dataset], 3];

Compare the results when tested on a test set:

Wolfram Language code: testset = ExampleData[{"MachineLearning", "WineQuality"}, "TestData"];

Wolfram Language code: SameQ@@(#[testset[[All, 1]]]& /@ predictors)

Neat Examples (1)

Create a function to visualize the predictions of a given method after learning from 1D data:

Wolfram Language code:

visualizePrediction[data_, method_] := Module[
	{p, predictionplot, dataplot, xs}, 
	dataplot = ListPlot[List@@@data, PlotStyle -> Red, PlotLegends -> {"Data"}];
	xs = data[[All, 1]];
	p = Predict[data, Method -> method];
	predictionplot = Plot[{
	p[x], 
	p[x] + StandardDeviation[p[x, "Distribution"]], p[x] - StandardDeviation[p[x, "Distribution"]]
	}, {x, Min[xs] - 1, Max[xs] + 1}, PlotStyle -> {Blue, Gray, Gray}, Filling -> {2 -> {3}}, Exclusions -> False, PerformanceGoal -> "Speed", PlotLegends -> {"Prediction", "Confidence Interval"}];
	Show[predictionplot, dataplot, PlotLabel -> method, ImageSize -> 250]
	];

Try the function with the "GaussianProcess" method on a simple dataset:

Wolfram Language code: visualizePrediction[{-1.2 -> 1.2, 1.4 -> 1.4, 3.1 -> 1.8, 4.5 -> 1.6}, "GaussianProcess"]

Visualize the prediction of other methods:

Wolfram Language code:

Grid[Partition[visualizePrediction[{-1.2 -> 1.2, 1.4 -> 1.4, 3.1 -> 1.8, 4.5 -> 1.6}, #][[1, 1]]& /@ {"LinearRegression", "NearestNeighbors", "RandomForest", "NeuralNetwork"}, 2], Frame -> All, FrameStyle -> LightGray]

Top

	"name"	a built-in prediction function
	FittedModel[…]	a fitted model converted into a PredictorFunction[…]

Predict

Details and Options

Examples

Basic Examples (2)

Scope (24)

Data Format (7)

Data Types (13)

Numerical (3)

Nominal (3)

Quantities (1)

Text (1)

Colors (1)

Images (1)

Sequences (1)

Missing Data (2)

Information (4)

Options (23)

AcceptanceThreshold (1)

AnomalyDetector (1)

FeatureExtractor (2)

FeatureNames (2)

FeatureTypes (2)

IndeterminateThreshold (1)

Method (4)

MissingValueSynthesis (1)

PerformanceGoal (1)

RecalibrationFunction (1)

TargetDevice (1)

TimeGoal (2)

TrainingProgressReporting (1)

UtilityFunction (2)

ValidationSet (1)

Applications (6)

Basic Linear Regression (1)

Weather Analysis (1)

Quality Assessment (1)

Interpretable Machine Learning (1)

Computer Vision (1)

Customer Behavior Analysis (1)

Properties & Relations (1)

Possible Issues (1)

Neat Examples (1)

See Also

Related Guides

Related Links

History

Text

CMS

APA

BibTeX

BibLaTeX