LogitModelFit

LogitModelFit[{{x₁,y₁},{x₂,y₂},…},{f₁,f₂,…},x]

constructs a binomial logistic regression model of the form that fits the y_i for each x_i.

LogitModelFit[data,{f₁,…},{x₁,x₂,…}]

constructs a binomial logistic regression model of the form where the f_i depend on the variables x_k.

LogitModelFit[{m,v}]

constructs a binomial logistic regression model from the design matrix m and response vector v.

Details and Options

LogitModelFit attempts to model the data using a linear combination of basis functions composed with a logistic sigmoid.
LogitModelFit is typically used in classification to model probability values.
LogitModelFit produces a generalized linear model of the form under the assumption that the original are independent realizations of Bernoulli trials with probabilities .
The function is the LogisticSigmoid.
LogitModelFit returns a symbolic FittedModel object to represent the logistic model it constructs. The properties and diagnostics of the model can be obtained from model["property"].
The value of the best-fit function from LogitModelFit at a particular point x₁, … can be found from model[x₁,…].
Possible forms of data are:

	{y₁,y₂,…}	equivalent to the form {{1,y₁},{2,y₂},…}
	{{x₁₁,x₁₂,…,y₁},…}	a list of independent values x_ij and the responses y_i
	{{x₁₁,x₁₂,…}y₁,…}	a list of rules between input values and responses
	{{x₁₁,x₁₂,…},…}{y₁,y₂,…}	a rule between a list of input values and responses
	{{x₁₁,…,y₁,…},…}n	fit the nth column of a matrix

With multivariate data such as ${{x_(11),x_(12),... ,y_(1)},{x_(21),x_(22),... ,y_(2)},...}$ , the number of coordinates x_i1, x_i2, … should equal the number of variables x_i.
The y_i are probabilities between 0 and 1.
Additionally, data can be specified using a design matrix without specifying functions and variables:
{m,v} a design matrix m and response vector v
In LogitModelFit[{m,v}], the design matrix m is formed from the values of basis functions f_i at data points in the form {{f₁,f₂,…},{f₁,f₂,…},…}. The response vector v is the list of responses {y₁,y₂,…}.
For a design matrix m and response vector v, the model is , where is the vector of parameters to be estimated.
When a design matrix is used, the basis functions f_i can be specified using the form LogitModelFit[{m,v},{f₁,f₂,…}].
LogitModelFit is equivalent to GeneralizedLinearModelFit with ExponentialFamily->"Binomial" and LinkFunction->Automatic.
LogitModelFit takes the same options as GeneralizedLinearModelFit, with the exception of ExponentialFamily and LinkFunction.

Examples

open allclose all

Basic Examples (1)

Define a dataset:

Fit a logistic model to the data:

Evaluate the model at a point:

Plot the data points and the models:

Scope (13)

Data (6)

Fit data with success probability responses, assuming increasing integer-independent values:

This is equivalent to:

Weight by the number of observations for each predictor value:

This gives the same best fit function as success failure data:

Fit a list of rules:

Fit a rule of input values and responses:

Specify a column as the response:

Fit a model given a design matrix and response vector:

See the functional form:

Fit the model referring to the basis functions as x and y:

Obtain a list of available properties:

Properties (7)

Data & Fitted Functions (1)

Fit a logit model:

Extract the original data:

Obtain and plot the best fit:

Obtain the fitted function as a pure function:

Get the design matrix and response vector for the fitting:

Residuals (1)

Examine residuals for a fit:

Visualize the raw residuals:

Visualize Anscombe residuals and standardized Pearson residuals in stem plots:

Dispersion and Deviances (1)

Fit a logit model to some data:

The estimated dispersion is 1 by default:

Use Pearson's as the dispersion estimator instead:

Plot the deviances for each point:

Obtain the analysis of deviance table:

Get the residual deviances from the table:

Parameter Estimation Diagnostics (1)

Obtain a formatted table of parameter information:

Extract the column of -statistic values:

Influence Measures (1)

Fit some data containing extreme values to a logit model:

Check Cook distances to identify highly influential points:

Check the diagonal elements of the hat matrix to assess influence of points on the fitting:

Prediction Values (1)

Fit a logit model:

Plot the predicted values against the observed values:

Goodness-of-Fit Measures (1)

Obtain a table of goodness-of-fit measures for a logit model:

Compute goodness-of-fit measures for all subsets of predictor variables:

Rank the models by AIC:

Generalizations & Extensions (1)

Perform other mathematical operations on the functional form of the model:

Integrate symbolically and numerically:

Find a predictor value that gives a particular value for the model:

Options (8)

ConfidenceLevel (1)

The default gives 95% confidence intervals:

Use 99% intervals instead:

Set the level to 90% within FittedModel:

CovarianceEstimatorFunction (1)

Fit a logit model:

Compute the covariance matrix using the expected information matrix:

Use the observed information matrix instead:

DispersionEstimatorFunction (1)

Fit a logit model:

Compute the covariance matrix:

Compute the covariance matrix estimating the dispersion by Pearson's :

IncludeConstantBasis (1)

Fit a logit model:

Fit the model with no constant term:

LinearOffsetFunction (1)

Fit data to a logit model:

Fit data to a model with a known Sqrt[x] term:

NominalVariables (1)

Fit the data treating the first variable as a nominal variable:

Treat both variables as nominal:

Weights (1)

Fit a model using equal weights:

Give explicit weights for the data points:

WorkingPrecision (1)

Use WorkingPrecision to get higher precision in parameter estimates:

Obtain the fitted function:

Reduce the precision in property computations after the fitting:

Properties & Relations (4)

A default "Binomial" model from GeneralizedLinearModelFit is equivalent to the model for LogitModelFit:

ProbitModelFit is equivalent to a "Binomial" model with "ProbitLink":

LogitModelFit assumes binomially distributed responses:

NonlinearModelFit assumes normally distributed responses:

The fits are not identical:

LogitModelFit will use the time stamps of a TimeSeries as variables:

Rescale the time stamps and fit again:

Find fit for the values:

LogitModelFit acts pathwise on a multipath TemporalData:

Possible Issues (1)

Responses outside the interval from 0 to 1 are not valid for logit models:

Top

LogitModelFit

Details and Options

Examples

Basic Examples (1)

Scope (13)

Data (6)

Properties (7)

Data & Fitted Functions (1)

Residuals (1)

Dispersion and Deviances (1)

Parameter Estimation Diagnostics (1)

Influence Measures (1)

Prediction Values (1)

Goodness-of-Fit Measures (1)

Generalizations & Extensions (1)

Options (8)

ConfidenceLevel (1)

CovarianceEstimatorFunction (1)

DispersionEstimatorFunction (1)

IncludeConstantBasis (1)

LinearOffsetFunction (1)

NominalVariables (1)

Weights (1)

WorkingPrecision (1)

Properties & Relations (4)

Possible Issues (1)

See Also

Tech Notes

Related Guides

History

Text

CMS

APA

BibTeX

BibLaTeX