Compare the distribution of data to a theoretical distribution:
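One way to make such a comparison concrete is to measure the largest gap between the empirical CDF of the data and the CDF of the candidate distribution. The sketch below is a Python illustration under stated assumptions, not the original example: the sample is synthetic, and the helper names `ecdf` and `max_cdf_gap` are hypothetical.

```python
import random
from bisect import bisect_right
from statistics import NormalDist

def ecdf(data):
    """Return the empirical CDF of `data` as a function of x."""
    xs = sorted(data)
    n = len(xs)
    def F(x):
        # Fraction of observations less than or equal to x.
        return bisect_right(xs, x) / n
    return F

def max_cdf_gap(data, dist):
    """Largest absolute gap between the empirical CDF and dist.cdf
    (the Kolmogorov-Smirnov statistic for continuous data)."""
    xs = sorted(data)
    n = len(xs)
    gap = 0.0
    for i, x in enumerate(xs):
        t = dist.cdf(x)
        # The empirical CDF jumps from i/n to (i+1)/n at x.
        gap = max(gap, (i + 1) / n - t, t - i / n)
    return gap

# Synthetic sample (an assumption; the original data is not shown).
rng = random.Random(0)
sample = [rng.gauss(0.0, 1.0) for _ in range(500)]

print(max_cdf_gap(sample, NormalDist(0, 1)))  # small gap: good match
print(max_cdf_gap(sample, NormalDist(1, 1)))  # larger gap: poor match
```

A small gap indicates that the theoretical distribution tracks the data closely; a gap that stays large as the sample grows suggests a mismatch.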

Compare multivariate data to a theoretical distribution:

The difference:

Produce a smoothed representation with SmoothKernelDistribution:
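The idea behind kernel smoothing can be sketched by placing a small Gaussian bump on each data point and averaging the bumps. The Python sketch below is illustrative only: the Gaussian kernel and the normal-reference bandwidth rule are assumptions chosen for simplicity, the sample is synthetic, and `gaussian_kde` is a hypothetical helper, not a description of how SmoothKernelDistribution is implemented.

```python
import math
import random
from statistics import stdev

def gaussian_kde(data, bandwidth=None):
    """Return a smoothed density estimate built from Gaussian kernels."""
    n = len(data)
    if bandwidth is None:
        # Normal-reference rule of thumb (an assumed default).
        bandwidth = 1.06 * stdev(data) * n ** (-1 / 5)
    norm = 1.0 / (n * bandwidth * math.sqrt(2 * math.pi))
    def pdf(x):
        # Average of Gaussian bumps centered on the observations.
        return norm * sum(
            math.exp(-0.5 * ((x - xi) / bandwidth) ** 2) for xi in data
        )
    return pdf

# Synthetic sample (an assumption; the original data is not shown).
rng = random.Random(1)
data = [rng.gauss(0.0, 1.0) for _ in range(200)]
density = gaussian_kde(data)

# The estimate is a proper density: it integrates to about 1.
step = 0.05
total = sum(density(-10 + k * step) for k in range(401)) * step
print(total)
```

A larger bandwidth gives a smoother but less detailed curve; a smaller one follows the data more closely at the cost of more wiggle.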

Using HistogramDistribution with bin delimiters set to the data creates a linear interpolation of EmpiricalDistribution:

Ten letters published in 1861 under the name Quintus Curtius Snodgrass are claimed to have been authored by Mark Twain. Compare the word length distribution for the letters to some works by Mark Twain:

Comparison to the English language in general emphasizes the similarity:

A test for goodness of fit suggests, however, that Twain did not write the QCS letters:

Compare the distributions of winning times in Scottish hill races for those who take the high road and those who take the low road:

It appears that it is faster to take the low road:

The National Institutes of Health estimates that 2% of the population has a certain disease. A test for the disease is proposed that detects its presence 95% of the time with a false positive rate of 5%. Given that a patient tests positive, find the probability that he or she actually has the disease:

Equations for the unknown probabilities based on the information given:

Solve the equations assuming the probabilities sum to unity:

The probability that a patient has the disease given a positive test result:
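The same result can be checked directly with Bayes' rule, using only the three figures given in the problem statement. This is a minimal Python sketch with exact rational arithmetic; the variable names are illustrative, and it applies Bayes' rule in closed form rather than solving a system of equations.

```python
from fractions import Fraction

# Figures taken from the problem statement.
prevalence = Fraction(2, 100)            # P(disease)
sensitivity = Fraction(95, 100)          # P(positive | disease)
false_positive_rate = Fraction(5, 100)   # P(positive | no disease)

# Joint probabilities of a positive test with and without the disease.
p_pos_and_disease = sensitivity * prevalence
p_pos_and_healthy = false_positive_rate * (1 - prevalence)

# Bayes' rule: P(disease | positive).
posterior = p_pos_and_disease / (p_pos_and_disease + p_pos_and_healthy)

print(posterior)                    # 19/68
print(round(float(posterior), 4))   # 0.2794
```

Even with a fairly accurate test, fewer than three in ten positive results correspond to actual disease, because the healthy group is so much larger than the affected one.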

A group of 21 students was selected at random to participate in a new directed reading program. A control group of 23 students was educated with traditional methods. Reading test scores for students in the two groups were recorded following their programs. Perform a permutation-based test on the scores to determine if the directed reading program was successful:

The mean difference in test scores across the groups can be used as a test statistic:

Simulate the null distribution of the test statistic by randomly permuting the groups:

At the 5% level there is evidence that the new program made a difference:
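The permutation procedure described above can be sketched as follows in Python. The reading scores are not reproduced in the text, so the two groups here are synthetic placeholders with the stated sizes (21 and 23), and the resulting p-value says nothing about the actual study. The function name `permutation_p_value` and the one-sided alternative (treated scores higher) are assumptions.

```python
import random

def permutation_p_value(treated, control, n_perm=5000, seed=0):
    """One-sided permutation test on the difference in group means.

    Returns (observed difference, estimated p-value)."""
    rng = random.Random(seed)
    k = len(treated)
    pooled = list(treated) + list(control)

    def mean_diff(values):
        first, rest = values[:k], values[k:]
        return sum(first) / len(first) - sum(rest) / len(rest)

    observed = mean_diff(pooled)
    hits = 0
    for _ in range(n_perm):
        # Under the null hypothesis the group labels are exchangeable.
        rng.shuffle(pooled)
        if mean_diff(pooled) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return observed, (hits + 1) / (n_perm + 1)

# Synthetic placeholder scores (NOT the study data).
rng = random.Random(42)
treated_scores = [rng.gauss(52, 11) for _ in range(21)]
control_scores = [rng.gauss(42, 17) for _ in range(23)]

observed, p_value = permutation_p_value(treated_scores, control_scores)
print(observed, p_value)
```

A p-value below 0.05 would correspond to rejecting the null hypothesis of no difference at the 5% level.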

