How to | Perform Calculations on Columns of Data
You will often need to perform calculations on the columns in a dataset, particularly when the columns represent variables. While some functions automatically operate on columns of data when a rectangular array is given, others may require some manipulation of the data in order to operate on the columns.
Create some data to work with (
SeedRandom ensures a predictable result):
| Out[1]= |  |
Mathematica characterizes data by grouping lists within other lists. Every list is interpreted as a row within the matrix of data:
Out[2]//MatrixForm= |
| |  |
The
Grid function displays data in the same fashion, only without the braces:
| Out[3]= |  |
By default many functions operate on each column when a rectangular list of lists is given as the argument.
Find the mean of each column:
| Out[4]= |  |
Find the standard deviation of the columns:
| Out[5]= |  |
Find the median of each column:
| Out[6]= |  |
You can also select individual columns for calculations. Here, the first column from
data is selected:
| Out[7]= |  |
| Out[8]= |  |
| Out[9]= |  |
| Out[10]= |  |
For matrices with more than two columns, plot the rows as separate data sets:
| Out[11]= |  |
Plot the columns by transposing the data:
| Out[12]= |  |
For functions that operate on vectors, map the function onto the transposed data to operate on columns:
| Out[13]= |  |
Transpose the result to get a matrix with normalized columns:
| Out[14]= |  |
Transposing and mapping can also be done for functions that flatten their argument:
| Out[15]= |  |