Univariate Data Summary

In this section we look at how to summarize each column individually; e.g. taking the mean of a numerical column.

This section is structured into three parts as follows:

  1. Dataset Summary: Quickly obtaining a basic summary of each column in a Data Frame at once.
  2. Numeric Columns: Summarizing numeric columns e.g. obtaining the sum.
  3. Non Numeric Columns: Summarizing non-numeric columns (which may be of non-numeric data types such as factor, string, or date) e.g. obtaining the unique possible values.

In Multivariate Summary we will look at summarizing multiple columns together e.g. taking the mean of a numeric column for each group of rows where groups are defined by the value of a factor (categorical) column.

SQL
I/O